The University Senate of Michigan Technological University

 

Proposal 20-14
(Voting Units: Academic)

 

“Proposal for a New Non-Departmental Graduate Certificate in Data Science”

 February 28, 2014

 

 

Contacts: Laura Brown (Computer Science), Mari W. Buche (School of Business

& Economics), Gowtham S (Information Technology Services), Timothy Havens

(Electrical & Computer Engineering/Computer Science), Jacqueline Huntoon

(Graduate School), Saeid Nooshabadi (Electrical & Computer

Engineering/Computer Science), and Allan Struthers (Mathematics)

 

e-mail: datascience@mtu.edu

 

 

 

Executive Summary

 

This proposal describes a plan for a new Graduate Certificate in Data Science. This program will augment the proposal for a Master of Science (M.S.) in Data Science. Like its M.S. counterpart the Graduate Certificate in Data in Science the has three main objectives: i) to attract students from various disciplines who wish to learn the basics of data analysis, data science, and computing tools; ii) to teach students basic skills in communication and build their awareness of business contexts; and iii) to provide students the opportunity to gain the basic skills that give them the ability to analyze large data sets, including Big Data.

 

Our goal is to have all the core data science courses and most approved data science

courses offered online by 2016, which we note would allow off-campus students to fully

complete the Graduate Certificate in Data Sciences with online offerings.

 

 

 

 

 

 

 

1. Background

The proposed Graduate Certificate is offered as subset of M.S. in Data Science and is

offered to science and engineering graduates who wish to upgrade their qualification to

be able to work in a profession with a primarily role to manage and analyze data.

 

2. Justification and Estimated Market

The program we are proposing will significantly increase the number of data scientists that

Michigan Tech can offer to the workforce. The Graduate Certificate program in Data

Science like its M.S. counterpart will provide students with strong academic training in data

analysis in a range of areas (e.g., physical sciences, geosciences, geoinformatics,

bioinformatics, cheminformatics, environmental, social sciences, business and commerce)

while at the same time introduce essential business acumen, communication and teamwork skills highly valued by industry and government.

 

The proposed program emphasizes data analytics from a general perspective, but the skills to be learned are applicable to a diverse range of areas, including business analytics, computer science and engineering, and informatics. To support the interdisciplinary nature of the Data Science program, applications from multiple areas will be included in the coursework.

 

The proposal Data Science program is in line with Michigan Tech strategic plan1 to “be a

leader in creating solutions for society's challenges through education and interdisciplinary

endeavors that advance sustainable economic prosperity…”

 

3. Competitive Analysis

Established computer science, business analytics, and statistics master’s degrees and

certificate programs already exist, both in the U.S. and abroad, and provide specializations in data mining and predictive analytics. However, despite interest and recognized need, there are as yet only a few programs dedicated to data science in the U.S. Further, the existing programs have been designed around business data with a less domain-specific scientific focus. These master’s programs include Northwestern’s new M.S. in Analytics, DePaul’s M.S. in Predictive Analytics, University of San Francisco’s M.S. in Analytics, LSU’s M.S. in Analytics, Rutgers’s Professional Science Master’s (PSM) of Business and Science in Analytics, and NCSU’s M.S. in Analytics (also a PSM program).

 

Finally, there is increased recognition by federal agencies that supporting Big Data research is important. For example, the National Institutes of Health (NIH) director, Dr. Francis Collins, recently convened a “Data and Informatics Working Group” that made several key recommendations aimed at fostering NIH sponsored research in Big Data. Other federal agencies have also signaled interest in Big Data research, including National Science Foundation, DARPA, Department of Energy, and Department of Defense.

 

 

 

____________________

1 STRATEGIC PLAN https://www.banweb.mtu.edu/pls/owa/strategic_plan2.p_display

 

4. Detailed Description of Graduate Certificate in Data Science

i. Title:

Graduate Certificate in Data Science

 

ii. Catalog description:

The non-departmental Data Science program at Michigan Tech provides a foundation for the emerging field of “Big Data” science, including the use of data mining, predictive analytics, cloud computing, and business skills, with a domain specific specialization. The main threads of analytic techniques, programming practice, domain knowledge, business acumen, and communication skills are intertwined in this program.

 

The Graduate Certificate in Data Sciences provides the basic skills in data analytics, data

management, business and communication skills. Entry into this program assumes basic

knowledge in statistical and mathematical techniques, programming, and communications.

 

iii. Credits:

Graduate Certificate in Data Science                        15 credits (minimum)

 

iv. Course work:

In accordance with Senate policy, the requirements for the interdisciplinary Graduate

Certificate in Data Sciences are a minimum 15 credits of coursework, including the required

12 credits of core courses and 3 credits of approved Data Science electives. All other

requirements are per Senate proposals 11-10 and 4-11. Because this is an interdisciplinary

certificate, a maximum of six credits can be earned at the 3000-4000 level.

 

Coursework Summary

 

Core courses for M.S. Data Science (12 credits):

 

The four required core 3-credit courses focus on basic skills in data science analytics, data

mining, and business analytics. These courses are:

● UN 5550 - Introduction to Data Science (3 credits)2

● MA 4790 - Predictive Modeling (3 credits)

● CS 4821 / MA 4795 - Data Mining (3 credits)

● BA 5200 - Information Systems Management and Data Analytics (3 credits)3

 

 

 

 

 

 

 

___________________________________

2 New course to be designed for Fall 2014; submitted to curriculum proposal (binder) Fall 2013. This

course will be administered by the Office of Dean of the Graduate School, and executed by Data Science

faculty across the campus.

3 Revise course BA 5200-Strategic IS Management; submitted to curriculum proposal (binder) Fall 2013

 

 

Approved Data Science elective courses for M.S. Data Science (minimum of 3 credits):

 

The remaining 3 credits for the graduate certificate must be taken from the approved 3-credit Data Science elective courses that are as part of the M.S. program in Data Science as below:

 

    CS 5841 / EE 5841 - Machine Learning (3 credits)4

 

    CS 5491 - Cloud Computing (3 credits)5

 

    CS 5471 Advanced Topics in Computer Security (3 credits)6

 

    MA 5781 - Time Series Analysis and Forecasting (3 credits)7

 

    BA 5740 - Managing Innovation & Technology (3 credits)

 

    PSY 5210 - Advanced Statistical Analysis and Design I (4 credits)

 

    FW 5083 - Bioinformatics Programming and Skills (3 credits)8

 

Foundational Prerequisite Requirement:

It is expected that students seeking enrollment in this program will have sufficient foundational skills and aptitude in computer programming, statistical analysis, information systems and databases. The required foundational skills may have been obtained through formal academic qualifications, work experience, or a combination. Students will be encouraged to develop

their foundational skills before coming to Michigan Tech to start the graduate certificate program in Data Science. After taking the entrance assessment exam and evaluation of the students application, students will receive advice regarding their skill competence and may be required to take specific foundational courses (Appendix I) as necessary to acquire the required level of foundational skills.   As students matriculate in the program, their assigned advisors will continually monitor students progress to ensure that students are given all the necessary advice that they need to be successful in the program. Appendix I provides a list of foundational courses.

 

v.Online delivery:

Our goal is to have all the core data science courses and most approved data science courses offered online by 2016, which we note would allow off-campus students to fully complete the Graduate Certificate in Data Sciences with online offerings. Note that BA 5200 - Information Systems Management and Business Analytics will be offered as an online course starting in 2014. Additionally, the approved Data Science courses, CS 5841 / EE 5841 - Machine Learning and CS 5491 - Cloud Computing, will be offered as online courses in 2016.

 

 

4 New course to be designed for Spring 2015; submitted to curriculum proposal (binder) Fall 2013

5 New course to be designed for Spring 2015; submitted to curriculum proposal (binder) Fall 2013

6 New course to be designed for Spring 2015; submitted to curriculum proposal (binder) Fall 2013

7 Graduate version of MA 4780, submitted to curriculum proposal (binder) Fall 2013, to be offered as a split-level undergraduate/ graduate course. The graduate version of this course contains additional theoretical material and substantial project work.

8 Graduate version of FW 4099, submitted to curriculum proposal (binder) Fall 2013, to be offered as a

split-level undergraduate/ graduate course. The graduate version of this course contains additional theoretical material and substantial project work.


vi. Description of new or revised Data Science courses (offered as part of

M .S. in Data Science program):

 

 

All new (or revised) courses were added (modified) in the curriculum proposal (binder)

process of Fall 2013.

 

UN 5550 - Introduction to Data Science (new ) (3 credits)

 

This course provides an introduction to Big Data concepts, with focus on data management, data modeling, visualization, security, cloud computing, and data science from different perspectives: computer science, business, social science, bioinformatic, engineering, etc. This course also introduces the tools for data analytics such as SPSS Modeler, R, SAS, Python, and MATLAB. It involves two case study projects, each of which is integrated with communication and business skills.

 

BA 5200 - Information Systems Management and Data Analytics (revision)  (3 credits)

 

BA 5200 Focuses on management of Information Systems /Information Technology within

the business environment. Topics include Information Technology infrastructure and

architecture, organizational impact of innovation, change management, human-machine

interaction, and contemporary management issues involving data analytics. Class format

includes lecture, group discussion, and integrative case studies.9

 

CS 5841 / EE 5841 - Machine Learning (new) (3 credits)

This course will explore the foundational techniques of machine learning. Topics are pulled

from the areas of unsupervised and supervised learning. Specific methods covered include

naive Bayes, decision trees, support vector machines (SVMs), ensemble, and clustering

methods.

 

CS 5471 - Advanced Topics in Computer Security (new) (3 credits)

This course covers various aspects of producing trusted computer information systems.

Topics may vary; network perimeter protection, host-level protection, authentication technologies, formal analysis techniques, and intrusion detection will be emphasized.  Current systems will be examined and critiqued.

 

 

_________________________

9 Expanded description of BA 5220: This course is a restructuring of the existing course BA 5200 - Strategic IS Management to achieve a more acute focus on data analytics. The course incorporates experiential application of methods and analysis of business case studies focusing on contemporary issues in data analytics (i.e., Big Data) to include comprehension of business and organizational context, visualization and interpretation of results, reporting of outcomes from data analytics, evaluation of alternative techniques, and other current topics.  Multiple online resources will be employed, including Teradata University. Students in this class will utilize open source software (e.g. Hadoop and NoSQL), developing skills applicable to industry. Ethical foundations and managerial constraints will be integrated throughout the course

 

CS 5491 - Cloud Computing (new )  (3 credits)

 

This course provides an overview of the principles, methods, and leading technologies of cloud computing technologies. Topics include cloud computing concepts and architecture: Hadoop, MapReduce; standards; implementation strategies; Software as a Service (SaaS); Platform as a Service (PaaS); Infrastructure as a Service (IaaS); workload patterns and resource management; migrating to the cloud; and case studies and best practices. Students in this class will build their own cloud application using services from providers such as Amazon or IBM.

 

 

 

5. Estimated Costs For Financial Evaluation

The Graduate Certificate in Data Science program is a subset of M.S. in Data Science program, and does not incur any cost in addition and beyond the M.S. program. The approval of Graduate Certificate program must be subject to the approval of the M.S. program.

 

 

 

6. Planned Implementation Date

This program has an anticipated start in Fall semester, 2014. This program will be offered as a regular program. The program will be extended into an online program as soon as it is established and practical to do so. We envision a start date of Fall 2016 for the online

delivery of this program.

 

 

 

7. Program Governance

Like other non-departmental and interdisciplinary programs at Michigan Tech, the Data

Science program will be administered through the Graduate School, which will have the overall responsibility and final oversight for the program. The program will have the same

management structure that governs the M.S. in Data Science program.

 

 

 

 

 

 

Appendix I: Foundational Skills Courses

 

It is expected that students seeking enrollment in this program will have sufficient foundational skills and aptitude in computer programming, statistical analysis, information systems and databases.

 

 

For those students who need additional training in these areas, the courses listed below will help build skills necessary for successful completion of the certificate.  Not all students will need to take these courses as such the foundational courses are not required.  Note, for students coming from a Bachelors program at Michigan Tech, the foundational courses do not double-count for both the B.S/B.A. program and the Graduate Certificate in Data Science.

 

 

Note that 2000 level courses listed here cannot be counted towards the requirement for the Graduate Certificate in Data Science degree, but may be necessary for a given student to build their foundational knowledge.

 

 

    MA 2330 - Introduction to Linear Algebra (Credits: 3)

 

    MA 3710 - Engineering Statistics (Credits: 3)

 

    MA 3715 - Biostatistics (Credits: 3)

 

    MA 3740 - Statistical Programming and Analysis (Credits: 3)

 

    MIS 2000 - IS/IT Management (Credits: 3)

 

    MIS 2100 - Introduction to Business Programming (Credits: 3)

 

    MIS 3100 - Business Database Management (Credits: 3)

 

    MKT 3600 - Marketing Research (Credits: 3)

 

    CS 2321 - Data Structures (Credits: 3)

 

    CS 3425 - Database (Credits: 3)

 

    SAT 3002 - Application Programming Introduction (Credits: 3) 9

 

    SAT 3210 - DB Management (Credits: 3)10

 

    SAT 4600 - Web Application Development (Credits: 3)11

 

 

 

 

 

 

 

 

9 New 3-credit course designed for Fall 2014

10 Summer offerings available.

11 New 3-credit course designed for Spring 2015

 

 

 

 

Introduced to Senate: 05 March 2014

Approved by Senate: 26 March 2014

Approved by Administration: 03 April 2014