Skip to main content
Online Registration
Become a data-driven professional with Purdue's online certification in Data Analytics.

Purdue Data Analytics Certification Online

Purdue’s Data Analytics Certificate aims to bring big insights for your company and your career. We do this by taking you through a project-based curriculum where you will learn by doing. Purdue’s Data Analytics Certificate, in collaboration with XTOL, will turn you into a data-driven professional.

As the amount of data that businesses collect continues to increase, insights can no longer be effectively derived manually. The online Data Analytics Certificate teaches you how to find valuable insights from quantitative and qualitative data collected on a massive scale.

You will learn machine learning techniques to solve a broad range of data science problems by interpreting the results to make predictions and establish the reliability of those predictions. All while developing a data analytics portfolio to showcase and highlight your skills.

Data Analytics Info Session

Upcoming Start Dates: March 2 and March 18, 2021

The job market for data professionals continues to rise with the World Economic Forum stating that 85% of companies are planning on adopting big data technology by 2022. Join us to learn more about the program and the benefits of earning your Data Analytics Certificate.

Register Now

This 100% online program is project-based and takes a learn-by-doing approach. You will be supported by an expert mentor and work with a small team of fellow students to move through the courses. In addition to data analytics and data science training, you will get career coaching designed to help you land a job. From mock interviews to resume assistance to portfolio development, your mentor will ensure that you can showcase your skills and experience.

What you will learn in the Data Analytics certificate program:

  • Apply Python, R, SQL, Spark, and Amazon Web Services to solve data science problems
  • Translate business objectives into analytical opportunities using data mining
  • Become broadly competent in the use and evaluation of the statistical machine learning techniques classification, regression and association
  • Construct, justify, and apply custom data science processes
  • Apply parametric and nonparametric machine learning models and avoid overfitting models

Courses you will be taking:

In this course, you’ll learn how to use statistical machine learning techniques to understand the relationship between customer demographics and purchasing behavior, and then develop a model for predicting the future sales volume of products. You will learn how to apply the Python programming language to solve data analytics and machine learning problems.

Data Analytics Tasks:

  • Analyze differences in customer behavior
  • Uncover relationships in purchasing behavior

Tools used: Python, Jupyter Notebook, Matplotlib, Pandas, Scikit-learn

Course Topics:

  • Identifying types of business problems for which data analysis can provide significant insights in support of business decision-making
  • Translating business objectives into analytical opportunities using data mining
  • Applying statistical machine learning tools and methods to different kinds of data
  • Preprocessing data (e.g., feature engineering, addressing missing data, transforming numeric values to nominal values, discretizing data)
  • Communicating data mining results to management and other nontechnical audiences
  • Using data mining tools to investigate patterns in complex data sets
  • Using decision tree classifiers to investigate classification problems
  • Applying cross-validation methods
  • Interpret and draw inferences from the results of data mining
  • Applying Python to data science problems

In this course, you will continue to use the Python programming language to perform machine learning and data science tasks. More specifically, you will use Pandas, NumPy, Matplotlib and Sci-kit learn to develop machine learning and predictive analytics solutions.

Data Analytics Tasks:

  • Develop predictive models for appropriate customer credit limits

Tools used: Python, Jupyter Notebook, Matplotlib, Pandas, SciKit-Learn, SQL

Course Topics:

  • Using the Sci-kit learn machine learning library for Python
  • Identifying and solving collinearity through feature engineering and feature selection
  • Constructing, justifying, and applying custom data science processes
  • Drawing relationships between learner performance and measured features to help understand model performance
  • Conducting feature selection to investigate the correlation between different features in a dataset
  • Defining the business purpose of a data analytics project and making a principled, realistic analysis plan
  • Assessing the predictive performance of classifiers by examining key error metrics
  • Identifying where learning methods fail and gaining insight into why with error analysis
  • Selecting and justifying appropriate types of data analysis and statistical procedures
  • Using data mining tools and different classifiers (e.g., k-nearest neighbor, decision trees, support vector machines) to develop predictive models

In this course, you will learn to use statistical machine learning techniques to predict brand preferences based on customer characteristics, and then you will develop a model to recommend new products based on customer purchases. You will learn and use the R statistical programming language and the RStudio analytics environment.

Data Analytics Tasks:

  • Develop predictive models for customer brand preferences
  • Build a recommender system using association rules

Tools used: R, RStudio, and machine learning packages, SQL

Course Topics:

  • Presenting nontechnical data mining results to management
  • Becoming broadly competent in the use and evaluation of the statistical machine learning techniques classification, regression and association
  • Applying parametric and non-parametric machine learning models
  • Avoiding overfitting models
  • Optimizing classifiers by adjusting and testing classifier parameters
  • Applying data analytics in e-commerce (e.g., understanding customer behavior, segmenting customers by key demographics, selecting new products strategically and predicting their profitability).
  • Managing data in R with Tidyverse
  • Comparing and selecting different predictive models
  • Applying predictive models to test sets
  • Exploring data using visualization techniques and descriptive statistics in R using ggplot2 and plot.ly

In this course, you will continue to use the R statistical programming language and a variety of add-on “packages” to visualize data relationships and to implement classification and regression models for emerging engineering applications, such as understanding behavior in the “Internet of Things.”

Data Analytics Tasks:

  • Model smart energy usage
  • Develop an indoor locationing solution

Tools used: R packages for data visualization, SQL, time series analysis and machine learning

Course Topics:

  • Presenting highly technical, engineering-oriented data mining results to business management
  • Applying dimensionality reduction methods to broad datasets to reduce their complexity prior to modeling
  • Using the caret package for R
  • Choosing and evaluating classification modeling techniques in R
  • Choosing and evaluating regression modeling techniques in R
  • Analyzing time series data
    • Using data mining tools and different classifiers (e.g., k-nearest neighbor, decision trees, support vector machines) to develop predictive models
  • Interpreting performance metrics
  • Performing error analysis
  • Applying cross-validation and train-test split methods

In this course, you will learn how to mine and analyze extremely large datasets to provide insight into real-world business problems. You will conduct automated keyword extraction using natural language processing and use cloud-based computing to interpret the results and to make and communicate predictions of vital interest to business stakeholders.

Data Analytics Tasks:

  • Mine textual content from a large public corpus to conduct a web sentiment analysis of users’ attitudes towards various mobile phone platforms
  • Construct a sentiment analysis predictive model using natural language processing and machine learning

Tools used: Amazon Web Services, Elastic Map Reduce, Microsoft Azure, Databricks, Spark

Course Topics:

  • Acquiring, processing and analyzing extremely large data sets using cloud-based data mining methods to discover patterns or do data exploration
  • Operating the Amazon AWS cloud computing platform for data analysis
  • Discovering and mounting very large public datasets on the AWS platform
  • Setting up and running Elastic Map Reduce (EMR) instances for data analysis
  • Defining and creating a Spark cluster on Azure
  • Exploring and visualizing data using Databricks and PySpark
  • Performing sentiment analysis with an Azure Spark cluster using NLP
  • Doing data science in the Cloud
  • Assessing the predictive performance of classifiers by examining key error metrics
  • Using the Matplotlib and Seaborn Python 2D plotting libraries

In the Data Science Capstone, you will propose and design your own data science solution for a problem of your choosing. The capstone is an assessment of your ability to independently utilize the tools and methods needed to be a successful data scientist. This project will begin at the end of course two.

 

Register Now Program FAQ Download the Course Outline Data Analytics infographic, showing job outlook, pay scale, and job titles for data analytics professionals.

Request More Information


Register Now

Program at a Glance

Duration:

  • Full time: 20 weeks
  • Part time: 40 weeks

Modality: Online Instructor-led

Weekly Commitment:

  • Full time: 30 hours
  • Part time: 15 hours

Tuition:

  • Full time: $5,900
  • Part time: $6,900

10% Discount:

  • Register two weeks before the registration deadline. Use code: Analytics2021
  • Purdue Faculty/Staff/Alumni & Military, email noncredit@purdue.edu for more information.

Start Date: April 5, 2021

Who Should Enroll:

  • Those with career goals of becoming a data analyst or data scientist.
  • Professionals looking to gain extensive skills in data analytics.

Prerequisites:

  • Two years of work experience
  • Basic familiarity with Microsoft Office or an equivalent productivity suite

A background in statistics is helpful, but not required.

Contact: noncredit@purdue.edu

Purdue University, 610 Purdue Mall, West Lafayette, IN 47907, (765) 494-4600

© Purdue University | An equal access/equal opportunity university | Copyright Complaints | Maintained by ITaP

Trouble with this page? Disability-related accessibility issue? Please contact us at itap@purdue.edu.