Description:
This course introduces students to working with really large datasets, with a specific focus on real-world healthcare-related data. The course builds off of the Data Science Primer I to extend the learner’s knowledge of working with APIs and databases as well as strategies for more effectively working with large datasets and responsibly using healthcare data. Learners will gain practical knowledge and have an opportunity to practice working with diverse and complex healthcare data.
Please note that a basic familiarity of Python is expected before starting this course including a grasp of programming fundamentals in Python, including loops, functions and different data types, familiarity working with common data formats, and the basics of data wrangling using Python libraries including MatPlotLib and Pandas.
For a limited time, learners who register for this course will automatically receive free access to Data Science Primer I (a $500 value), making it easy to prepare with no additional cost.
We strongly recommend completing the Data Science Primer I to obtain baseline competence with Python.
Access to course materials is available for 365 days from registration.
- For registration and course access questions, please email: noncredit@purdue.edu
- For course content questions, please email: datascihcp@purdue.edu