Description:
This course introduces students to working with large datasets, with a specific focus on real-world healthcare data. Building on Data Science Primer I, it extends the learner’s knowledge of working with APIs and databases and covers strategies for working more effectively with large datasets and for using healthcare data responsibly. Learners will gain practical knowledge and have opportunities to practice working with diverse and complex healthcare data.
Please note that basic familiarity with Python is expected before starting this course. This includes a grasp of programming fundamentals in Python (loops, functions, and different data types), familiarity with common data formats, and the basics of data wrangling using Python libraries such as Matplotlib and pandas.
We strongly recommend signing up for the Data Science Primer I course to obtain a baseline competence with Python: http://www.eventreg.purdue.edu/online/DataSciencePrimerI
- For registration and course access questions, please email: noncredit@purdue.edu
- For course content questions, please email: datascihcp@purdue.edu