Data Science Course Outline CES LUMS
Data Science Course Outline CES LUMS
Target Audience This course is ideal for aspiring and current data scientists, career switchers, and professionals looking to expand their
skills
Prerequisites (if any) Participants should have a basic working knowledge of Microsoft Excel and familiarity with handling data.
Course Description
This course seamlessly blends essential Python programming, hands-on data exploration, and practical machine learning concepts for a comprehensive
learning experience. Students will explore libraries like Pandas, NumPy, Matplotlib, and scikit-learn to manipulate, understand, and build predictive models
with your data, and will learn to leverage generative AI support for code generation, troubleshooting, and concept understanding.
LO1: Python Fluency: Demonstrate proficiency in core Python concepts (variables, data types, control flow, functions) for data science tasks.
LO2: Data Handling Expertise: Utilize Pandas to effectively import, clean, transform, and manipulate datasets for analysis and modelling.
LO3: Exploratory Analysis Mastery: Employ NumPy, Matplotlib, and Seaborn to calculate summary statistics and create informative visualizations,
extracting meaningful insights from data.
LO4: Machine Learning Foundations: Understand the principles of supervised learning and build basic linear regression and decision tree models
using scikit-learn. Evaluate model performance using appropriate metrics.
LO5: Process-Oriented Mindset: Apply a structured workflow to a data science project encompassing data cleaning, exploratory analysis, model
selection, and result interpretation.
Course Summary
Week 2 Python and Data Control Flow, Functions Practice project: Data cleaning and
Fundamentals (Cont.) Pandas data selection and transformation manipulation with Pandas
Week 3 EDA Understanding statistical data analysis concepts Quiz on EDA concepts
Summary Statistics
Data aggregation
Week 4 EDA (Cont.) Confidence interval and hypothesis testing EDA mini-project progress check-in
Visualization
Week 6 Model Building Model selection concepts, training, validation and testing Capstone Project: Peer feedback and
Capstone Project evaluation
Python:
o Learn Python (https://www.learnpython.org/)
o "Automate the Boring Stuff with Python" (https://automatetheboringstuff.com/)
Data Analysis and Visualization
o Kaggle Datasets (https://www.kaggle.com/datasets)
o Python Data Science Handbook (https://jakevdp.github.io/PythonDataScienceHandbook/)
o "Storytelling with Data" (https://www.storytellingwithdata.com/)
o Flourish (https://flourish.studio/)
Machine Learning
o "Introduction to Statistical Learning" (https://www.statlearning.com/)
o Towards Data Science Blog (https://towardsdatascience.com/)
Generative AI
o OpenAI API Documentation (https://beta.openai.com/docs)
o "Coding with ChatGPT" (https://medium.com/@tanyamarleytsui/coding-with-chatgpt-b50ab3fcb45f)
o Democratizing access to AI-enabled coding with Colab (https://blog.google/technology/ai/democratizing-access-to-ai-enabled-coding-with-
colab)