4-Month Data Science Mastery Roadmap
4-Month Data Science Mastery Roadmap (6-8 hrs/day)
Goal: Become job-ready for Data Science roles with portfolio-ready projects, hands-on tools, and conceptual
strength.
MONTH 1: Core Foundation - Python, Math & Statistics
Week 1: Python Fundamentals
- Syntax, Variables, Data Types, Loops, Functions
Resources:
- 100 Days of Code (Hindi) - CodeWithHarry
- 100 Days Python (English) - Udemy
Week 2: OOPs, File Handling, Error Handling
- Classes, Objects, Inheritance, Reading/Writing files
Week 3: Statistics & Probability
- Mean, Median, Mode, Variance, Probability, Distributions
Resources:
- Linear Algebra Notes
- Statistics Basics Video
- Book: Probability and Statistics in Engineering by Hines
Week 4: Optimization + Graphs
- Basics of dy/dx, Gradient Descent, Normal Distribution
- Play with Graphs - Amit Aggarwal
MONTH 2: Data Handling & Visualization
Week 5: NumPy
- Arrays, Indexing, Vectorized Operations
Week 6: Pandas
- DataFrames, Series, Aggregation, Filtering, Missing Values
Week 7: Data Visualization
- Matplotlib, Seaborn, Pairplots, Heatmaps, Distributions
Week 8: Project 1: Exploratory Data Analysis (EDA)
- Use Pandas + Seaborn to explore a real dataset
- Tools: pandas-profiling, seaborn, matplotlib
MONTH 3: Databases, Git, ML Basics
Week 9: SQL + NoSQL Basics
- MySQL: SELECT, JOIN, GROUP BY
- MongoDB: PyMongo, NoSQL Concepts
Week 10: Git + Linux Essentials
- GitHub, Commits, Branches, SSH Keys, Bash commands
Week 11: Machine Learning Intro
- Supervised vs Unsupervised, Regression, Classification
Week 12: Project 2: Regression or Classification ML Model
- Use sklearn, pandas, matplotlib on real-world dataset
- Build model, evaluate, visualize metrics
MONTH 4: End-to-End Projects & Advanced Tools
Week 13: Model Tuning + Pipelines
- GridSearchCV, Cross-validation, Data Pipelines in sklearn
Week 14: Web Scraping (Bonus Tool)
- BeautifulSoup, Web Automation with Requests
Week 15: Cloud + BI Tools (Optional)
- AWS Free Tier, Tableau/PowerBI overview
Week 16: Final Project + Resume/GitHub Setup
- End-to-End ML Project using structured workflow
- Upload to GitHub, write README, polish resume
Final Outcome:
- 3+ Projects (EDA, ML, End-to-End)
- Portfolio-ready GitHub repo
- Resume-ready skillset in Python, Stats, ML, SQL, Git
- Confident to apply for Data Analyst, ML Intern, or Junior DS roles