EPGP in Data Science (Curriculum)

The document outlines the curriculum for an Executive PG Programme in Data Science. It covers topics like data analysis using Excel, analytics problem solving, data analysis using Python, SQL, machine learning algorithms like linear regression, logistic regression, decision trees and clustering. The curriculum aims to provide foundational skills in data analysis, data visualization, statistics, SQL, and machine learning through both conceptual and hands-on learning over 15 courses spanning topics like Python, Excel, databases, statistics, linear regression, logistic regression, decision trees, clustering and natural language processing. The curriculum is designed by experts and is subject to change based on industry and university feedback.


Executive PG Programme in Data Science
COMMON CURRICULUM
PRE-PROGRAM PREPARATORY CONTENT
1. DATA ANALYSIS IN EXCEL

Taught by one of the most renowned data scientists in the country (S. Anand, CEO, Gramener), this module takes you from a beginner-level Excel user to an almost professional user.

1. INTRODUCTION TO EXCEL
2. DATA ANALYSIS IN EXCEL - I: FUNCTIONS, FORMULAE, AND CHARTS
3. DATA ANALYSIS IN EXCEL - II: PIVOTS AND LOOKUPS

2. ANALYTICS PROBLEM SOLVING

This module covers concepts of the CRISP-DM framework for business problem-solving.

1. THE CRISP-DM FRAMEWORK - BUSINESS AND DATA UNDERSTANDING
2. CRISP-DM FRAMEWORK - DATA PREPARATION, MODELLING, EVALUATION AND DEPLOYMENT

COURSE 1: DATA TOOLKIT


1. INTRODUCTION TO PYTHON

Build a foundation for the most in-demand programming language of the 21st century.
Duration: 2 WEEKS

1. UNDERSTANDING THE UPGRAD CODING CONSOLE
2. BASICS OF PYTHON
3. DATA STRUCTURES IN PYTHON
4. CONTROL STRUCTURE AND FUNCTIONS IN PYTHON
5. OOP IN PYTHON

*The Curriculum is subject to change as per the inputs from university or industry experts

2. PROGRAMMING IN PYTHON

Learn how to approach and solve logical problems using programming.
Duration: 1 WEEK

1. LOGIC AND SYNTAX BUILDING
2. DATA STRUCTURES: LISTS, STRINGS, DICTIONARIES, AND STACKS
3. TIME COMPLEXITY
4. SEARCHING AND SORTING
5. TWO POINTERS
6. RECURSION

3. PYTHON FOR DATA SCIENCE

Learn how to manipulate datasets in Python using Pandas, which is the most powerful library for data preparation and analysis.
Duration: 1 WEEK

1. INTRODUCTION TO NUMPY
2. INTRODUCTION TO MATPLOTLIB
3. INTRODUCTION TO PANDAS
4. GETTING AND CLEANING DATA

4. DATA VISUALIZATION IN PYTHON

Humans are visual learners and hence no task related to data is complete without visualisation. Learn to plot and interpret various graphs in Python and observe how they make data analysis and drawing insights easier.
Duration: 1 WEEK

1. INTRODUCTION TO DATA VISUALIZATION
2. DATA VISUALISATION USING SEABORN


5. EXPLORATORY DATA ANALYSIS

Learn how to find and analyse the patterns in the data to draw actionable insights.
Duration: 2 WEEKS

1. DATA SOURCING
2. DATA CLEANING
3. UNIVARIATE ANALYSIS
4. BIVARIATE ANALYSIS AND MULTIVARIATE ANALYSIS

6. CREDIT EDA CASE STUDY

Solve a real industry problem through the concepts learnt in exploratory data analysis.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

7. INFERENTIAL STATISTICS

Build a strong statistical foundation and learn how to ‘infer’ insights from a huge population using a small sample.
Duration: 1 WEEK

1. BASICS OF PROBABILITY
2. DISCRETE PROBABILITY DISTRIBUTIONS
3. CONTINUOUS PROBABILITY DISTRIBUTIONS
4. CENTRAL LIMIT THEOREM

8. HYPOTHESIS TESTING

Understand how to formulate and validate hypotheses for a population to solve real-life business problems.
Duration: 1 WEEK

1. CONCEPTS OF HYPOTHESIS TESTING - I: NULL AND ALTERNATE HYPOTHESIS, MAKING A DECISION, AND CRITICAL VALUE METHOD
2. CONCEPTS OF HYPOTHESIS TESTING - II: P-VALUE METHOD AND TYPES OF ERRORS
3. INDUSTRY DEMONSTRATION OF HYPOTHESIS TESTING: TWO-SAMPLE MEAN AND PROPORTION TEST, A/B TESTING


9. DATA ANALYSIS USING SQL

Data in companies is definitely not stored in Excel sheets! Learn the fundamentals of databases and extract information from an RDBMS using the Structured Query Language.
Duration: 1 WEEK

1. DATABASE DESIGN
2. DATABASE CREATION IN MYSQL WORKBENCH
3. QUERYING IN MYSQL
4. JOINS AND SET OPERATIONS

10. ADVANCED SQL & BEST PRACTICES

Apply advanced SQL concepts like windowing and procedures to derive insights from data and answer pertinent business questions.
Duration: 1 WEEK

1. WINDOW FUNCTIONS
2. CASE STATEMENTS, STORED ROUTINES AND CURSORS
3. QUERY OPTIMISATION AND BEST PRACTICES
4. PROBLEM-SOLVING USING SQL

11. SQL ASSIGNMENT: RSVP MOVIES

In this assignment, you will work on a movies dataset using SQL to extract exciting insights.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 2 - MACHINE LEARNING


1. LINEAR REGRESSION

Venture into the machine learning community by learning how one variable can be predicted using several other variables through a housing dataset where you will predict the prices of houses based on various factors.
Duration: 2 WEEKS

1. SIMPLE LINEAR REGRESSION
2. SIMPLE LINEAR REGRESSION IN PYTHON
3. MULTIPLE LINEAR REGRESSION
4. MULTIPLE LINEAR REGRESSION IN PYTHON
5. INDUSTRY RELEVANCE OF LINEAR REGRESSION


2. LINEAR REGRESSION ASSIGNMENT

Build a model to understand the factors on which the demand for bike-sharing systems depends and help a company optimise its revenue.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

3. LOGISTIC REGRESSION

Learn your first binary classification technique by determining which customers of a telecom operator are likely to churn and which are not, to help the business retain customers.
Duration: 2 WEEKS

1. UNIVARIATE LOGISTIC REGRESSION
2. MULTIVARIATE LOGISTIC REGRESSION: MODEL BUILDING AND EVALUATION
3. LOGISTIC REGRESSION: INDUSTRY APPLICATIONS

4. CLASSIFICATION USING DECISION TREES

Learn how the human decision-making process can be replicated using a decision tree and tune it to suit your needs.
Duration: 1 WEEK

1. INTRODUCTION TO DECISION TREES
2. ALGORITHMS FOR DECISION TREE CONSTRUCTION
3. HYPERPARAMETER TUNING IN DECISION TREES

5. UNSUPERVISED LEARNING: CLUSTERING

Learn how to group elements into different clusters when you don’t have any pre-defined labels to segregate them, through K-means clustering, hierarchical clustering, and more.
Duration: 1 WEEK

1. INTRODUCTION TO CLUSTERING
2. K-MEANS CLUSTERING
3. HIERARCHICAL CLUSTERING
4. OTHER FORMS OF CLUSTERING: K-MODE, K-PROTOTYPE, DBSCAN


6. BASICS OF NLP AND TEXT MINING

Do you get annoyed by the constant spam in your mailbox? Wouldn’t it be nice if we had a program to check your spelling? In this module, learn how to build a spell checker and spam detector using techniques like phonetic hashing, bag-of-words, TF-IDF, etc.
Duration: 1 WEEK

1. REGEX AND INTRODUCTION TO NLP
2. BASIC LEXICAL PROCESSING
3. ADVANCED LEXICAL PROCESSING

5. BUSINESS PROBLEM SOLVING

Learn how to approach open-ended real-world problems using data as a lever to draw actionable insights.
Duration: 1 WEEK

1. INTRODUCTION TO BUSINESS PROBLEM SOLVING
2. BUSINESS PROBLEM SOLVING: CASE STUDY DEMONSTRATIONS

7. CASE STUDY: LEAD SCORING

Help the Sales team of your company identify which leads are worth pursuing through this classification case study.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION


SPECIALISATION: DEEP LEARNING


COURSE 3 - ADVANCED MACHINE LEARNING
1. BAGGING & RANDOM FOREST

Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.
Duration: 1 WEEK

1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

2. BOOSTING

Learn about ensemble modelling through bagging and boosting and understand how weak algorithms can be transformed into stronger ones.
Duration: 1 WEEK

1. INTRODUCTION TO BOOSTING AND ADABOOST
2. GRADIENT BOOSTING

3. MODEL SELECTION & GENERAL ML TECHNIQUES

Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.
Duration: 1 WEEK

1. PRINCIPLES OF MODEL SELECTION
2. MODEL EVALUATION
3. MODEL SELECTION: BEST PRACTICES

4. PRINCIPAL COMPONENT ANALYSIS

Understand important concepts related to dimensionality reduction, the basic idea and the learning algorithm of PCA, and its practical applications on supervised and unsupervised problems.
Duration: 1 WEEK

1. PRINCIPAL COMPONENT ANALYSIS AND SINGULAR VALUE DECOMPOSITION
2. PRINCIPAL COMPONENT ANALYSIS IN PYTHON


5. ADVANCED REGRESSION

In this module, take a more advanced look at regression models and learn the concepts related to regularization.
Duration: 1 WEEK

1. GENERALIZED LINEAR REGRESSION
2. REGULARIZED REGRESSION

6. ADVANCED ML CASE STUDY

Build a regularized regression model to understand the most important variables to predict the house prices in Australia.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - ADVANCED MACHINE LEARNING AND DEEP LEARNING

1. TIME SERIES ANALYSIS

In this module, you will learn how to analyse and forecast a series that varies with time.
Duration: 2 WEEKS

1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. WORKING WITH STATIONARY TIME SERIES
3. END-TO-END ANALYSIS OF TIME SERIES

2. INTRODUCTION TO NEURAL NETWORKS AND ANN

Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural Networks or ANNs.
Duration: 3 WEEKS

1. STRUCTURE OF NEURAL NETWORKS
2. FEED FORWARD IN NEURAL NETWORKS
3. BACKPROPAGATION IN NEURAL NETWORKS
4. MODIFICATIONS TO NEURAL NETWORKS
5. HYPERPARAMETER TUNING IN NEURAL NETWORKS


3. NEURAL NETWORK ASSIGNMENT

Build a neural network from scratch in TensorFlow to identify the type of skin cancer from images.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5 - ADVANCED DEEP LEARNING AND COMPUTER VISION

1. CONVOLUTIONAL NEURAL NETWORKS

Learn the basics of CNN and OpenCV and how to classify image data using various architectures, which you will then implement using Python and Keras.
Duration: 2 WEEKS

1. INTRODUCTION TO CONVOLUTIONAL NEURAL NETWORKS
2. BUILDING CNNS WITH PYTHON AND KERAS
3. CNN ARCHITECTURES AND TRANSFER LEARNING
4. STYLE TRANSFER AND OBJECT DETECTION

2. CONVOLUTIONAL NEURAL NETWORKS - INDUSTRY APPLICATIONS

Apply CNNs to Computer Vision tasks like detecting anomalies in chest X-Ray scans.
Duration: 1 WEEK

1. INDUSTRY DEMONSTRATION: USING CNNS WITH FLOWERS IMAGES
2. INDUSTRY DEMONSTRATION: USING CNNS WITH X-RAY IMAGES


3. OBJECT DETECTION & IMAGE SEGMENTATION (OPTIONAL)

Learn the applications of DL in computer vision through industry-relevant detection algorithms such as RCNNs, YOLO and SSD.
Duration: 0 WEEK

1. FUNDAMENTALS OF OBJECT DETECTION
2. REGION-BASED DETECTORS
3. ONE-SHOT DETECTORS
4. CUSTOM OBJECT DETECTION
5. SEMANTIC SEGMENTATION

4. RECURRENT NEURAL NETWORKS

Ever wondered what goes on behind machine translation, sentiment analysis, and speech recognition? Learn how RNNs help in these areas, which involve sequential data like text, speech, videos, and a lot more.
Duration: 1 WEEK

1. WHAT MAKES A NEURAL NETWORK RECURRENT
2. VARIANTS OF RNNS: BIDIRECTIONAL RNNS AND LSTMS
3. BUILDING RNNS IN PYTHON

5. GESTURE RECOGNITION

Make a Smart TV system which can control the TV using the user’s hand gestures as the remote control.
Duration: 2 WEEKS

1. TWO ARCHITECTURES: 3D CONVS AND CNN-RNN STACK
2. UNDERSTANDING GENERATORS
3. STARTER CODE WALKTHROUGH
4. PROBLEM STATEMENT AND FINAL SUBMISSION


COURSE 6 - CAPSTONE PROJECT


CAPSTONE PROJECT

Choose from a range of real-world industry-woven projects on advanced topics like Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media Listening, and Speech Recognition, among many others.
Duration: 4 WEEKS

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

SPECIALISATION: NATURAL LANGUAGE PROCESSING

COURSE 3 - ADVANCED MACHINE LEARNING
1. BAGGING & RANDOM FOREST

Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.
Duration: 1 WEEK

1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

2. BOOSTING

Learn about ensemble modelling through bagging and boosting and understand how weak algorithms can be transformed into stronger ones.
Duration: 1 WEEK

1. INTRODUCTION TO BOOSTING AND ADABOOST
2. GRADIENT BOOSTING


3. MODEL SELECTION & GENERAL ML TECHNIQUES

Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.
Duration: 1 WEEK

1. PRINCIPLES OF MODEL SELECTION
2. MODEL EVALUATION
3. MODEL SELECTION: BEST PRACTICES

4. PRINCIPAL COMPONENT ANALYSIS

Understand important concepts related to dimensionality reduction, the basic idea and the learning algorithm of PCA, and its practical applications on supervised and unsupervised problems.
Duration: 1 WEEK

1. PRINCIPAL COMPONENT ANALYSIS AND SINGULAR VALUE DECOMPOSITION
2. PRINCIPAL COMPONENT ANALYSIS IN PYTHON

5. ADVANCED REGRESSION

In this module, take a more advanced look at regression models and learn the concepts related to regularization.
Duration: 1 WEEK

1. GENERALIZED LINEAR REGRESSION
2. REGULARIZED REGRESSION

6. ADVANCED ML CASE STUDY

Build a regularized regression model to understand the most important variables to predict the house prices in Australia.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION


COURSE 4 - ADVANCED MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING

1. TIME SERIES FORECASTING

In this module, you will learn how to analyse and forecast a series that varies with time.
Duration: 2 WEEKS

1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. WORKING WITH STATIONARY TIME SERIES
3. END-TO-END ANALYSIS OF TIME SERIES

2. NEURAL NETS FOR NLP

Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural Networks or ANNs.
Duration: 1 WEEK

1. UNDERSTANDING NEURAL NETWORKS
2. LOSS FUNCTIONS AND BACK PROPAGATION
3. UNDERSTANDING TENSORFLOW
4. CASE STUDY: IMDB MOVIE REVIEW CLASSIFICATION

3. SYNTACTIC PROCESSING

Learn how to analyse the syntax or the grammatical structure of sentences using POS tagging and Dependency parsing.
Duration: 2 WEEKS

1. INTRODUCTION TO SYNTACTIC PROCESSING
2. PARSING
3. INFORMATION EXTRACTION
4. CONDITIONAL RANDOM FIELDS


4. SYNTACTIC PROCESSING ASSIGNMENT

Use techniques such as POS tagging and Dependency parsing to extract information from unstructured text data.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5 - ADVANCED NATURAL LANGUAGE PROCESSING

1. SEMANTIC PROCESSING

Learn the most interesting area in the field of NLP and understand different techniques like word embeddings and topic modelling to build an application that extracts opinions about socially relevant issues.
Duration: 2 WEEKS

1. INTRODUCTION TO SEMANTIC PROCESSING
2. DISTRIBUTIONAL SEMANTICS
3. INDUSTRY APPLICATIONS OF DISTRIBUTIONAL SEMANTICS
4. TOPIC MODELLING

2. APPLIED DL IN NLP

Apply the concepts of DL to natural language processing problems through the encoder-decoder architecture and NMTs, and implement them in TensorFlow.
Duration: 2 WEEKS

1. INTRODUCTION TO MACHINE TRANSLATION
2. ATTENTION-BASED NMT MODEL
3. CUSTOM MODEL BUILDING IN TENSORFLOW

3. CASE STUDY: AUTOMATIC TICKET CLASSIFICATION

Categorise support tickets with the help of Unsupervised learning and Topic modelling.
Duration: 2 WEEKS

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION


COURSE 5 - CAPSTONE PROJECT


1. CAPSTONE PROJECT

Choose from a range of real-world industry-woven projects on advanced topics like Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media Listening, and Speech Recognition, among many others.
Duration: 4 WEEKS

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION

SPECIALISATION: BUSINESS ANALYTICS


COURSE 3 - DATA VISUALISATION AND STORYTELLING
1. VISUALISATION USING TABLEAU

Learn basic visualisation techniques using the most in-demand visualization tool in the industry.
Duration: 1 WEEK

1. DATA EXPLORATION IN TABLEAU
2. VISUALISING AND ANALYSING DATA IN TABLEAU WITH BASIC PLOTS

2. ADVANCED EXCEL

Learn the advanced concepts in Excel and start to perform data analysis like a pro!
Duration: 1 WEEK

1. EXCEL FUNCTIONS
2. DATA ANALYSIS IN EXCEL
3. ADVANCED TOOLS AND VISUALISATIONS

3. VISUALISATION USING POWERBI

Take your visualization game a step forward by understanding how to operate PowerBI.
Duration: 1 WEEK

1. POWERBI: INTRODUCTION AND SETUP
2. VISUALISING AND ANALYSING DATA IN POWERBI
3. DATA TRANSFORMATIONS USING POWERBI


4. STRUCTURED PROBLEM SOLVING USING FRAMEWORKS

Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and SPIN.
Duration: 1 WEEK

1. INTRODUCTION TO STRUCTURED PROBLEM SOLVING
2. INTERVIEWING AND FRAMEWORKS - I: 5W AND 5WHYS
3. INTERVIEWING AND FRAMEWORKS - II: SPIN
4. INDUSTRY DEMONSTRATIONS ON FRAMEWORKS
5. UNDERSTANDING BUSINESS MODEL CANVAS AND ISSUE TREE FRAMEWORK
6. INDUSTRY DEMONSTRATIONS ON ISSUE TREE FRAMEWORK
7. SPECIALIZED FRAMEWORKS FOR BUSINESS PROBLEMS: 7PS, 5CS, ETC.

5. DATA STORYTELLING

Learn how to effectively strategise, communicate, and fine-grain your data analysis projects, understand how to optimally present your findings to technical and non-technical stakeholders, and upgrade your storytelling skills.
Duration: 1 WEEK

1. INTRODUCTION TO DATA STORYTELLING
2. COMPONENTS OF A GOOD STORY WITH DATA - UNDERSTANDING YOUR STAKEHOLDER AND STAKEHOLDER EMPATHY, LEVELS OF DETAILS FOR DIFFERENT STAKEHOLDERS - CXO/LEADERSHIP VS TEAM PRESENTATIONS, VISUALS, ETC.
3. GOLDEN RULES FOR DATA STORYTELLING


6. AIRBNB CASE STUDY

Use your newly learnt UI tools skills to analyse an AirBnB dataset to make important business decisions. But the analysis is just a small part; can you also effectively present it using Data Storytelling to the right stakeholders?
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - ADVANCED MACHINE LEARNING FOR BUSINESS ANALYSTS

1. BAGGING & RANDOM FOREST

Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.
Duration: 1 WEEK

1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

2. MODEL SELECTION & GENERAL ML TECHNIQUES

Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.
Duration: 2 WEEKS

1. PRINCIPLES OF MODEL SELECTION
2. MODEL BUILDING AND EVALUATION
3. FEATURE ENGINEERING
4. CLASS IMBALANCE


3. TIME SERIES FORECASTING

In this module, you will learn how to analyse and forecast a series that varies with time.
Duration: 2 WEEKS

1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. SMOOTHING TECHNIQUES
3. INTRODUCTION TO AR MODELS
4. BUILDING AR MODELS

4. MODEL SELECTION CASE STUDY

Apply your business acumen to the newly learnt machine learning techniques, and select the model most appropriate for a provided business scenario.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5: SOLVING BUSINESS REQUIREMENTS


1. OPERATIONS RESEARCH IN EXCEL

Learn about the world of operations research through linear and integer optimisations.
Duration: 1 WEEK

1. INTRODUCTION & CONCEPTS OF OPTIMISATION
2. OPTIMISATION USING EXCEL
3. OPTIMISATION USING PYTHON
4. OR IN INDUSTRY - WAREHOUSE PROBLEM, ASSIGNMENT PROBLEM, JOB-SHOP SCHEDULING, ETC.


2. DATA ARCHITECTURE

Given a broad business challenge, describe how you would approach the development of a Machine Learning Architecture strategy using the Structured Problem Solving Method.
Duration: 1 WEEK

1. COMPONENTS OF EFFECTIVE DATA ARCHITECTURE
2. TECHNOLOGY AND INFRASTRUCTURE
3. TOOLS TO BUILD AN EFFECTIVE DATA ARCHITECTURE

3. DATA STRATEGY

Understand how to identify the right business problems (Revenue/Cost Perspective, Value Chain) using the DS project assessment framework. You will also learn how to manage a product from production to deployment and understand the overall lifecycle management of an Analytics/DS project.
Duration: 2 WEEKS

1. BACKGROUND OF DATA STRATEGY
2. CORE OF DATA STRATEGY-I
3. CORE OF DATA STRATEGY-II
4. CASE STUDIES FOR DATA STRATEGY

4. BUSINESS CASE STUDY

Understand how a project in the industry is taken up and solved through a comprehensive business case study.
Duration: 2 WEEKS

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5 - CAPSTONE PROJECT


1. CAPSTONE PROJECT

Solve an end-to-end real-life industry problem from a wide variety of domains.
Duration: 4 WEEKS

1. POWER BI - OPTIONAL
2. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
3. PROBLEM STATEMENT
4. EVALUATION RUBRIC
5. MID SUBMISSION
6. FINAL SUBMISSION
7. SOLUTION


SPECIALISATION: BUSINESS INTELLIGENCE / DATA ANALYTICS

COURSE 3 - DATA VISUALISATION AND STORYTELLING
1. VISUALISATION USING TABLEAU

Learn basic visualisation techniques using the most in-demand visualization tool in the industry.
Duration: 1 WEEK

1. DATA EXPLORATION IN TABLEAU
2. VISUALISING AND ANALYSING DATA IN TABLEAU WITH BASIC PLOTS

2. ADVANCED EXCEL

Learn the advanced concepts in Excel and start to perform data analysis like a pro!
Duration: 1 WEEK

1. EXCEL FUNCTIONS
2. DATA ANALYSIS IN EXCEL
3. ADVANCED TOOLS AND VISUALISATIONS

3. VISUALISATION USING POWERBI

Take your visualization game a step forward by understanding how to operate PowerBI.
Duration: 1 WEEK

1. POWERBI: INTRODUCTION AND SETUP
2. VISUALISING AND ANALYSING DATA IN POWERBI
3. DATA TRANSFORMATIONS USING POWERBI


4. STRUCTURED PROBLEM SOLVING USING FRAMEWORKS

Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and SPIN.
Duration: 1 WEEK

1. INTRODUCTION TO STRUCTURED PROBLEM SOLVING
2. INTERVIEWING AND FRAMEWORKS - I: 5W AND 5WHYS
3. INTERVIEWING AND FRAMEWORKS - II: SPIN
4. INDUSTRY DEMONSTRATIONS ON FRAMEWORKS
5. UNDERSTANDING BUSINESS MODEL CANVAS AND ISSUE TREE FRAMEWORK
6. INDUSTRY DEMONSTRATIONS ON ISSUE TREE FRAMEWORK
7. SPECIALIZED FRAMEWORKS FOR BUSINESS PROBLEMS: 7PS, 5CS, ETC.

5. DATA STORYTELLING

Learn how to effectively strategise, communicate, and fine-grain your data analysis projects, understand how to optimally present your findings to technical and non-technical stakeholders, and upgrade your storytelling skills.
Duration: 1 WEEK

1. INTRODUCTION TO DATA STORYTELLING
2. COMPONENTS OF A GOOD STORY WITH DATA - UNDERSTANDING YOUR STAKEHOLDER AND STAKEHOLDER EMPATHY, LEVELS OF DETAILS FOR DIFFERENT STAKEHOLDERS - CXO/LEADERSHIP VS TEAM PRESENTATIONS, VISUALS, ETC.
3. GOLDEN RULES FOR DATA STORYTELLING


4. AIRBNB CASE STUDY

Use your newly learnt UI tools skills to analyse an AirBnB dataset to make important business decisions. But the analysis is just a small part; can you also effectively present it using Data Storytelling to the right stakeholders?
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4: ADVANCED DBS AND BIG DATA ANALYTICS

1. DATA MODELLING

In this module, you will learn and use data modelling on a dataset to solve a business problem.
Duration: 1 WEEK

1. DATABASE DESIGN RECAP
2. BUILDING BLOCKS OF DATA MODELLING
3. PROBLEM SOLVING USING DATA MODELLING
4. DATA MODELLING: OPTIONAL ASSIGNMENT

2. ADVANCED SQL AND BEST PRACTICES

Apply advanced SQL concepts like windowing and procedures to derive insights from data and answer pertinent business questions.
Duration: 1 WEEK

1. WINDOW FUNCTIONS
2. CASE STATEMENTS, STORED ROUTINES, AND CURSORS
3. QUERY OPTIMISATION AND BEST PRACTICES
4. PROBLEM SOLVING USING SQL

3. INTRODUCTION TO BIG DATA AND CLOUD

Understand the basics of big data and cloud and learn to work with an EMR cluster on a cloud-based service.
Duration: 1 WEEK

1. BIG DATA AND CLOUD COMPUTING
2. AMAZON WEB SERVICES
3. BIG DATA STORAGE AND PROCESSING - HADOOP
4. EMR CLUSTER IN AWS


4. ANALYTICS USING SPARK

Use PySpark to do EDA and Predictive Analysis using Spark’s ML library.
Duration: 2 WEEKS

1. EXPLORATORY DATA ANALYSIS WITH PYSPARK
2. PREDICTIVE ANALYSIS WITH SPARK MLLIB

5. BIG DATA CASE STUDY

Use your analytics skills to work on a large dataset in the cloud to solve an industry problem.
Duration: 1 WEEK

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5: ADVANCED PROBLEM SOLVING AND PROGRAMMING

1. DATA STRUCTURES - SETS, DICTIONARIES, STACKS, QUEUES

Learn user-defined data structures (Stack, Queue, Trees) in Python that help in advanced data manipulation.
Duration: 1 WEEK

1. IN-BUILT DATA STRUCTURES
2. STACK
3. QUEUE
4. TREES

2. SEARCHING AND SORTING

Learn the most fundamental searching and sorting algorithms and design techniques.
Duration: 1 WEEK

1. SEARCHING
2. SORTING
3. TWO POINTERS


3. ALGORITHM ANALYSIS + RECURSION

Learn how to assess the efficiency of your code using algorithm analysis techniques and learn to write recursive algorithms.
Duration: 1 WEEK

1. ALGORITHM ANALYSIS
2. TIME AND SPACE COMPLEXITY
3. RECURSION

4. ADVANCED DATABASE PROGRAMMING USING PANDAS

Learn and implement advanced wrangling functions and techniques in Pandas related to date-time, multi-column aggregation, hierarchical indexing, and more.
Duration: 1 WEEK

1. ADVANCED DATA WRANGLING WITH PANDAS - I
2. ADVANCED DATA WRANGLING WITH PANDAS - II

5. PYTHON & SQL LAB

In this competitive assignment, you will solve a variety of programming questions in both SQL and Python in a timed environment. You will also demonstrate one of the questions through a video submission to help improve your interviewing skills.
Duration: 2 WEEKS

1. SQL: TIMED TEST + ASSIGNMENT
2. PYTHON: TIMED TESTS I & II
3. VIDEO SUBMISSION

COURSE 6 - CAPSTONE PROJECT


1. CAPSTONE PROJECT

Solve an end-to-end real-life industry problem from a wide variety of domains.
Duration: 4 WEEKS

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION


SPECIALISATION: DATA ENGINEERING


COURSE 3: DATA ENGINEERING - I
1. DATA MANAGEMENT AND RELATIONAL DATABASE MODELLING

Understand the concepts of Data Management and learn to model data from a Relational Database.
Duration: 1 WEEK

1. ENTERPRISE DATA MANAGEMENT
2. RELATIONAL DATABASE MODELLING
3. NORMAL FORMS AND ER DIAGRAMS

2. INTRODUCTION TO BIG DATA (OPTIONAL)

In this module, you will learn what big data is, its various characteristics, and its determining factors. You will also get an idea of the various sources of big data and the wide range of big data applications in different industries such as retail, healthcare, and finance.
Duration: 0 WEEK

1. 4VS OF BIG DATA
2. BIG DATA: INDUSTRY CASE STUDIES

3. INTRODUCTION TO CLOUD AND AWS SETUP

Understand what the cloud is and set up your AWS account, which will be required during the program.
Duration: 1 WEEK

1. INTRODUCTION TO CLOUD
2. AWS SETUP

4. INTRODUCTION TO HADOOP AND MAPREDUCE PROGRAMMING

Understand the world of distributed data processing and storage with Hadoop. Learn to write MapReduce jobs in Python.
Duration: 1 WEEK

1. CONCEPTS RELATED TO DISTRIBUTED COMPUTING
2. HADOOP DISTRIBUTED FILE SYSTEM
3. MAPREDUCE PROGRAMMING IN PYTHON


5. ASSIGNMENT (OPTIONAL)

Solve an assignment to brush up the skills learnt so far.
Duration: 1 WEEK

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

6. NOSQL DATABASES AND APACHE HBASE NOSQL DATABASES


AND MONGODB (OPTIONAL)

Learn the concepts of NoSQL databases. Understand the working of Apache HBase.
Duration: 1 WEEK

1. CONCEPTS OF NOSQL DATABASES
2. INTRODUCTION TO APACHE HBASE
3. HBASE PYTHON API
4. COMPARISON OF NOSQL DATABASES

7. DATA WAREHOUSING (OPTIONAL)

Understand the intricacies behind designing a data warehouse and a data lake for one or more use cases.
Duration: 0 WEEK

1. INTRODUCTION TO DATA WAREHOUSE AND DATA LAKES
2. DESIGNING DATA WAREHOUSING FOR AN ETL DATA PIPELINE
3. DESIGNING DATA LAKE FOR AN ETL DATA PIPELINE

8. DATA INGESTION WITH APACHE SQOOP AND APACHE FLUME

Duration: 1 WEEK
Get familiar with the challenges involved in data ingestion. Use Sqoop and Flume to ingest structured and unstructured data into Hadoop.

1. INTRODUCTION TO DATA INGESTION
2. STRUCTURED DATA INGESTION WITH SQOOP
3. UNSTRUCTURED DATA INGESTION WITH FLUME


9. MAPREDUCE PROGRAMMING ASSIGNMENT

Duration: 1 WEEK
Practise MapReduce programming on a big dataset.

1. PROBLEM STATEMENT AND SAMPLE DATASET
2. SOLUTION

COURSE 4 - DATA ENGINEERING - II

1. HIVE & QUERYING

Duration: 2 WEEKS
Manage and query a data warehouse with Apache Hive. Learn to write optimized HQL for large-scale data analysis.

1. FUNDAMENTALS OF APACHE HIVE
2. WRITING HQL FOR DATA ANALYSIS
3. PARTITIONING AND BUCKETING WITH HIVE

2. ASSIGNMENT (OPTIONAL)

Duration: 0 WEEK
Solve an assignment to brush up on the skills learnt so far.

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

3. AMAZON REDSHIFT

Duration: 1 WEEK
Learn to deploy a Redshift cluster and use it for querying data.

1. DATA WAREHOUSING WITH REDSHIFT
2. ANALYZE DATA WITH REDSHIFT

4. INTRODUCTION TO APACHE SPARK

Duration: 1 WEEK
Get introduced to Apache Spark, a lightning-fast big data processing engine.

1. SPARK ARCHITECTURE
2. RDD, DATAFRAME API, SPARKSQL


5. PROJECT: ETL DATA PIPELINE

Duration: 2 WEEKS
Make use of Sqoop, Redshift & Spark to design an ETL data pipeline.

1. INTRODUCTION AND PROBLEM STATEMENT
2. GRADING RUBRICS AND SUBMISSION

6. AWS CLOUD INFRASTRUCTURE (OPTIONAL)

Duration: 0 WEEK
Do a deep dive into AWS Cloud.

1. THE AWS CLOUD PLATFORM
2. BUILDING AND DEPLOYING VIRTUAL MACHINES
3. AWS CLOUD STORAGE SOLUTIONS
4. APPLICATION DEPLOYMENT
5. CLOUD ADMINISTRATION AND SECURITY
6. LOAD BALANCING AND BACKUP STRATEGIES
7. CLOUD AUTOMATION

COURSE 5 - DATA ENGINEERING - III

1. OPTIMISING SPARK FOR LARGE SCALE DATA PROCESSING

Duration: 1 WEEK
Use PySpark to create large-scale data processing applications.

1. RUNNING SPARK ON MULTINODE CLUSTER
2. SPARK MEMORY & DISK OPTIMISATION
3. OPTIMISING SPARK CLUSTER ENVIRONMENT


2. APACHE FLINK (OPTIONAL)

Duration: 0 WEEK
Get introduced to Apache Flink and learn to query batch data. Use the DataStream API to create a stream processing application.

1. INTRODUCTION TO APACHE FLINK
2. BATCH DATA PROCESSING WITH FLINK
3. STREAM PROCESSING WITH APACHE FLINK
4. SQL API

3. REAL-TIME DATA STREAMING WITH APACHE KAFKA

Duration: 1 WEEK
Understand the producer-consumer architecture of Apache Kafka. Learn to set up a Kafka cluster for managing real-time data.

1. INTRO TO REAL-TIME DATA PROCESSING ARCHITECTURES
2. FUNDAMENTALS OF APACHE KAFKA
3. SETTING UP KAFKA PRODUCER AND CONSUMER
4. KAFKA CONNECT API & KAFKA STREAMS

4. REAL-TIME DATA PROCESSING USING SPARK STREAMING

Duration: 1 WEEK
Learn about the real-time data processing architecture of Apache Spark. Build Spark Streaming applications to process data in real time.

1. SPARK STREAMING ARCHITECTURE
2. SPARK STREAMING APIS
3. BUILDING STREAM PROCESSING APPLICATION WITH SPARK
4. COMPARISON BETWEEN SPARK STREAMING AND FLINK

5. ASSIGNMENT (OPTIONAL)
Duration: 0 WEEK
Solve an assignment to brush up on the skills learnt so far.

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS


6. BUILDING AUTOMATED DATA PIPELINES WITH AIRFLOW

Duration: 1 WEEK
Automate data pipelines with Airflow.

1. FUNDAMENTALS OF AIRFLOW
2. WORKFLOW MANAGEMENT WITH AIRFLOW
3. AUTOMATING AN ENTIRE DATA PIPELINE WITH AIRFLOW

7. ANALYTICS USING PYSPARK

Duration: 1 WEEK
Use PySpark to perform EDA and predictive analysis with Spark’s ML library.

1. EXPLORATORY DATA ANALYSIS WITH PYSPARK
2. PREDICTIVE ANALYSIS WITH SPARK MLLIB

8. PROJECT: REAL TIME DATA PROCESSING

Duration: 1 WEEK
Build an end-to-end real-time data processing application using Spark Streaming and Kafka.

1. INTRODUCTION AND PROBLEM STATEMENT
2. GRADING RUBRICS AND SUBMISSION

COURSE 5 - CAPSTONE PROJECT


CAPSTONE PROJECT

Duration: 4 WEEKS
The capstone project will stitch all the components of data engineering together.

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

