FACULTY
BIG DATA &
MACHINE LEARNING
PRODEGREE
Imarticus Learning is an
EdTech Partner of:
TOP 4 Leading Institute in
BIG DATA
2016
10
1 0110
0 011
SAS & Python
1 01
1 00
1
ANALYTICS TRAINING Analytics Vidhya
NO
TRAINING INSTITUTE
INSTITUTE IN INDIA
INDUSTRY LANDSCAPE
Machine learning marks a major technological breakthrough in the field of computer
science, big data and artificial intelligence. Machine learning has seen a phenomenal
rise in terms of industry application over the last year because it is ideal for exploiting
the opportunities hidden in Big Data.
THE SKILLS GAP IN DEMAND SKILL SETS
à Distributed Computing
à Predictive Modeling
à Math, Stats
Projected growth of Fortune 500 à Machine Learning
in global data organizations will need à Storytelling
generated to exploit Big Data by
à Deep Learning
each year 2020 to stay in the
game
GROWING DEMAND DATA SCIENTIST:
THE
9.93 L
SEXIEST
BIG DATA
10.43 L
JOB IN THE
MACHINE LEARNING
MACHINE LEARNING + BIG DATA 13.94 L
21 CENTURYST
HARVARD BUSINESS REVIEW, OCT 2012
SALARY IN LAKHS PER ANNUM
TOP COMPANIES EXPLOITING
BIG DATA AND
MACHINE LEARNING
OVERVIEW OF PROGRAM
The Big Data and Machine Learning Prodegree, in association with IBM as the
EdTech Partner, is a first-of-its-kind 145-hour program providing in-depth exposure
to Data Science, Machine Learning and Deep Learning through a rigorous industry-
aligned curriculum featuring Python and IBM Watson.
26%
20%
ning
Dat th Pyth
wi
11
Lear
a An on
% %
14
hine
alys
St g
in
is
Mac
at
ist a rn ons
ics Le ati
ep lic
8% De App 6%
Bas n
Pythic of so
Wat
on IBM
9%
6% Introduction
Job Read
iness
to ML
145
HOURS
PROGRAM AVAILABLE
PROGRAM AVAILABLEININCLASSROOM & ONLINE
CLASSROOM DELIVERY
& ONLINE FORMAT
DELIVERY FORMAT
CURRICULUM
LEVEL 1. INTRODUCTION TO STATISTICS
ML Spectrum & Journey | Intro to Modeling Lifecycle | Intro to
OVERVIEW OF Supervised Learning | Intro to Unsupervised Learning | Big Data and
MACHINE LEARNING Hadoop | Big Data and its Sources | Popular Tools Used for Big Data
AND BIG DATA | RDBMS vs Hadoop | Hadoop Architecture and Ecosystem | HDFS
Design and Architecture Overview | When to Use & Not Use Hadoop?
Rules of Probability | Permutations and Combinations | Bayers
BASIC PROBABILITY Theorem | Descriptive Statistics | Compound Probability |
AND TERMS Conditional Probability
PROBABILITY Types of Distributions | Functions of Random Variables | Probability
DISTRIBUTIONS Distribution Graphs | Confidence Intervals
DATA Merge, Rollup, Transpose and Append | Missing Analysis and Treatment
TRANSFORMATION | Outlier Analysis and Treatment
Summarizing and Visualizing the Important Characteristics of Data
EXPLORATORY Hypothesis Testing | Visualizations | Univariates, Bivariates |
DATA ANALYSIS
Crosstabs, Correlation
LEVEL 2. BASICS OF PYTHON FOR MACHINE LEARNING
Python Basics | Spyder IDE | Jupyter Notebook | Floats and
INTRODUCTION
Strings | Simple Input & Output | Variables | Single and
TO PYTHON
Multiline Comments
CONTROL Booleans and Comparisons | Conditional Statements (IF ELSE)
STRUCTURES Operator Precedence | Lists - Operations and Functions
FUNCTIONS AND Function Arguments | Comments and doc strings | Functions as
MODULES Objects | Modules | Standard Lib and Pip
EXCEPTIONS Exception Handling | Raising Exceptions | Assertions | Working
AND FILES With Files
LEVEL 3. DATA ANALYSIS WITH PYTHON FOR MACHINE LEARNING
Introduction to Pandas | IO Tools | Basics of NumPy | NumPy Functions
PANDAS | Pandas - Series and Dataframes
DATA Basics of Data Visualization | Line Plots | Bar Charts | Pie Charts |
VISUALIZATION Histograms | Scatter Plots | Parallel Coordinates
Logistic Regression with Python | Making Sense of Result
LOGISTIC Parameters: Wald Test, Likelihood Ratio Test Statistic, Chi-Square Test |
REGRESSION Goodness of Fit Measures | Model Validation: Cross Validation, ROC
Curve, Confusion Matrix
Implementing Decision Trees using Python | Homogeneity | Entropy
DECISION TREES | Information Gain | Gini Index | Standard Deviation Reduction |
Vizualizing & Prunning a Tree
Handling Time Series Data | Holt-Winters Model | ARIMA Model |
TIME SERIES
ACF/PACF Functions
PROJECT 1 LINEAR REGRESSION Property Price Prediction Using Linear Regression
PROJECT 2 LOGISTIC REGRESSION Bankruptcy Prediction Using Logistic Regression
PROJECT 3 DECISION TREES Facebook Post Count Using Decision Tree
Forecasting and Predicting the Sales of Furniture of
PROJECT 4 TIME SERIES
the Superstore
LEVEL 4. INTRODUCTION TO MACHINE LEARNING
Machine Learning | ML Modelling Flow | How to Treat Data in ML |
INTRODUCTION TO Parametric & Non-Parametric ML Algorithm | Types of Machine Learning
MACHINE LEARNING | Performance Measures | Bias-Variance Trade-Off | Overfitting &
Underfitting | Bootstrap Sampling | Bagging Aggregation | Boosting
Introduction to SciKit Learn | Load Data into Scikit Learn | Run
Machine Learning Algorithms Both for Unsupervised and Supervised
SCIKIT LEARN Data | Supervised Methods: Classification & Regression |
Unsupervised Methods: Clustering, Gaussian Mixture Models | Decide
What's the Best Model for Every Scenario
Constant Learning Rate Procedures | Adaptive Learning Procedures |
OPTIMISATION Batch Gradient Descent | Mini-Batch Gradient Descent | Stochastic
TECHNIQUES Gradient Descent | Nesterov Accelerated Gradient | Root Mean
Squared Propagation | Adaptive Moment Estimation Procedure
ML ALGORITHM - Linear Regression with Stochastic Gradient Descent | Logistic
SUPERVISED Regression with Stochastic Gradient Descent | K-Nearest Neighbour |
LEARNING AND Eager Methods vs. Lazy Methods | Nearest Neighbor Classification |
UNSUPERVISED Building kD-Trees | Support Vector Machine | Perceptron Algorithm.
LEARNING What is Clustering? | K-means Algorithm | K-means Clusters
ENSEMBLE Ensemble Techniques | Bootstrap Aggregation | Random Forest |
ALGORITHMS Boosting
Neural Networks | The Biological Inspiration Perceptron Learning &
NEURAL NETWORKS Binary Classification | Backpropagation Learning | Learning Feature
Vectors for Words | Object Recognition
LEVEL 5. INTRODUCTION TO DEEP LEARNING
Keras for Classification and Regression in Typical Data Science Problems
KERAS | Setting up KERAS | Different Layers in KERAS | Creating a Neural
Network | Training Models and Monitoring Artificial Neural Networks
PROJECT 5 ANN ON KERAS Credit Default Using ANN on Keras
Introducing Tensorflow | Neural Networks using Tensorflow Debugging
TENSORFLOW and Monitoring | Convolutional Neural Networks | Unsupervised
Learning
Handwriting/Facial Recognition Using CNN on
PROJECT 6 CNN ON TENSORFLOW
TensorFlow
RNN Recurrent Neural Network | Application Areas | Case Study
LEVEL 6. JOB READINESS
The Why, the What and the How of Resumes | Personal Branding
RESUME WRITING Tips and Resources | Using Social Media | CV Discussion
MOCK INTERVIEWS 1:1 or Panel Mock Interviews with Faculty to Clear the Technical Round
- DOMAIN of Interviews to Give You Confidence to Face Real World Scenarios
LEVEL 7. THE FUTURE WITH IBM WATSON
IBM WATSON Fundamentals of IBM Watson | Advantages of IBM Watson | Use
DEVELOPER Cases of Cognitive Services | Applications on IBM Watson
PROJECT 7 IBM WATSON Application on Watson
CAPSTONE PROJECT IN DEEP LEARNING/MACHINE LEARNING
PROJECT PROBLEM STATEMENT GOAL TECHNIQUES
Predicting How can Machine Use historical Use various Machine
Consumers’ Buying Learning help in customer order data Learning techniques
Behaviour on an E- modeling and of eCommerce giant, taught during the
Commerce Portal. predicting human GroceryKart, to course of this
buying behavior? predict which Prodegree.
previously
purchased products
will be reordered
during the next
checkout by a user.
KEY HIGHLIGHTS
COMPREHENSIVE COVERAGE
The Prodegree features 145 hours of impactful, supervised and
unsupervised learning which forms the core of Machine Learning.
Aspirants also learn about Data Science, Machine Learning and
Deep Learning through hands-on practice on tools such as Python
and IBM Watson.
COLLABORATION WITH IBM
The Big Data and Machine Learning Prodegree is co-created with IBM
as the EdTech Partner, who are at the forefront of technology
innovation.
PROJECT BASED LEARNING
The program provides an edge through our unique project-based
methodology, focusing on real life projects. The Prodegree features
seven projects covering tools such as Python and IBM
Watson and teaches you how to apply predictive models to massive
dataset typically found in healthcare, financial services, social media
and many more!
VIRTUAL LABS AND CODING PLATFORM
Learn on a state-of-the-art virtual lab, with 24/7 access to all required
software and datasets pre-installed.
Agnostic of machine configuration, with no installation and compatibility
issues, learn anytime, anywhere!
JOB READINESS
The Imarticus Career Assistance Services (CAS) team prepares you
to be job-ready through extensive interview prep, resume building &
mock interviews.
MENTORSHIP
Industry experts from leading companies act as your personal
mentor to advise and guide you in your journey towards job-
readiness.
TWO DELIVERY MODES TO CHOOSE FROM:
CLASSROOM DELIVERY OR ONLINE DELIVERY
CASE STUDIES
Build valuable hands-on development experience which can be showcased to
future recruiters.
MODELS TECHNIQUE
Linear Regression Sklearn Linear Model & Gradient Descent Model
Logistic Regression Sklearn Logistic Model & Stochastic Average Gradient
Descent
Decision Tree & Random Decision Tree Classifier, Random Forest Classifier,
Forest Adaboost Classifier & Bagging Classifier
KNN KNN Classifier & K Value
SVM SVM Classifier Using Different Kernels
K-Means K-Means Clustering
Neural Networks Neural Networks
FACULTY
SANDEEP AGARWAL VINAY BORHADE SATYA SRINIVAS
Sandeep has over 18 years of Vinay’s tech expertise includes Satya has 25 years of
experience in IT and AI – Machine Learning, experience aligning multi-
extensive hands-on expertise Python, PL-SQL, and Big Data million dollar IT deployments
in application development – Netezza, Java/J2EE. Having with business strategy and
involving analysis, design, served more than 10 years operational processes for
development and with Bank of America (Merrill Fortune 1000 companies. He
maintenance with 10+ years Lynch), he has worked on has expertise in performance
of experience in data mining projects like Finance, Liquidity management in enterprise
and BI and Big Data Hadoop. and Capital Risk (Regulatory architecture, data mining &
He has worked across Reporting) and has won analytics, machine learning,
multiple business domains repeat business from clients pattern recognition, social
such as Manufacturing, for BOA using technologies media analytics and Big Data
Retail, Banking and Insurance like Machine Learning, Python management & analytics.
and has experience with & IBM Netezza. He is a B.E in Satya is a MS - Computer
large-scale, distributed computers from Mumbai and Engineering from Florida
systems design and has strong techno-functional Atlantic University.
development. skills.
Indicative Faculty
PLACEMENT ASSISTANCE
Imarticus provides 100% assistance throughout the program to guide and help
navigate ample career options and help you get job-ready from Day 1.
RESUME BUILDING INTERVIEW PREP MOCK INTERVIEWS
Refine and polish your Prepare you to ace HR and Preparing candidates to
resume with experts tips technical interview rounds face interview scenarios
to help you land your with interview Q&A and through 1:1 mock
coveted job mock interviews interviews with industry
veterans
COLLABORATION WITH IBM
COGNITIVE CLASS IBM PLATFORMS IBM CERTIFICATION
Get access to IBM’s Aspirants are provided All candidates earn IBM
state-of-the-art access to IBM Cloud Badges on completion of
content made and Platforms featuring IBM the Prodegree with an
delivered by the Watson and other option of additional IBM
experts. software for 24/7 practice. certifications.
IBM is a cognitive solutions and cloud platform company that leverages the power of
innovation, data and expertise to improve business and society. For more
information, visit www.ibm.com
OUR CAMPUSES
MUMBAI THANE PUNE
+91 22 61419595 +91 9773111412 +91 8433905205
BANGALORE HYDERABAD DELHI
+91 8971729953 +91 40 47123304/06 +91 8448584775
CHENNAI GURUGRAM AHMEDABAD
+91 9789879741 +91 1244874030 +91 8980682998
FOR ONLINE DELIVERY INQUIRIES: 1800-267-7679
WWW.IMARTICUS.ORG INFO@IMARTICUS.COM