Part3 ML

This document provides an overview of machine learning and data science concepts along with how Python can be used for data analysis and machine learning. It discusses popular Python libraries for data analysis (pandas, NumPy, Matplotlib), machine learning (scikit-learn), and deep learning (TensorFlow, Keras). Specific machine learning algorithms covered include KNN, Naive Bayes classification, decision trees, time series analysis, and association rule mining. Code examples are provided to demonstrate how to implement these algorithms in Python.

Uploaded by

TechManager SaharaTvm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views201 pages

Part3 ML

Uploaded by

TechManager SaharaTvm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 201

MACHINE LEARNING

How to apply Data Science in your domain?

USE CASES
• To predict or automate some task in your domain
• Industry-predicting failure of a machine
• Healthcare-predicting occurrence of a disease
• Banking-fraud detection
• Finance-sales prediction
• HR department-predicting salary based on candidate’s credentials
• Real estate-House prediction
• Education-Customized and dynamic learning experience, career prediction,better
student performance evaluation etc
Roles of Data Analytics in Education
Industry
Why Python for Data Science?
1)Python is easy to use-syntax is simple
• Less lines of codes for implementing a task when compared to other programming languages
• 2)Python supports many libraries and framework
• Opensource
• Main libraries-numpy,pandas,scikitlearn,matplotlib,seaborn
• DL lib-tensorflow,keras,Theano,pytorch- for dl nn
• Python is used for creating web based applications,web services,web scrapping
• 3)Python has community and corporate support-google search-github link,online repositories,
various online learning resources apart from youtube tutorials
• Many top companies like Google,fb,amazon use python to implement their product,eg amazon
alexa,google assistant,siri from apple,Netflix movie recommendation system,fb using friend
recommendation system using python
Creating File/Project
Writing Code
Variable Explorer and Output Console
File Explorer
Python Basics
16
• Variables

• Data types

• Operators

• Conditional Statements

• Loops

• Functions
17
Exploratory Data Analysis refers to the critical process of
performing initial investigations on data so as to discover
patterns,to spot anomalies,to test hypothesis and to check
assumptions with the help of summary statistics and graphical
representations.
Python Tools and Libraries
for Data Science
DATA ANALYSIS
DEPLOYMENT
IDE 1.Panda
1.Flask
1.Spyder 2.Numpy
2.Django
2.Pycharm 3.Matplotlib
3.AWS
3.Jupyter 4.Seaborn
4.Azure
5.Scipy

DATA SCIENCE

MACHINE
VISUALLIZATION LEARNING &
1.Tableau DEEP LEARNING
2.Power BI 1.Sklearn
2.Tensorflow
3.Keras
4.Pytorch
Python Libraries for Data Analysis, Data
Modelling and Visualisation
Numpy
• Numpy provides array oriented computing
• Numpy provides a fast built-in object(ndarray)which is a multi dimensional array of
homogeneous data
Python Implementation
Pandas
• Pandas is a high-level data manipulation tool
• It is built on the Numpy package - key data structure is DataFrame
• DataFrames allow to store and manipulate tabular
data in rows of observations and columns of variables
Loading the data
25
Python Implementation
Data Visualisation
Why data visualisation
Python Implementation
Seaborn
• Used for data visualization and is based on Matplotlib
• Seaborn allows the creation of statistical graphs

Functionalities
• Allows comparison between multiple variables
• Supports multigrid plot
• Univariate and bivariate visualizations
• Availability of different color palettes
Python Implementation
Scipy
• SciPy is an Open Source library of scientific tools for Python. It depends on the NumPy library, and it gathers a
variety of high level science and engineering modules together as a single package. SciPy provides modules for
• file input/output
• statistics
• optimization
• numerical integration
• linear algebra
• Fourier transforms
• signal processing
• image processing
Scikit learn-Sklearn
• Sklearn is machine learning library
• Simple and efficient tool for data analysis
• It features various regression, classification and clustering algorithms
• Dimensionality reduction, model selection and preprocessing algorithm
• Built on Numpy, Scipy and Matplotlib
MACHINE LEARNING ALGORITHMS
MACHINE LEARNING
ALGORITHMS
ML ALGORITHM

SUPERVISED UNSUPERVISED RE-INFORCEMENT

REGRESSION CLASSIFICATION CLUSTERING ASSOCIATION SYSTEMS WITH

ANALYSIS FEEDBACK
1.SIMPLE LINEAR 1.K-NEAREST 1.K-MEANS
2.MULTIPLE LINEAR NEIGHBOURS 2.DBSCAN
3.LOGISTIC 2.NAIVE BAYERS 3.HIERARCHICAL
4POISSON 3.DECISION TREE
5.NEGATIVE BINOMIAL
6.ZERO INFLATED
SUPERVISED MACHINE LEARNING
Python Implementation
KNN algorithm is based of feature similarity. Choosing right value of
‘k’ is a process called ’parameter tuning’ and is it important for better
accuracy
How to choose k?
When do we use KNN?
How does KNN algorithm work?
According to Euclidean distance formula,distance between two
points with coordinates(x,y) and (a,b) is given by:
Python Implementation
Python Implementation
Python Implementation
Python Implementation
Forecasting –Time Series Analysis
Python Implementation
• Support represents the popularity of that product of all the product
transactions
• Confidence can be interpreted as the likelihood of purchasing both
the products A and B
• Confidence is calculated as the number of transactions that include
both A and B divided by the number of transactions includes only
product A
• The lift value is a measure of importance of a rule

• For an association rule X ==> Y, if the lift is equal to 1, it means that X

and Y are independent. If the lift is higher than 1, it means that X and
Y are positively correlated. If the lift is lower than 1, it means that X
and Y are negatively correlated
Python Implementation

Klein B. Data Analysis With Python. Numpy, Matplotlib and Pandas 2021
No ratings yet
Klein B. Data Analysis With Python. Numpy, Matplotlib and Pandas 2021
515 pages
Data Science
No ratings yet
Data Science
17 pages
NM1021 Report
No ratings yet
NM1021 Report
28 pages
Ai, Ds & ML
No ratings yet
Ai, Ds & ML
52 pages
Eurotherm 2408 2404 Installation Handbook
No ratings yet
Eurotherm 2408 2404 Installation Handbook
139 pages
DS Unit 1 - NUMPY
No ratings yet
DS Unit 1 - NUMPY
29 pages
Machine Manual - TPSys - 2.4
No ratings yet
Machine Manual - TPSys - 2.4
276 pages
Elc Report
No ratings yet
Elc Report
12 pages
Unit 4
No ratings yet
Unit 4
105 pages
Informationuser 1
No ratings yet
Informationuser 1
17 pages
CHANGELOG
No ratings yet
CHANGELOG
41 pages
Python For Data Science (Anees Ahamad) - 20250408 - 180733 - 0000
No ratings yet
Python For Data Science (Anees Ahamad) - 20250408 - 180733 - 0000
12 pages
Data Analytics and Visualization With Python 1728356869
No ratings yet
Data Analytics and Visualization With Python 1728356869
121 pages
EXCEL Formula (Importent)
No ratings yet
EXCEL Formula (Importent)
212 pages
CHAFON Ticket Management Solution
No ratings yet
CHAFON Ticket Management Solution
7 pages
NumPy, Pandas, MatplotLib, Seaborn, ScikitLearn (SkLearn)
No ratings yet
NumPy, Pandas, MatplotLib, Seaborn, ScikitLearn (SkLearn)
14 pages
Data Preprocessing and Data Analysis Using Python
No ratings yet
Data Preprocessing and Data Analysis Using Python
32 pages
Python For Data Science
No ratings yet
Python For Data Science
8 pages
Data Sets
No ratings yet
Data Sets
36 pages
Dsbda Unit4
No ratings yet
Dsbda Unit4
110 pages
Important Libraries For Data Science
No ratings yet
Important Libraries For Data Science
29 pages
PythonDASE - 2025 Version1
No ratings yet
PythonDASE - 2025 Version1
44 pages
1 Hindawi
No ratings yet
1 Hindawi
16 pages
Chapter 6 - Automated and Emerging
No ratings yet
Chapter 6 - Automated and Emerging
12 pages
Lecture 4 Academic Writing
No ratings yet
Lecture 4 Academic Writing
26 pages
Die Zahlen Bingo
No ratings yet
Die Zahlen Bingo
32 pages
How To Use PortSIP UC With Fanvil Product
No ratings yet
How To Use PortSIP UC With Fanvil Product
10 pages
1739440384836.MIT App Inventor - Answers - Textbooks
No ratings yet
1739440384836.MIT App Inventor - Answers - Textbooks
3 pages
Tool and Lib in Data Science
No ratings yet
Tool and Lib in Data Science
32 pages
Unit 1
No ratings yet
Unit 1
84 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
51 pages
Data Analysis With Python & Pandas
100% (2)
Data Analysis With Python & Pandas
378 pages
l9 Scientific Python Proc
No ratings yet
l9 Scientific Python Proc
30 pages
Mainframe 2025
No ratings yet
Mainframe 2025
2 pages
Python For Data Analysis
No ratings yet
Python For Data Analysis
49 pages
Capstone Project Rinshana
No ratings yet
Capstone Project Rinshana
17 pages
Desktop Organizer Wallpapers 3240 X 2160 SlidesMania
No ratings yet
Desktop Organizer Wallpapers 3240 X 2160 SlidesMania
13 pages
MBOS Reactivation and MFA Reset 2
No ratings yet
MBOS Reactivation and MFA Reset 2
1 page
INDUSTRY 2 Jaimin
No ratings yet
INDUSTRY 2 Jaimin
14 pages
ML File Updated
No ratings yet
ML File Updated
60 pages
Data Science Using With Python
No ratings yet
Data Science Using With Python
14 pages
A Report Submitted in Partial Fulfillment of The Requirement of The Award of Degree of
No ratings yet
A Report Submitted in Partial Fulfillment of The Requirement of The Award of Degree of
35 pages
Columbine DOOM2
No ratings yet
Columbine DOOM2
2 pages
Exploring The Power of Data Manipulation and Analysis - A Comprehensive Study of NumPy, SciPy, and Pandas
No ratings yet
Exploring The Power of Data Manipulation and Analysis - A Comprehensive Study of NumPy, SciPy, and Pandas
23 pages
Unit 1-1
No ratings yet
Unit 1-1
10 pages
Wa0003.
No ratings yet
Wa0003.
12 pages
Data Science Tools
No ratings yet
Data Science Tools
2 pages
DTM QB Final
No ratings yet
DTM QB Final
13 pages
Python Week+1 New
No ratings yet
Python Week+1 New
44 pages
Data Science
No ratings yet
Data Science
42 pages
Examples Nastran
100% (1)
Examples Nastran
442 pages
Data Science
No ratings yet
Data Science
9 pages
Programming For Data Science
No ratings yet
Programming For Data Science
48 pages
Data Science With Python ML Course Syllabus
No ratings yet
Data Science With Python ML Course Syllabus
4 pages
Data Science
No ratings yet
Data Science
8 pages
Daily Code DSA
No ratings yet
Daily Code DSA
20 pages
Get File - Descargar Assimil Ingles Sin Esfuerzo mp3
No ratings yet
Get File - Descargar Assimil Ingles Sin Esfuerzo mp3
3 pages
Numpy Lib
No ratings yet
Numpy Lib
19 pages
Mediatek Dimensity 8300 Release FINAL
No ratings yet
Mediatek Dimensity 8300 Release FINAL
2 pages
Data Preprocessing-AIML Algorithm1
No ratings yet
Data Preprocessing-AIML Algorithm1
47 pages
Suraj Report File
No ratings yet
Suraj Report File
17 pages
Embedded Computing Basics - Microcontrollers - System-On-Chips
No ratings yet
Embedded Computing Basics - Microcontrollers - System-On-Chips
4 pages
Chapter-14 Data Science
No ratings yet
Chapter-14 Data Science
12 pages
Data Science I: Charles C.N. Wang
No ratings yet
Data Science I: Charles C.N. Wang
68 pages
Final Exam Web Application - Application Summary
No ratings yet
Final Exam Web Application - Application Summary
2 pages
PYTHON
No ratings yet
PYTHON
11 pages
DSBA Curriculum Guide
No ratings yet
DSBA Curriculum Guide
18 pages
Ass1 DSBDA Writeup
No ratings yet
Ass1 DSBDA Writeup
8 pages
Data Visualization
No ratings yet
Data Visualization
25 pages
Hyperfine Revit Shortcuts
No ratings yet
Hyperfine Revit Shortcuts
1 page
1
No ratings yet
1
3 pages
Internship Presentation
No ratings yet
Internship Presentation
18 pages
DAL EXT 1 and 2
No ratings yet
DAL EXT 1 and 2
125 pages
Lab - Manual FDS
No ratings yet
Lab - Manual FDS
12 pages
Machine Learning in Python Main Developments and T
100% (1)
Machine Learning in Python Main Developments and T
44 pages
Bernd Klein Python Data Analysis Letter
No ratings yet
Bernd Klein Python Data Analysis Letter
514 pages
12.1.1 BC9000 - BC9050 Controlador Terminal de Bus Ethernet EN PDF
No ratings yet
12.1.1 BC9000 - BC9050 Controlador Terminal de Bus Ethernet EN PDF
2 pages
Python For Data Science
No ratings yet
Python For Data Science
8 pages
Python Data Analysis Sample Chapter
No ratings yet
Python Data Analysis Sample Chapter
40 pages
Python Libraries
No ratings yet
Python Libraries
17 pages
Applied Data Science With Python-N
No ratings yet
Applied Data Science With Python-N
17 pages
20191120122749-Data Science Certification Training
No ratings yet
20191120122749-Data Science Certification Training
4 pages
Made Easy Books List
100% (1)
Made Easy Books List
3 pages
CS3361 - Data Science Laboratory
No ratings yet
CS3361 - Data Science Laboratory
31 pages
Registration Report
No ratings yet
Registration Report
1 page
Zero Trust Essentials Ebook - Microsoft
No ratings yet
Zero Trust Essentials Ebook - Microsoft
11 pages
Core Libraries For Machine Learning
No ratings yet
Core Libraries For Machine Learning
5 pages
Python For Data Science Extended Ebook PDF
100% (5)
Python For Data Science Extended Ebook PDF
56 pages
Mastering Python Data Visualization - Sample Chapter
100% (9)
Mastering Python Data Visualization - Sample Chapter
63 pages
Python Programming: General-Purpose Libraries; NumPy,Pandas,Matplotlib,Seaborn,Requests,os & sys: Python, #2
From Everand
Python Programming: General-Purpose Libraries; NumPy,Pandas,Matplotlib,Seaborn,Requests,os & sys: Python, #2
e3
No ratings yet

Part3 ML

Uploaded by

Part3 ML

Uploaded by

MACHINE LEARNING

How to apply Data Science in your domain?

SUPERVISED UNSUPERVISED RE-INFORCEMENT

REGRESSION CLASSIFICATION CLUSTERING ASSOCIATION SYSTEMS WITH

• For an association rule X ==> Y, if the lift is equal to 1, it means that X

You might also like