Part3 ML
• Data types
• Operators
• Conditional Statements
• Loops
• Functions
Exploratory Data Analysis refers to the critical process of
performing initial investigations on data so as to discover
patterns, spot anomalies, test hypotheses and check
assumptions with the help of summary statistics and graphical
representations.
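A minimal sketch of that first look at a dataset, assuming pandas is installed. The readings are made-up values, and the 1.5 × IQR rule used to flag anomalies is one common rule of thumb chosen for illustration, not something the slides prescribe.

```python
import pandas as pd

# Hypothetical sensor readings; the last value is deliberately unusual.
readings = pd.Series([10, 12, 11, 13, 12, 95], name="reading")

# Summary statistics: count, mean, std, min, quartiles, max.
summary = readings.describe()
print(summary)

# Flag anomalies with the 1.5 * IQR rule (an illustrative choice).
q1 = readings.quantile(0.25)
q3 = readings.quantile(0.75)
iqr = q3 - q1
lower_bound = q1 - 1.5 * iqr
upper_bound = q3 + 1.5 * iqr

outliers = readings[(readings < lower_bound) | (readings > upper_bound)]
print(outliers)
```

A plot of the same data (a histogram or box plot) would be the graphical counterpart of this check.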
Python Tools and Libraries for Data Science
IDE
1. Spyder
2. PyCharm
3. Jupyter
DATA ANALYSIS
1. Pandas
2. NumPy
3. Matplotlib
4. Seaborn
5. SciPy
DEPLOYMENT
1. Flask
2. Django
3. AWS
4. Azure
VISUALIZATION
1. Tableau
2. Power BI
MACHINE LEARNING & DEEP LEARNING
1. Sklearn
2. TensorFlow
3. Keras
4. PyTorch
Python Libraries for Data Analysis, Data
Modelling and Visualisation
NumPy
• NumPy provides array-oriented computing
• NumPy provides a fast built-in object (ndarray), which is a multidimensional array of
homogeneous data
Python Implementation
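A minimal sketch of the ndarray in use, assuming NumPy is installed. The array values are arbitrary and chosen only for illustration.

```python
import numpy as np

# Build a 2-D ndarray; every element shares one dtype (homogeneous data).
matrix = np.array([[1, 2, 3], [4, 5, 6]])

shape = matrix.shape               # (rows, columns)
doubled = matrix * 2               # element-wise arithmetic, no explicit loop
column_sums = matrix.sum(axis=0)   # sum down each column

print(shape)
print(matrix.dtype)
print(doubled)
print(column_sums)
```

Operating on whole arrays at once, rather than looping over elements in Python, is what "array-oriented computing" refers to.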
Pandas
• Pandas is a high-level data manipulation tool
• It is built on the NumPy package; its key data structure is the DataFrame
• DataFrames let you store and manipulate tabular
data in rows of observations and columns of variables
Loading the data
Python Implementation
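A minimal sketch of loading tabular data into a DataFrame, assuming pandas is installed. The CSV text and its column names (name, age, city) are hypothetical stand-ins for a real file on disk.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for a real file.
csv_text = """name,age,city
Asha,29,Pune
Ben,35,Leeds
Chen,41,Taipei
"""

# pd.read_csv accepts a file path or any file-like object.
df = pd.read_csv(io.StringIO(csv_text))

print(df.head())   # first rows: one observation per row
print(df.shape)    # (number of rows, number of columns)

mean_age = df["age"].mean()   # aggregate one column (variable)
older = df[df["age"] > 30]    # filter rows with a boolean mask
print(mean_age)
print(older)
```

With a real dataset, the same call would take a path, for example pd.read_csv("data.csv"), where the filename is a placeholder.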
Data Visualisation
Why data visualisation
Python Implementation
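A minimal sketch of a basic line chart, assuming Matplotlib is installed. The monthly figures are made-up values, and the non-interactive "Agg" backend is selected only so the script runs without a display.

```python
import io

import matplotlib

matplotlib.use("Agg")  # non-interactive backend; no window is opened
import matplotlib.pyplot as plt

# Illustrative data, not taken from any real source.
months = ["Jan", "Feb", "Mar", "Apr"]
sales = [120, 135, 128, 150]

fig, ax = plt.subplots()
ax.plot(months, sales, marker="o")
ax.set_xlabel("Month")
ax.set_ylabel("Sales")
ax.set_title("Monthly sales (illustrative data)")

# Render to an in-memory PNG; pass a filename instead to write to disk.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
```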
Seaborn
• Seaborn is a data visualization library based on Matplotlib
• Seaborn allows the creation of statistical graphs
Functionalities
• Allows comparison between multiple variables
• Supports multi-plot grids
• Univariate and bivariate visualizations
• Availability of different color palettes
Python Implementation
SciPy
• SciPy is an open-source library of scientific tools for Python. It depends on the NumPy library and gathers a
variety of high-level science and engineering modules into a single package. SciPy provides modules for:
• file input/output
• statistics
• optimization
• numerical integration
• linear algebra
• Fourier transforms
• signal processing
• image processing
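A minimal sketch touching three of the modules listed above (numerical integration, optimization, linear algebra), assuming SciPy and NumPy are installed. The functions and the small linear system are arbitrary examples.

```python
import numpy as np
from scipy import integrate, linalg, optimize

# Numerical integration: integral of x^2 from 0 to 3 (exact value is 9).
area, abs_error = integrate.quad(lambda x: x ** 2, 0, 3)

# Optimization: find the x that minimizes (x - 2)^2.
result = optimize.minimize_scalar(lambda x: (x - 2) ** 2)

# Linear algebra: solve the system A @ x = b.
A = np.array([[3.0, 1.0], [1.0, 2.0]])
b = np.array([9.0, 8.0])
solution = linalg.solve(A, b)

print(area)
print(result.x)
print(solution)
```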
Scikit-learn (Sklearn)
• Sklearn is a machine learning library
• Simple and efficient tool for data analysis
• It features various regression, classification and clustering algorithms
• It also provides dimensionality reduction, model selection and preprocessing algorithms
• Built on NumPy, SciPy and Matplotlib
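A minimal sketch combining preprocessing, model selection and classification, assuming scikit-learn is installed. The iris dataset, logistic regression and the 25% test split are illustrative choices, not ones the slides specify.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Small dataset bundled with scikit-learn (no download needed).
X, y = load_iris(return_X_y=True)

# Model selection: hold out part of the data to evaluate the model.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Preprocessing (feature scaling) followed by a classifier.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)  # fraction of correct predictions
print(f"Test accuracy: {accuracy:.2f}")
```

Other estimators can be swapped into the pipeline, since they expose the same fit and score methods.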
MACHINE LEARNING ALGORITHMS