
Third Year Engineering

Artificial Intelligence and Machine Learning


Class - T.Y. (SEM-IV)
Unit – IV Introduction To Supervised Learning

AY 2024-2025
Unit IV - Syllabus
Unit IV – Introduction to Supervised Learning (09 hours)
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO FEATURE SELECTION
• Feature selection in machine learning refers to the process of
choosing the most relevant and informative features or variables
from a dataset.
• The goal is to improve model performance by reducing the number of
features while retaining those that contribute most to the model's
predictive power.
INTRODUCTION TO FEATURE SELECTION
There are several reasons for performing feature selection:
• Improved Model Performance: Reducing irrelevant or redundant
features can prevent overfitting and help the model generalize
better to new, unseen data.
• Faster Training: With fewer features, training machine learning
models becomes faster and more efficient.
• Enhanced Interpretability: Selecting key features can help in
understanding the important factors that influence predictions,
providing better insights.
INTRODUCTION TO FEATURE SELECTION
Feature selection methods can be broadly categorized into three types:
• Filter Methods: These methods assess the relevance of features based on
statistical measures, such as correlation, mutual information, or
significance tests, independent of any machine learning algorithm.
• Wrapper Methods: These methods involve evaluating subsets of features
by training models and selecting the subset that produces the best model
performance. This is often computationally expensive but can yield good
results.
• Embedded Methods: These methods incorporate feature selection as
part of the model-building process, where the model itself decides which
features are most important during training. Examples include
regularization techniques like Lasso Regression and decision trees with
built-in feature importance measures.
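The three families can be illustrated with scikit-learn. This is a minimal sketch, not the slides' own code; the breast-cancer dataset and the choice of keeping 10 features are assumptions made only for the example.

# Illustrative sketch of the three feature-selection families (scikit-learn).
# Dataset and the choice of k / number of features are assumptions for the example.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, f_classif, RFE
from sklearn.linear_model import LogisticRegression, Lasso

X, y = load_breast_cancer(return_X_y=True)

# Filter method: score each feature with a statistical test (ANOVA F-test), keep the top 10.
filter_selector = SelectKBest(score_func=f_classif, k=10)
X_filter = filter_selector.fit_transform(X, y)

# Wrapper method: recursive feature elimination trains a model repeatedly,
# dropping the weakest feature each round.
wrapper_selector = RFE(estimator=LogisticRegression(max_iter=5000), n_features_to_select=10)
X_wrapper = wrapper_selector.fit_transform(X, y)

# Embedded method: L1 (Lasso) regularization drives unimportant coefficients to zero
# as part of model training itself (used here purely for illustration).
lasso = Lasso(alpha=0.01).fit(X, y)
selected = [i for i, coef in enumerate(lasso.coef_) if coef != 0]

print(X_filter.shape, X_wrapper.shape, len(selected))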
INTRODUCTION TO FEATURE EXTRACTION
• Feature extraction in machine learning involves transforming raw
data into a set of new, more meaningful features that represent
the essential characteristics of the original data.
• It aims to reduce the dimensionality of the data while retaining
important information.
FEATURE EXTRACTION: OBJECTIVES
The primary objectives of feature extraction are:
• Dimensionality Reduction: Converting high-dimensional data into
a lower-dimensional space by extracting the most relevant
information, which helps in alleviating computational complexity
and addressing the curse of dimensionality.
• Improved Model Performance: Creating more informative and
discriminative features can enhance the performance of machine
learning algorithms by focusing on the most critical aspects of the
data.
TYPES OF FEATURE EXTRACTION
Feature extraction techniques can be categorized into various methods:
• Principal Component Analysis (PCA): A popular technique that
transforms data into a new coordinate system by identifying and
retaining the most significant components while discarding less important
ones.
• Linear Discriminant Analysis (LDA): Similar to PCA but focuses on
maximizing class separability, making it particularly useful for
classification tasks.
• Autoencoders: Neural networks that learn to encode input data into a
lower-dimensional representation, then decode it back to the original
space. The encoded values serve as the extracted features.
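A minimal sketch of PCA and LDA with scikit-learn; the iris dataset and the choice of 2 components are assumptions for illustration (an autoencoder would need a neural-network library and is omitted here).

# Illustrative feature-extraction sketch: PCA (unsupervised) vs LDA (supervised).
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)

# PCA keeps the directions of maximum variance in the data.
pca = PCA(n_components=2)
X_pca = pca.fit_transform(X)

# LDA keeps the directions that best separate the classes.
lda = LinearDiscriminantAnalysis(n_components=2)
X_lda = lda.fit_transform(X, y)

print(X.shape, X_pca.shape, X_lda.shape)   # (150, 4) -> (150, 2) for both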
TYPES OF FEATURE EXTRACTION
Feature extraction techniques can be categorized into various
methods:
• Feature Scaling and Normalization: Techniques that rescale
features to a standard range or normalize them to make the data
more amenable to machine learning algorithms.
• Feature Engineering: Creating new features from existing ones
based on domain knowledge or specific insights about the data,
which may involve mathematical transformations, combining
features, or extracting more relevant information.
FEATURE SELECTION VS EXTRACTION
Aspect             | Feature Selection                                      | Feature Extraction
Objective          | Choose the most relevant features                      | Transform original features into new ones
Purpose            | Select a subset of the original features               | Create a new set of features
Outcome            | Subset of original features                            | Transformed or derived new features
Process            | Selection based on relevance/importance                | Transformation / compression / summarization
Method Types       | Filter, Wrapper, Embedded methods                      | PCA, LDA, Autoencoders, Feature Scaling, etc.
Information Type   | Retains original feature values                        | Creates new feature values
Advantages         | Preserves interpretability; computational efficiency   | Reduces dimensionality and captures complexity; enhanced information and noise reduction
Disadvantages      | Potential loss of information; limited scope for capturing complexity | Potential loss of interpretability; loss of nuanced information from the original data
Example Techniques | Correlation, Mutual Information, Lasso Regression      | Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Autoencoders
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)
WHAT IS SUPERVISED LEARNING?
Features:
• Dogs and cats both have 4 legs
and a tail.
• Dogs come in small to large
sizes. Cats, on the other hand,
are always small.
• Dogs have a long mouth while
cats have smaller mouths.
• Dogs bark while cats meow.
• Different dogs have different
ears while cats have almost the
same kind of ears.
WHY IS IT IMPORTANT?
• Supervised learning gives the algorithm experience that can be used
to make predictions for new, unseen data.
• This experience also helps in optimizing the performance of the algorithm.
TYPES OF SUPERVISED LEARNING
Supervised learning can be further classified into:
• Classification

• Regression
PROBLEMS IN MACHINE LEARNING
PROBLEMS IN MACHINE LEARNING
INTRODUCTION TO REGRESSION
Regression: Regression analysis is a predictive modelling technique
which investigates the relationship between a dependent variable and
one or more independent variables.
USES OF REGRESSION
• Determining the strength of predictors (the strength of the effect that
the independent variables have on the dependent variable)
• Forecasting an effect
• Trend forecasting
INTRODUCTION TO LINEAR REGRESSION
• Linear regression is like drawing a straight line through data points
to predict future outcomes or understand the relationship between
two variables.
• It's used when we want to find a relationship between one thing we
want to predict (called the dependent variable) and one or more
things we use to make that prediction (called independent variables
or predictors).
LINEAR REGRESSION: WORKING
• Imagine you have a bunch of points on a graph.
• Linear regression finds the best-fitting line that goes through those
points.
• Once you have this line, you can use it to make predictions about
future points or understand how changes in one variable might
affect another.
LINEAR REGRESSION: WORKING
LINEAR REGRESSION: WORKING
LINEAR REGRESSION
LINEAR REGRESSION
R-SQUARED VALUE
• R-squared value is a statistical measure of how close the data are to
the fitted regression line.
• It is also known as the coefficient of determination (or the coefficient
of multiple determination in multiple regression).
GOODNESS OF FIT
GOODNESS OF FIT
• When the value of R-squared equals 1, all the actual values lie exactly
on the regression line.
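For reference, R-squared can be computed as R² = 1 − (SS_res / SS_tot), where
SS_res = Σ (yᵢ − ŷᵢ)² is the sum of squared residuals around the fitted line and
SS_tot = Σ (yᵢ − ȳ)² is the total sum of squares around the mean of the observed values.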
MEAN SQUARED ERROR
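For a dataset of n points with actual values yᵢ and predictions ŷᵢ, the mean squared
error is MSE = (1/n) Σᵢ (yᵢ − ŷᵢ)². This is the cost that gradient descent (next slides)
minimizes when fitting the line.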
GRADIENT DESCENT
• Gradient descent is an iterative optimization algorithm that finds the
best-fit line for a given training dataset by minimizing a cost such as
the mean squared error.
GRADIENT DESCENT: EXAMPLE
• Area = [2600, 3000, 3200, 3600, 4000]
• Price = [550k, 565k, 610k, 680k, 725k]
GRADIENT DESCENT: EXAMPLE
• Area = [2600, 3000, 3200, 3600, 4000]
• Price = [550k, 565k, 610k, 680k, 725k]
CONT.…
CONT.….

For Slope
CONT.…
CODE
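The slide's original code is not reproduced here; the following is a minimal gradient-descent sketch for the area/price example above. The feature standardization, learning rate, and iteration count are illustrative choices, not values from the slides.

# Minimal gradient-descent sketch for simple linear regression (price = m*area + b).
import numpy as np

area = np.array([2600, 3000, 3200, 3600, 4000], dtype=float)
price = np.array([550000, 565000, 610000, 680000, 725000], dtype=float)

# Scale the feature so a simple fixed learning rate behaves well.
x = (area - area.mean()) / area.std()
y = price

m, b = 0.0, 0.0          # slope and intercept
lr = 0.1                 # learning rate
n = len(x)

for step in range(1000):
    y_pred = m * x + b
    error = y - y_pred
    mse = (error ** 2).mean()            # cost being minimized
    dm = -(2 / n) * (x * error).sum()    # partial derivative w.r.t. m
    db = -(2 / n) * error.sum()          # partial derivative w.r.t. b
    m -= lr * dm
    b -= lr * db

print(f"slope={m:.2f}, intercept={b:.2f}, final MSE={mse:.2f}")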
LINEAR REGRESSION: APPLICATIONS
• Predicting house prices based on factors like size, number of rooms,
location, etc.
• Forecasting sales based on advertising spending, seasonality, or
other factors.
• Understanding how temperature affects ice cream sales.
LINEAR REGRESSION: ADVANTAGES
• Simplicity: Easy to understand and implement.
• Interpretability: Provides insights into the relationship between
variables.
• Speed: Quick to train and make predictions.
LINEAR REGRESSION: DISADVANTAGES
• Assumes Linearity: Assumes that the relationship between
variables is linear, which might not always be the case.
• Sensitivity to Outliers: Outliers (extreme data points) can
significantly impact the model's performance.
• Limited Complexity: Cannot capture complex relationships between
variables without modifications (like polynomial regression).
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO LOGISTIC REGRESSION
• Logistic regression is a machine learning algorithm used for binary
classification tasks: it predicts the probability that an input belongs
to one of two categories.
• It is called "regression" but is actually used for classification.
LOGISTIC REGRESSION: WORKING
• It models the relationship between a dependent binary variable
(target) and one or more independent variables (features).
• Utilizes the logistic function (sigmoid) to transform predictions into
probabilities between 0 and 1.
• The model makes predictions by calculating the probability that an
input belongs to a particular class.
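A minimal sketch of these ideas, assuming scikit-learn; the hours-studied/pass-fail data below is a made-up example, not from the slides.

# The sigmoid squashes a linear score into a probability; LogisticRegression fits such a model.
import numpy as np
from sklearn.linear_model import LogisticRegression

def sigmoid(z):
    # Maps any real number to the (0, 1) range.
    return 1.0 / (1.0 + np.exp(-z))

# Toy data: hours studied vs. pass (1) / fail (0) -- an assumed example.
hours = np.array([[0.5], [1.0], [1.5], [2.0], [2.5], [3.0], [3.5], [4.0]])
passed = np.array([0, 0, 0, 0, 1, 1, 1, 1])

model = LogisticRegression().fit(hours, passed)
print(model.predict_proba([[2.2]]))   # probabilities of class 0 and class 1 for 2.2 hours
print(model.predict([[2.2]]))         # predicted class (0 or 1)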
LOGISTIC REGRESSION CURVE
EXAMPLE
CONT.…
CLASSIFICATION
▪ Classification is the process of categorizing a given set of data into
classes; it can be performed on both structured and unstructured data.
▪ The process starts with predicting the class of given data points. The
classes are often referred to as targets, labels, or categories.
CLASSIFICATION TERMINOLOGIES
TYPES OF LEARNERS IN CLASSIFICATION
CLASSIFICATION ALGORITHMS
In machine learning, classification is a supervised learning concept
which basically categorizes a set of data into classes.
LOGISTIC REGRESSION
It is a classification algorithm in machine learning that uses one or
more independent variables to determine an outcome.
The outcome can take only two possible values.
LOGISTIC REGRESSION: APPLICATIONS
• Medical Diagnosis: Predicting if a patient has a disease based on
symptoms.
• Marketing: Determining if a customer will buy a product.
• Credit Risk Assessment: Evaluating the risk of default for loans.
• Image Segmentation: Identifying objects in images as part of
computer vision tasks.
LOGISTIC REGRESSION: ADVANTAGES
• Simplicity: Easy to implement and understand.
• Efficiency: Computationally inexpensive and performs well on small
to medium-sized datasets.
• Interpretability: Provides insight into the importance of features on
the outcome.
LOGISTIC REGRESSION: DISADVANTAGES
• Linear Assumption: Assumes a linear relationship between features
and outcomes, which may not hold in real-world scenarios.
• Limited Complexity: Not suitable for complex patterns in data.
• Sensitivity to Outliers: Influenced by outliers that skew the model's
predictions.
LINEAR VS LOGISTIC REGRESSION
LINEAR VS LOGISTIC REGRESSION
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO DECISION TREES
• A decision tree is a graphical representation of all the possible
solutions to a decision based on certain conditions.
INTRODUCTION TO DECISION TREES
• Decision Trees are hierarchical tree-like structures used in machine
learning for both classification and regression tasks.
• They operate by partitioning the feature space into smaller and
more manageable regions or segments, leading to a tree-like
structure where each internal node represents a decision based on
a feature attribute, and each leaf node represents a class label (in
classification) or a numerical value (in regression).
DECISION TREE TERMINOLOGIES
PROBLEMS THAT DECISION TREE CAN SOLVE
PROBLEMS THAT DECISION TREE CAN SOLVE
PROBLEMS THAT DECISION TREE CAN SOLVE
CART (CLASSIFICATION & REGRESSION TREES) ALGORITHM
• The algorithm is based on Classification and Regression Trees by
Breiman et al. (1984). A CART tree is a binary decision tree that is
constructed by splitting a node into two child nodes repeatedly,
beginning with the root node that contains the whole learning sample.
• The main elements of CART (and any decision tree algorithm) are:
• Rules for splitting data at a node based on the value of one variable;
• Stopping rules for deciding when a branch is terminal and can be split
no more; and
• Finally, a prediction for the target variable in each terminal node.
EXAMPLE
• Q: Which one among them should you pick first?
• Ans: Determine the attribute that best classifies the training
data.

Q: How do we choose the best attribute?
OR
Q: How does a tree decide where to split?
HOW DOES A TREE DECIDE WHERE TO SPLIT?
BUILD OUR DECISION TREE
(STEP 1: COMPUTE THE ENTROPY FOR THE DATASET)
BUILD OUR DECISION TREE
(STEP 2: WHICH NODE TO SELECT AS ROOT NODE)
BUILD OUR DECISION TREE
(STEP 2: WHICH NODE TO SELECT AS ROOT NODE)
BUILD OUR DECISION TREE
(STEP 2: WHICH NODE TO SELECT AS ROOT NODE)
BUILD OUR DECISION TREE
(STEP 2: WHICH NODE TO SELECT AS ROOT NODE)
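A minimal sketch of the entropy and information-gain calculations these steps walk through; the tiny weather-style dataset below is an assumption for illustration, not the slides' dataset.

# Entropy and information gain, as used to pick the root split.
from collections import Counter
from math import log2

def entropy(labels):
    # H = -sum(p * log2(p)) over the class proportions in the label list.
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(rows, attribute, labels):
    # Gain = entropy(parent) - weighted entropy of the children after splitting on attribute.
    total = len(labels)
    parent = entropy(labels)
    remainder = 0.0
    for value in set(row[attribute] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attribute] == value]
        remainder += (len(subset) / total) * entropy(subset)
    return parent - remainder

# Assumed "play tennis"-style example data.
rows = [
    {"outlook": "sunny", "windy": False}, {"outlook": "sunny", "windy": True},
    {"outlook": "overcast", "windy": False}, {"outlook": "rainy", "windy": False},
    {"outlook": "rainy", "windy": True},
]
labels = ["no", "no", "yes", "yes", "no"]

for attr in ("outlook", "windy"):
    print(attr, round(information_gain(rows, attr, labels), 3))
# The attribute with the highest information gain is chosen as the root node.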
THIS IS HOW YOUR COMPLETE TREE WILL LOOK
WHAT IS PRUNING?
• Pruning is the process of removing branches that provide little
predictive power, reducing the size of the decision tree and helping
to prevent overfitting.
DECISION TREES: WORKING (SUMMARY)
• At the root node, the decision tree selects the most significant
feature that best separates the data based on a specific criterion
(e.g., information gain, Gini impurity).
• Each subsequent internal node further partitions the data by asking
questions based on features, aiming to maximize information gain
or minimize impurity.
• The process continues until a stopping condition is met (e.g.,
maximum tree depth, minimum number of samples per leaf).
• The final leaf nodes provide the predicted output or class labels.
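A minimal sketch of this workflow with scikit-learn; the iris dataset, the entropy criterion, and max_depth=3 are assumptions for the example.

# Fit a small decision tree and print its rules.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

tree = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=42)
tree.fit(X_train, y_train)

print("test accuracy:", tree.score(X_test, y_test))
print(export_text(tree, feature_names=load_iris().feature_names))  # human-readable split rules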
DECISION TREES: APPLICATIONS
• Finance: Credit scoring, fraud detection.
• Healthcare: Diagnosis of diseases based on symptoms.
• Business: Market segmentation, predicting customer behavior.
• Robotics: Path planning, decision-making in autonomous systems.
DECISION TREES: ADVANTAGES
• Interpretability: Decision trees are easy to interpret and visualize,
making them accessible for non-technical stakeholders.
• Handling Non-linearity: Can capture nonlinear relationships in the
data without requiring complex transformations.
• Mixed Data Types: Can handle both numerical and categorical data
without much preprocessing.
DECISION TREES: DISADVANTAGES
• Overfitting: Decision trees can create overly complex models that
memorize noise in the data, leading to poor generalization on
unseen data.
• Instability: Small variations in the data may result in significantly
different trees.
• Biased Toward Dominant Classes: Tends to favor classes with more
instances, which might lead to bias against minority classes.
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO RANDOM FOREST
• Random Forest is an ensemble learning method in machine learning
that builds multiple decision trees and merges their predictions to
make more accurate and robust predictions.
• It operates by creating an ensemble or a collection of decision trees
and combines their outputs through voting (in classification) or
averaging (in regression) to provide the final prediction.
INTRODUCTION TO RANDOM FOREST
RANDOM FOREST: WORKING
• Random Forest builds multiple decision trees using a technique
called bagging (bootstrap aggregating).
• It randomly selects subsets of the training data with replacement
and constructs individual decision trees on these subsets.
• Each tree is built using a random subset of features at each node
split, reducing correlation among trees.
• During prediction, the Random Forest aggregates the predictions of
all trees to reach a final consensus prediction.
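A minimal sketch with scikit-learn's RandomForestClassifier; the dataset, the number of trees, and the max_features setting are assumptions for the example.

# 100 trees, each trained on a bootstrap sample, with a random subset of features per split.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

forest = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
forest.fit(X_train, y_train)

print("test accuracy:", forest.score(X_test, y_test))
# Built-in feature importance: how much each feature contributed across all trees.
ranked = sorted(zip(load_breast_cancer().feature_names, forest.feature_importances_),
                key=lambda pair: -pair[1])
for name, importance in ranked[:5]:
    print(f"{name}: {importance:.3f}")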
RANDOM FOREST: APPLICATIONS
• Predictive Modeling: Classification and regression tasks in various
domains, such as finance, healthcare, marketing.
• Anomaly Detection: Identifying unusual patterns or outliers in data.
• Feature Importance: Assessing the importance of different features
in predictive models.
RANDOM FOREST: ADVANTAGES
• Improved Accuracy: Random Forest generally provides higher
accuracy compared to individual decision trees by reducing
overfitting.
• Reduced Overfitting: By averaging or voting among multiple trees,
it mitigates overfitting issues seen in individual trees.
• Feature Importance: Provides a ranking of feature importance
based on how much they contribute to the predictive performance.
RANDOM FOREST: DISADVANTAGES
• Computational Complexity: Constructing multiple trees and
aggregating predictions can be computationally intensive, especially
for large datasets.
• Less Interpretability: Random Forests are less interpretable
compared to single decision trees.
• Resource Consumption: Requires more memory and computational
resources due to maintaining multiple trees.
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO SVM
• Support Vector Machine (SVM) is a supervised machine learning
algorithm used for classification, regression, and outlier detection
tasks.
• It's particularly effective for linearly separable data and can also
handle non-linear data by using appropriate kernel functions.
• SVM aims to find the optimal hyperplane that best separates
different classes in the feature space while maximizing the margin
between the classes.
SVM: WORKING
• An SVM kernel adds more dimensions to a low-dimensional space to
make it easier to segregate the data.
• It converts an inseparable problem into a separable one by adding
more dimensions using the kernel trick.
• In practice, a support vector machine is implemented using a kernel
function.
• The kernel trick helps build a more accurate classifier.
• Different kernels:
• Linear Kernel
• Polynomial Kernel
• Radial Basis Function Kernel
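A minimal sketch comparing these kernels with scikit-learn; the synthetic "two moons" data and the C/gamma values are assumptions for the example.

# Compare linear, polynomial, and RBF kernels on data that is not linearly separable.
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_moons(n_samples=300, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

for kernel in ("linear", "poly", "rbf"):
    clf = SVC(kernel=kernel, C=1.0, gamma="scale").fit(X_train, y_train)
    print(kernel, "test accuracy:", round(clf.score(X_test, y_test), 3))
# The non-linear kernels (poly, rbf) typically separate this curved data better than
# the linear kernel, which is the point of the kernel trick.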
SVM: WORKING
SVM: APPLICATIONS
• Image Classification: Recognizing objects in images.
• Text Classification: Spam filtering, sentiment analysis.
• Bioinformatics: Protein classification, disease detection.
• Finance: Credit scoring, stock market prediction.
SVM: ADVANTAGES
• Effective in High-Dimensional Spaces: Works well even in
high-dimensional spaces and with datasets where the number of
features is greater than the number of samples.
• Versatility: SVM can handle both linear and non-linear data by using
appropriate kernel functions.
• Regularization: Incorporates regularization parameters to control
overfitting.
SVM: DISADVANTAGES
• Sensitivity to Parameter Tuning: Performance depends on choosing
appropriate parameters and the kernel function.
• Computationally Intensive: Training can be time-consuming for
large datasets.
• Not Ideal for Large Datasets: SVM may not scale well to extremely
large datasets due to its computational complexity.
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO NAÏVE BAYES
• Naive Bayes is one of the simplest and most powerful classification
algorithms, based on Bayes’ Theorem with an assumption of
independence among predictors.
• The Naive Bayes model is easy to build and particularly useful for very
large data sets.
• There are two parts to this algorithm:
• Naïve
• Bayes
NAÏVE BAYES
• The Naïve Bayes classifier assumes that the presence of a feature in
a class is unrelated to any other feature.
• Even if these features depend on each other or upon the existence
of the other features, all of these properties independently
contribute to the probability that a particular fruit is an apple or an
orange or a banana and that is why it is known as “Naive”.
NAÏVE BAYES
• Bayes’ theorem describes the
probability of an event, based
on prior knowledge of
conditions that might be
related to the event.
BAYES THEOREM: EXAMPLE
• Problem: What is the probability that a card picked at random is a
King, given that it is a face card?
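Working this through Bayes’ theorem: P(King | Face) = P(Face | King) · P(King) / P(Face).
Every King is a face card, so P(Face | King) = 1; P(King) = 4/52, and P(Face) = 12/52
(Jack, Queen, and King in each of the four suits). Hence
P(King | Face) = (1 × 4/52) / (12/52) = 4/12 = 1/3.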
BAYES THEOREM: PROOF
NAÏVE BAYES: WORKING
NAÏVE BAYES: WORKING
NAÏVE BAYES: WORKING
NAÏVE BAYES: WORKING
NAÏVE BAYES: WORKING
NAÏVE BAYES: WORKING IN INDUSTRY
NAÏVE BAYES: TYPES
• Gaussian
• Multinomial
• Bernoulli
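A minimal sketch of the three variants with scikit-learn; the toy feature matrices below are assumptions for the example.

# Gaussian NB: continuous features assumed normally distributed within each class.
import numpy as np
from sklearn.naive_bayes import GaussianNB, MultinomialNB, BernoulliNB

y = np.array([0, 0, 1, 1])

X_cont = np.array([[180, 80], [175, 75], [160, 55], [155, 50]], dtype=float)
print(GaussianNB().fit(X_cont, y).predict([[170, 70]]))

# Multinomial NB: count features (e.g., word counts in a document).
X_counts = np.array([[3, 0, 1], [2, 0, 0], [0, 4, 2], [0, 3, 1]])
print(MultinomialNB().fit(X_counts, y).predict([[1, 0, 0]]))

# Bernoulli NB: binary features (e.g., word present / absent).
X_bin = np.array([[1, 0, 1], [1, 0, 0], [0, 1, 1], [0, 1, 0]])
print(BernoulliNB().fit(X_bin, y).predict([[1, 0, 1]]))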
Unit IV – Outline
• Feature Selection vs. feature extraction techniques,

• Linear Regression;

• Logistic Regression;

• Decision Trees;

• Random Forests;

• Support Vector Machines (SVM);

• Naive Bayes;

• K-Nearest Neighbors (KNN)


INTRODUCTION TO KNN
• K Nearest Neighbor is a simple algorithm that stores all the available
cases and classifies the new data or case based on a similarity measure.
Q: What does ‘k’ in KNN Algorithm represent?
Ans: k in KNN algorithm represents the number of nearest neighbor points
which are voting for the new test data’s class.

Note:
• If k=1, then test examples are given the same label as the closest example
in the training set.
• If k=3, the labels of the three closest training examples are checked and
the most common label (i.e., one occurring at least twice) is assigned,
and so on for larger values of k.
INTRODUCTION TO KNN
INTRODUCTION TO KNN: INDUCTIVE ASSUMPTION
• Similar inputs map to similar outputs.
• If this is not true, learning is impossible.
• If it is true, learning reduces to defining “similar”.
• Not all similarity measures are created equal:
• Predicting a person’s weight may depend on different attributes than
predicting their IQ.
BASIC KNN CLASSIFICATION
• Training method
• Save the training examples
• At prediction time
• Find the k training examples (x1,y1),…(xk, yk) that are closest to the test
example x
• Predict the most frequent class among those y’s
WHAT IS THE DECISION BOUNDARY?
Voronoi Diagram
BASIC KNN CLASSIFICATION
• Training method
• Save the training examples
• At prediction time
• Find the k training examples (x1,y1),…(xk, yk) that are closest to the test
example x
• Predict the most frequent class among those y’s

Improvements:
• Weighting examples from the neighbourhood
• Measuring “closeness”
• Finding “close” examples in a large training set quickly.
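A minimal sketch with scikit-learn's KNeighborsClassifier; the iris dataset, k=3, and the distance-weighted voting are assumptions for the example.

# "Training" just stores the examples (KNN is a lazy learner); prediction finds the
# k closest stored examples and takes a (weighted) vote among their labels.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)

# Scaling matters: "closeness" is measured by distance, so features should share a scale.
scaler = StandardScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

# weights="distance" implements the "weighting examples from the neighbourhood" improvement.
knn = KNeighborsClassifier(n_neighbors=3, weights="distance")
knn.fit(X_train, y_train)
print("test accuracy:", knn.score(X_test, y_test))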
KNN
KNN
KNN
APPLICATION OF KNN IN INDUSTRY
HOW TO CHOOSE THE VALUE OF K IN KNN ALGORITHM
KNN IS A LAZY LEARNER
KNN: EXAMPLE