Classification and Regression

MIT School of Computing

Department of Computer Science & Engineering

Third Year Engineering


23CSE3006 -MACHINE LEARNING
Class - T.Y. AIA (SEM-II)
Unit II: Supervised Machine Learning
Name Of the Course Coordinator:
Prof. Aarti Pimpalkar
Team Members
1. Prof. Dr. Nilima Kulkarni
2. Prof. Abhishek Das
3. Prof. Dattatray Kale
4. Prof. Nilesh Kulal

AY 2025-2026 SEM-II
INTRODUCTION TO CLASSIFICATION
Classification is used when you want to categorize data into different classes or groups. For
example, classifying emails as "spam" or "not spam" or predicting whether a patient has a
certain disease based on their symptoms. Here are some common types of classification
models:
1. Decision Tree Classification: Builds a tree where each node represents a test case for an
attribute, and branches represent possible outcomes.
2. Support Vector Machine (SVM): Finds the hyperplane that separates classes with the
maximum margin; it can be used for both classification and regression tasks.
3. K-Nearest Neighbor (KNN): Classifies data points based on the 'k' nearest neighbors using
feature similarity.
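To make the KNN idea above concrete, here is a minimal from-scratch sketch (the function name `knn_predict` and the toy data are illustrative, not from any particular library):

```python
from collections import Counter
import math

def knn_predict(train, labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training
    points, using Euclidean distance as the feature-similarity measure."""
    dists = sorted((math.dist(x, query), y) for x, y in zip(train, labels))
    top_k = [y for _, y in dists[:k]]
    return Counter(top_k).most_common(1)[0][0]

# Toy data: two clusters labelled "A" and "B"
train = [(1, 1), (1, 2), (2, 1), (8, 8), (8, 9), (9, 8)]
labels = ["A", "A", "A", "B", "B", "B"]
print(knn_predict(train, labels, (2, 2)))   # → A
print(knn_predict(train, labels, (9, 9)))   # → B
```

A query near the first cluster is assigned "A" because all three of its nearest neighbors carry that label.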
INTRODUCTION TO REGRESSION
Regression algorithms predict a continuous value based on input data. This is
used when you want to predict numbers such as income, height, weight, or
even the probability of something happening (like the chance of rain). Some of
the most common types of regression are:
Simple Linear Regression: Models the relationship between one independent
variable and a dependent variable using a straight line.
Multiple Linear Regression: Predicts a dependent variable based on two or
more independent variables.
Logistic Regression: Despite its name, it is used for classification; it fits an
S-shaped (sigmoid) curve to predict the probability of a categorical outcome.
CLASSIFICATION AND REGRESSION
• Classification and regression are the two primary tasks in supervised
machine learning.
• The key difference lies in the nature of the output: classification deals
with discrete outcomes (e.g., yes/no, categories), while regression handles
continuous values (e.g., price, temperature).
• Both approaches require labeled data for training but differ in their
objectives: classification aims to find decision boundaries that separate
classes, whereas regression focuses on finding the best-fitting line to
predict numerical outcomes.
• Understanding these distinctions helps in selecting the right approach
for a specific machine learning task.
CLASSIFICATION Vs REGRESSION
INTRODUCTION TO REGRESSION
Regression: Regression analysis is a predictive modelling technique that
investigates the relationship between a dependent variable and one or more
independent variables.
INTRODUCTION TO LINEAR REGRESSION
• Linear regression is like drawing a straight line through data points
to predict future outcomes or understand the relationship between
two variables.
• It's used when we want to find a relationship between one thing we
want to predict (called the dependent variable) and one or more
things we use to make that prediction (called independent variables
or predictors).
LINEAR REGRESSION: WORKING
• Imagine you have a bunch of points on a graph.
• Linear regression finds the best-fitting line that goes through those
points.
• Once you have this line, you can use it to make predictions about
future points or understand how changes in one variable might affect
another.
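The "best-fitting line" described above can be computed in closed form with ordinary least squares; a minimal sketch (the helper name `fit_line` and the toy data are our own, for illustration):

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = b0 + b1*x (one predictor)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Slope: covariance of x and y divided by variance of x
    b1 = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) \
         / sum((x - mean_x) ** 2 for x in xs)
    b0 = mean_y - b1 * mean_x        # line passes through the means
    return b0, b1

xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]                # data generated exactly from y = 2x
b0, b1 = fit_line(xs, ys)
print(b0, b1)                        # → 0.0 2.0
```

Once `b0` and `b1` are known, a prediction for a new point is simply `b0 + b1 * x`.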
LINEAR REGRESSION: APPLICATIONS
• Predicting house prices based on factors like size, number of rooms,
location, etc.
• Forecasting sales based on advertising spending, seasonality, or other
factors.
• Understanding how temperature affects ice cream sales.
LINEAR REGRESSION: ADVANTAGES
• Simplicity: Easy to understand and implement.
• Interpretability: Provides insights into the relationship between
variables.
• Speed: Quick to train and make predictions.
LINEAR REGRESSION: DISADVANTAGES
• Assumes Linearity: Assumes that the relationship between variables
is linear, which might not always be the case.
• Sensitivity to Outliers: Outliers (extreme data points) can
significantly impact the model's performance.
• Limited Complexity: Cannot capture complex relationships
between variables without modifications (like polynomial
regression).
INTRODUCTION TO LOGISTIC REGRESSION
• Logistic regression is a type of machine learning algorithm used for
binary classification tasks,
• which means it predicts the probability of an input belonging to one
of two categories.
• Although it is called "regression," it is actually used for classification.
INTRODUCTION TO LOGISTIC REGRESSION
Logistic Regression produces results in a binary format and is used to
predict the outcome of a categorical dependent variable. The outcome is
therefore discrete/categorical, such as yes/no, 0/1, or true/false.
LOGISTIC REGRESSION CURVE
(Figure: the S-shaped sigmoid curve, mapping any input to a probability between 0 and 1.)
LOGISTIC REGRESSION: WORKING
• It models the relationship between a dependent binary variable
(target) and one or more independent variables (features).
• Utilizes the logistic function (sigmoid) to transform predictions into
probabilities between 0 and 1.
• The model makes predictions by calculating the probability that an
input belongs to a particular class.
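A sketch of that prediction step follows; the weights and bias are made-up illustrative values, not a trained model:

```python
import math

def sigmoid(z):
    """Logistic function: squashes any real number into (0, 1)."""
    return 1 / (1 + math.exp(-z))

def predict_proba(features, weights, bias):
    """Probability that the input belongs to the positive class:
    a linear combination of the features passed through the sigmoid."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)

# Hypothetical weights: z = -1.0 + 1.5*2.0 - 0.5*1.0 = 1.5
p = predict_proba([2.0, 1.0], weights=[1.5, -0.5], bias=-1.0)
print(round(p, 3))                   # → 0.818
print(1 if p >= 0.5 else 0)          # threshold at 0.5 → class 1
```

The 0.5 threshold turns the probability into a hard class label; other thresholds can be used when the costs of the two error types differ.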
LOGISTIC REGRESSION: APPLICATIONS
• Medical Diagnosis: Predicting if a patient has a disease based on
symptoms.
• Marketing: Determining if a customer will buy a product.
• Credit Risk Assessment: Evaluating the risk of default for loans.
• Image Segmentation: Identifying objects in images as part of
computer vision tasks.
LOGISTIC REGRESSION: ADVANTAGES
• Simplicity: Easy to implement and understand.
• Efficiency: Computationally inexpensive and performs well on small
to medium-sized datasets.
• Interpretability: Provides insight into the importance of features on
the outcome.
LOGISTIC REGRESSION: DISADVANTAGES
• Linear Assumption: Assumes a linear relationship between the features
and the log-odds of the outcome, which may not hold in real-world scenarios.
• Limited Complexity: Not suitable for complex patterns in data.
• Sensitivity to Outliers: Influenced by outliers that skew the model's
predictions.
LINEAR Vs LOGISTIC REGRESSION
(Comparison slide: linear regression predicts a continuous value by fitting a straight line; logistic regression predicts the probability of a categorical outcome by fitting a sigmoid curve.)
MULTIPLE LINEAR REGRESSION
Linear regression is a statistical method used for predictive analysis. It models the relationship between a
dependent variable and a single independent variable by fitting a linear equation to the data.
Multiple Linear Regression extends this concept by modelling the relationship between a dependent variable and
two or more independent variables. This technique allows us to understand how multiple features collectively
affect the outcomes.
Steps for Multiple Linear Regression
The steps to perform multiple linear regression are similar to those of simple linear regression, but the
difference comes in the evaluation process. We can use it to find out which factor has the highest influence on
the predicted output and how the different variables are related to each other. The equation for multiple linear
regression is:
y = β0 + β1X1 + β2X2 + ⋯ + βnXn
Where:
• y is the dependent variable
• X1, X2, ⋯, Xn are the independent variables
• β0 is the intercept
• β1, β2, ⋯, βn are the slopes (coefficients)
The goal of the algorithm is to find the best fit line equation that can predict the values based on the independent
variables. A regression model learns from the dataset with known X and y values and uses it to predict y values for
unknown X.
MULTIPLE LINEAR REGRESSION
How Does It Work?
•The algorithm finds the best coefficients (β0, β1, β2, …) by minimizing the
sum of squared errors between the actual Y and the predicted Y.
•This method is called the Least Squares Method.
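For two predictors, the Least Squares Method can be carried out exactly by solving the normal equations (XᵀX)β = Xᵀy. A self-contained sketch using Cramer's rule (the helper names `det3` and `fit_mlr` are our own):

```python
def det3(m):
    """Determinant of a 3x3 matrix."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def fit_mlr(x1, x2, y):
    """Least-squares fit of y = b0 + b1*x1 + b2*x2 by solving the
    normal equations (X'X)b = X'y with Cramer's rule."""
    n = len(y)
    XtX = [[n,       sum(x1),                    sum(x2)],
           [sum(x1), sum(a * a for a in x1),     sum(a * b for a, b in zip(x1, x2))],
           [sum(x2), sum(a * b for a, b in zip(x1, x2)), sum(b * b for b in x2)]]
    Xty = [sum(y),
           sum(a * c for a, c in zip(x1, y)),
           sum(b * c for b, c in zip(x2, y))]
    D = det3(XtX)
    coeffs = []
    for j in range(3):                   # replace column j with X'y
        M = [row[:] for row in XtX]
        for i in range(3):
            M[i][j] = Xty[i]
        coeffs.append(det3(M) / D)
    return coeffs                        # [b0, b1, b2]

# Toy data generated exactly from y = 20 + 5*x1 + 3*x2
x1 = [1, 2, 3, 4, 5]                     # e.g., hours of study
x2 = [0, 1, 1, 2, 3]                     # e.g., practice tests taken
y = [20 + 5 * a + 3 * b for a, b in zip(x1, x2)]
print([round(c, 6) for c in fit_mlr(x1, x2, y)])   # → [20.0, 5.0, 3.0]
```

Because the toy data are generated exactly from the linear equation, the fit recovers the coefficients exactly; with noisy real data the same procedure returns the coefficients that minimize the sum of squared errors.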

Example Imagine we want to predict a student’s final exam score (Y) using two factors:
•X1 = Hours of Study
•X2 = Number of Practice Tests Taken

Suppose after training the model, we get this equation:

Score = 20 + 5X1 + 3X2
Interpretation:
•Intercept (20): Even if a student studies 0 hours and takes 0 practice tests, they may score
20 marks (base level).
•Coefficient (5): Each additional hour of study increases the score by 5 points.
•Coefficient (3): Each additional practice test increases the score by 3 points.
Prediction Example:
•If a student studies for 4 hours and takes 2 practice tests:
Score = 20 + (5×4) + (3×2) = 20 + 20 + 6 = 46
So, the predicted score = 46 marks.
(Figure: the red points are the actual data (Hours of Study, Practice Tests → Exam Score); the colored plane is the regression surface, the equivalent of the regression line in 3D.)
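Prediction with a fitted multiple regression model is just plugging values into the equation; a short sketch using the example's coefficients:

```python
def predict_score(hours, tests):
    """Fitted model from the example above: Score = 20 + 5*X1 + 3*X2."""
    return 20 + 5 * hours + 3 * tests

print(predict_score(4, 2))   # 4 hours of study, 2 practice tests → 46
print(predict_score(0, 0))   # base level (intercept only) → 20
```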
