
Logistic Regression

• Logistic regression is used for binary classification. It applies the sigmoid function to a linear combination of the independent variables and produces a probability value between 0 and 1.
Logistic Regression Classification
The main steps are (a minimal code sketch follows the bullets below):
1. Data preprocessing
2. Fitting logistic regression to the training set
3. Predicting the test results
4. Testing the accuracy of the result
5. Visualizing the result
• Logistic regression predicts the output of a categorical dependent variable, so the outcome is a discrete value.
• The outcome can be Yes or No, 0 or 1, True or False, etc., but instead of returning the exact values 0 and 1, the model gives a probability that lies between 0 and 1.
• Instead of fitting a straight regression line, logistic regression fits an “S”-shaped logistic curve whose outputs are bounded by the two extreme values (0 and 1).
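A minimal sketch of the five steps above, assuming scikit-learn; the dataset (X, y) and the parameter choices are hypothetical placeholders, not from the text:

```python
# Hypothetical example: the data and settings are placeholders for illustration.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# 1. Data preprocessing: build placeholder data, split, and scale the features
X = np.random.rand(200, 2)                      # placeholder features
y = (X[:, 0] + X[:, 1] > 1).astype(int)         # placeholder binary labels
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# 2. Fitting logistic regression to the training set
clf = LogisticRegression()
clf.fit(X_train, y_train)

# 3. Predicting the test results
y_pred = clf.predict(X_test)

# 4. Testing the accuracy of the result
print("Accuracy:", accuracy_score(y_test, y_pred))

# 5. Visualizing the result (e.g., plotting the decision boundary with matplotlib) is omitted here
```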
Sigmoid Function
• The sigmoid function is a mathematical function used to map predicted values to probabilities.
• It maps any real value to a value in the range 0 to 1. Since the output of logistic regression must lie between 0 and 1 and cannot go beyond this limit, the function forms an “S”-shaped curve.
• In logistic regression, we use a threshold value to turn probabilities into class labels: probabilities above the threshold are mapped to 1, and probabilities below the threshold are mapped to 0.
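A minimal sketch of the sigmoid and the thresholding step; the 0.5 cut-off is an assumed default, not fixed by the text:

```python
import numpy as np

def sigmoid(z):
    # Maps any real value z to a probability in (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-4.0, -1.0, 0.0, 1.0, 4.0])
probs = sigmoid(z)
labels = (probs >= 0.5).astype(int)   # assumed threshold of 0.5
print(probs)    # approximately [0.018 0.269 0.5 0.731 0.982]
print(labels)   # [0 0 1 1 1]
```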
Logistic Regression Equation
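The equation itself is not reproduced in the extracted text; the standard form, with θ the coefficient vector and x the input features, is

\[
h_\theta(x) = \frac{1}{1 + e^{-\theta^{T} x}}
\qquad\Longleftrightarrow\qquad
\log\frac{h_\theta(x)}{1 - h_\theta(x)} = \theta^{T} x .
\]

The left form is the sigmoid applied to a linear combination of the inputs; the right form shows that the log-odds are linear in x.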
Generalized Linear Models

Generalized Linear Models (GLMs) are a class of regression models that can describe a wide range of relationships between a response variable and one or more predictor variables.
Unlike traditional linear regression models, which assume a linear relationship between the response and predictor variables, GLMs allow for more flexible, non-linear relationships by using a different underlying statistical distribution for the response.
Features of GLMs
1.Flexibility: GLMs can model a wide range of relationships between
the response and predictor variables, including linear, logistic,
Poisson, and exponential relationships.
2.Model interpretability: GLMs provide a clear interpretation of the
relationship between the response and predictor variables, as well
as the effect of each predictor on the response.
3.Robustness: GLMs can be robust to outliers and other anomalies in
the data, as they allow for non-normal distributions of the response
variable.
4.Scalability: GLMs can be used for large datasets and complex
models, as they have efficient algorithms for model fitting and
prediction.
5.Ease of use: GLMs are relatively easy to understand and use, especially
compared to more complex models such as neural networks or decision
trees.
6.Hypothesis testing: GLMs allow for hypothesis testing and statistical
inference, which can be useful in many applications where it’s important to
understand the significance of relationships between variables.

7.Regularization: GLMs can be regularized to reduce overfitting and


improve model performance, using techniques such as Lasso, Ridge, or
Elastic Net regression.
8. Model comparison: GLMs can be compared using information criteria
such as AIC or BIC, which can help to choose the best model among a set
of alternatives.
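A minimal sketch of regularized logistic regression (a GLM with a Bernoulli response), assuming scikit-learn; the penalty settings shown correspond to Ridge (L2), Lasso (L1), and Elastic Net, and the data are placeholders:

```python
# Hypothetical example: X and y are synthetic placeholders, not from the text.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 5))
y = (X @ np.array([1.5, -2.0, 0.0, 0.0, 0.5]) + rng.normal(size=300) > 0).astype(int)

# Ridge (L2) penalty -- the default for LogisticRegression
ridge = LogisticRegression(penalty="l2", C=1.0).fit(X, y)

# Lasso (L1) penalty -- needs a solver that supports L1, e.g. liblinear or saga
lasso = LogisticRegression(penalty="l1", solver="liblinear", C=1.0).fit(X, y)

# Elastic Net -- mixes L1 and L2; only the saga solver supports it
enet = LogisticRegression(penalty="elasticnet", solver="saga",
                          l1_ratio=0.5, C=1.0, max_iter=5000).fit(X, y)

print(ridge.coef_, lasso.coef_, enet.coef_, sep="\n")
```

Smaller values of C mean stronger regularization; the L1-based penalties tend to drive uninformative coefficients toward exactly zero.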
Some of the disadvantages of GLMs

• Assumptions: GLMs make certain assumptions about the distribution of the response variable, and these assumptions may not always hold.
• Model specification: Specifying the correct underlying statistical distribution for a GLM can be challenging, and incorrect specification can result in biased or incorrect predictions.
• Overfitting: Like other regression models, GLMs can be prone to overfitting if the model is too complex or has too many predictor variables.
• Limited flexibility: While GLMs are more flexible than traditional linear regression models, they may still not be able to capture more complex relationships between variables, such as interactions or non-linear effects.
• Data requirements: GLMs require a sufficient amount of data to estimate model parameters and make accurate predictions, and may not perform well with small or imbalanced datasets.
• Model assumptions: GLMs rely on certain assumptions about the distribution of the response variable and the relationship between the response and predictor variables, and violation of these assumptions can lead to biased or incorrect predictions.
• Overall, GLMs are a powerful and flexible tool for modeling relationships between response and predictor variables, and are widely used in many fields, including finance, marketing, and epidemiology.
Generalized linear models (GLMs) explain how linear regression and logistic regression are members of a much broader class of models. GLMs can be used to construct models for regression and classification problems by choosing the distribution that best describes the data or labels used to train the model.
Below are some types of data and the corresponding distributions used to construct the model (a fitting sketch follows this list):
1. Binary classification data – Bernoulli distribution
2. Real-valued data – Gaussian distribution
3. Count data – Poisson distribution
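A minimal sketch of fitting GLMs with these three distributions, assuming the statsmodels library; the arrays X, y_binary, y_real, and y_count are placeholder data:

```python
# Hypothetical example: the data are synthetic placeholders for illustration.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
X = sm.add_constant(rng.normal(size=(100, 2)))                   # add intercept column
y_binary = rng.integers(0, 2, size=100)                          # Bernoulli-type labels
y_real = X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=100)   # Gaussian response
y_count = rng.poisson(lam=3.0, size=100)                         # count response

# Binary labels -> Binomial/Bernoulli family (this is logistic regression)
logit_fit = sm.GLM(y_binary, X, family=sm.families.Binomial()).fit()

# Real-valued response -> Gaussian family (this is ordinary linear regression)
linear_fit = sm.GLM(y_real, X, family=sm.families.Gaussian()).fit()

# Count data -> Poisson family
poisson_fit = sm.GLM(y_count, X, family=sm.families.Poisson()).fit()

# AIC is available on each fit, useful for the model comparison mentioned above
print(logit_fit.aic, linear_fit.aic, poisson_fit.aic)
```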
• To understand GLMs, we begin by defining exponential families. Exponential families are a class of distributions whose probability density function (PDF) can be written in the following form:
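The form itself is not reproduced in the extracted text; the standard exponential-family form is

\[
p(y;\eta) = b(y)\,\exp\!\big(\eta^{T} T(y) - a(\eta)\big),
\]

where η is the natural parameter, T(y) the sufficient statistic, a(η) the log-partition function, and b(y) the base measure.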
Linear Regression Model: To show that linear regression is a special case of the GLMs, we consider output labels that are continuous values and therefore follow a Gaussian distribution. So, we have:
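The three equations are not reproduced in the extracted text; in the standard GLM formulation of this argument they are (with θ the parameter vector, x the inputs, and σ² treated as fixed):

\[
y \mid x;\theta \sim \mathcal{N}(\mu, \sigma^{2}),
\qquad
h_\theta(x) = \mathbb{E}[\,y \mid x;\theta\,] = \mu,
\qquad
\eta = \theta^{T} x .
\]

Since the Gaussian mean equals its natural parameter (μ = η), these reduce to h_θ(x) = θᵀx, i.e. ordinary linear regression.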
The first equation above corresponds to the first assumption, that the output labels (or target variables) are members of an exponential family.
The second equation corresponds to the assumption that the hypothesis equals the expected value (mean) of the distribution.
The third equation corresponds to the assumption that the natural parameter and the input features follow a linear relationship.
Logistic Regression Model: To show that logistic regression is a special case of the GLMs, we consider output labels that are binary valued and therefore follow a Bernoulli distribution. So, we have:
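The equations are not reproduced in the extracted text; in the standard formulation (with φ the Bernoulli mean) they are:

\[
y \mid x;\theta \sim \mathrm{Bernoulli}(\phi),
\qquad
h_\theta(x) = \mathbb{E}[\,y \mid x;\theta\,] = \phi,
\qquad
\eta = \theta^{T} x,
\quad\text{where}\quad
\eta = \log\frac{\phi}{1-\phi} .
\]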
From the third assumption, it is proven that:
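The stated result is not reproduced in the extracted text; inverting η = log(φ/(1−φ)) gives the sigmoid, which is exactly the logistic regression hypothesis:

\[
\phi = \frac{1}{1 + e^{-\eta}} = \frac{1}{1 + e^{-\theta^{T} x}} = h_\theta(x).
\]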
The function that maps the natural parameter to the distribution's mean (the canonical parameter) is known as the canonical response function (here, the sigmoid function), and its inverse is known as the canonical link function (here, the logit).

Therefore, using the three assumptions mentioned above, it can be shown that logistic regression and linear regression belong to a much larger family of models known as GLMs.
