Logistic Regression
Outline
• Logistic Regression Classification
• Hypothesis Function
• Cost Function
• Gradient Descent
Binary outcomes are common and important
• The patient survives the operation, or does not.
• The accused is convicted, or is not.
• The customer makes a purchase, or does not.
• The marriage lasts at least five years, or does not.
• The student graduates, or does not.
Examples: Categorical Response Variables
• Whether or not a person smokes (response Y is a Binary Response): Smoker / Non-smoker
• Success of a medical treatment (response Y is a Binary Response): Survives / Dies
• Opinion poll responses (response Y is an Ordinal Response): Agree / Neutral / Disagree
Difference between linear regression and logistic regression
[Figure: linear regression fits a straight line to the data, while logistic regression fits an S-shaped sigmoid curve; source: https://www.kaggle.com/]
Sigmoid Function
• Sigmoid (logistic) activation function: $g(z) = \frac{1}{1+e^{-z}}$
• [Figure: a single neuron maps the input $x$ through the weights ("parameters" $\theta$) and a bias unit to the output $h_\theta(x) = g(\theta^T x)$]
Slide credit: Andrew Ng
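As a minimal Python sketch of this activation (the names `sigmoid` and `h` are ours, not from the slide):

```python
import numpy as np

def sigmoid(z):
    """Logistic (sigmoid) function: squashes any real z into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def h(theta, x):
    """Hypothesis h_theta(x) = g(theta^T x): sigmoid of the pre-activation."""
    return sigmoid(np.dot(theta, x))
```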
Learning a Logistic Regression Model
• How to learn a logistic regression model $h_\theta(x) = \frac{1}{1+e^{-\theta^T x}}$, where $\theta = [\theta_0, \dots, \theta_m]^T$ and $x = [x_0, \dots, x_m]^T$?
• By minimizing the following cost function:
$$\mathrm{Cost}(h_\theta(x), y) = -y \log(h_\theta(x)) - (1-y)\log(1 - h_\theta(x))$$
• That is:
$$\min_\theta \; \frac{1}{n} \sum_{i=1}^{n} \mathrm{Cost}(h_\theta(x^{(i)}), y^{(i)})$$
$$\equiv \; \min_\theta \; \frac{1}{n} \sum_{i=1}^{n} \left[ -y^{(i)} \log \frac{1}{1+e^{-\theta^T x^{(i)}}} - (1 - y^{(i)}) \log\left(1 - \frac{1}{1+e^{-\theta^T x^{(i)}}}\right) \right] \quad \text{(cost function)}$$
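A hedged sketch of this cost in Python (the names `cost`, `X`, `y` are illustrative; `X` is assumed to carry the constant feature x₀ = 1 in its first column):

```python
import numpy as np

def cost(theta, X, y):
    """Average cross-entropy cost (1/n) * sum_i Cost(h_theta(x_i), y_i)."""
    h = 1.0 / (1.0 + np.exp(-X @ theta))   # h_theta(x) for every row of X
    return np.mean(-y * np.log(h) - (1.0 - y) * np.log(1.0 - h))
```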
Gradient Descent For Logistic Regression
• Outline:
  • Have cost function $J(\theta)$, where $\theta = [\theta_0, \dots, \theta_m]^T$
  • Start off with some guesses for $\theta$
    • It does not really matter what values you start off with, but a common choice is to set them all initially to zero
  • Repeat until convergence {
      $\theta_j := \theta_j - \alpha \frac{\partial}{\partial \theta_j} J(\theta)$   (the last term is the partial derivative)
    }   Note: update all $\theta_j$ simultaneously
  • $\alpha$ is the learning rate, which controls how big a step we take when we update $\theta$
Gradient Descent For Logistic Regression
• Applying the partial derivative to the cost function gives the final update formula:
• Repeat until convergence {
    $\theta_j := \theta_j - \alpha \sum_{i=1}^{n} \left( \frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)} \right) x_j^{(i)}$
  }
• Note: here the $\frac{1}{n}$ averaging factor from the cost function is absorbed into the learning rate $\alpha$, matching the worked example below
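A minimal sketch of this update rule (the function `gradient_descent` and its arguments are our naming; like the slides, it uses the plain sum rather than the 1/n average, and a fixed iteration count stands in for a convergence test):

```python
import numpy as np

def gradient_descent(X, y, alpha=0.5, iters=100):
    """Repeat: theta_j := theta_j - alpha * sum_i (h_theta(x_i) - y_i) * x_ij."""
    theta = np.zeros(X.shape[1])                 # common choice: start at all zeros
    for _ in range(iters):
        h = 1.0 / (1.0 + np.exp(-X @ theta))     # predictions for all n examples
        theta = theta - alpha * (X.T @ (h - y))  # updates all theta_j simultaneously
    return theta
```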
Inference After Learning
• After learning the parameters $\theta = [\theta_0, \dots, \theta_m]^T$, we can predict the output of any new unseen $x$ as follows:
  • Predict $y = 1$ if $h_\theta(x) = \frac{1}{1+e^{-\theta^T x}} \ge 0.5$
  • Predict $y = 0$ if $h_\theta(x) = \frac{1}{1+e^{-\theta^T x}} < 0.5$
Visualization of weights, bias, activation function
• [Figure: the output range is determined by g(·); the bias b only changes the position of the hyperplane]
Slide credit: Hugo Larochelle
Activation - sigmoid
• Squashes the neuron's pre-activation between 0 and 1: $g(z) = \frac{1}{1+e^{-z}}$
• Always positive
• Bounded
• Strictly increasing
Slide credit: Hugo Larochelle
A Concrete Example: The Training Phase
• Let us apply logistic regression to the spam email recognition problem, assuming α = 0.5 and starting with θ = [0, 0, 0, 0, 0, 0]
• There are 5 words (or features), and we define 6 parameters, θ = [θ₀, θ₁, θ₂, θ₃, θ₄, θ₅]; the first one, θ₀, is the intercept
• To account for the intercept, every email also gets a constant feature x₀ = 1, so the feature vector is x = [x₀, x₁, x₂, x₃, x₄, x₅]

          x₀ = 1   x₁ = and   x₂ = vaccine   x₃ = the   x₄ = of   x₅ = nigeria   y
Email a   1        1          1              0          1         1              1
Email b   1        0          0              1          1         0              0
Email c   1        0          1              1          0         0              1
Email d   1        1          0              0          1         0              0
Email e   1        1          0              1          0         1              1
Email f   1        1          0              1          1         0              0

A Training Dataset
• A 1 entails that a word (e.g., "and") is present in an email (e.g., "Email a"); a 0 entails that a word is absent (e.g., "and" in "Email b")
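To make the walkthrough reproducible, here is the same table as NumPy arrays (a sketch; the names `X`, `y`, `theta`, `alpha` are ours). Later snippets reuse these definitions:

```python
import numpy as np

# Columns: x0 (intercept), and, vaccine, the, of, nigeria
X = np.array([
    [1, 1, 1, 0, 1, 1],   # Email a
    [1, 0, 0, 1, 1, 0],   # Email b
    [1, 0, 1, 1, 0, 0],   # Email c
    [1, 1, 0, 0, 1, 0],   # Email d
    [1, 1, 0, 1, 0, 1],   # Email e
    [1, 1, 0, 1, 1, 0],   # Email f
])
y = np.array([1, 0, 1, 0, 1, 0])   # 1 = spam, 0 = not spam
theta = np.zeros(6)                # starting point [0, 0, 0, 0, 0, 0]
alpha = 0.5                        # learning rate from the slides
```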
Recap: Gradient Descent For Logistic Regression
• Repeat until convergence {
    $\theta_j := \theta_j - \alpha \sum_{i=1}^{n} \left( \frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)} \right) x_j^{(i)}$
  }
• First, let us calculate the factor $\frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)}$ for every example in our training dataset
A Concrete Example: The Training Phase
• Let us apply logistic regression to the spam email recognition problem, assuming α = 0.5 and starting with θ = [0, 0, 0, 0, 0, 0]

x              y   θᵀx                             h_θ(x) - y
[1,1,1,0,1,1]  1   [0,0,0,0,0,0]×[1,1,1,0,1,1]=0   -0.5
[1,0,0,1,1,0]  0   [0,0,0,0,0,0]×[1,0,0,1,1,0]=0   0.5
[1,0,1,1,0,0]  1   [0,0,0,0,0,0]×[1,0,1,1,0,0]=0   -0.5
[1,1,0,0,1,0]  0   [0,0,0,0,0,0]×[1,1,0,0,1,0]=0   0.5
[1,1,0,1,0,1]  1   [0,0,0,0,0,0]×[1,1,0,1,0,1]=0   -0.5
[1,1,0,1,1,0]  0   [0,0,0,0,0,0]×[1,1,0,1,1,0]=0   0.5

(Since θ = 0, θᵀx = 0 and h_θ(x) = 1/(1+e⁰) = 0.5 for every example.)
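Reusing `X`, `y`, and `theta` from the sketch above, this factor can be checked for all six emails at once:

```python
h = 1.0 / (1.0 + np.exp(-X @ theta))   # theta = 0, so theta^T x = 0 and h = 0.5 everywhere
print(h - y)                           # [-0.5  0.5 -0.5  0.5 -0.5  0.5]
```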
Recap: Gradient Descent For Logistic Regression
• Second, let us calculate $\left( \frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)} \right) x_j^{(i)}$ for every example in our training dataset and for every θⱼ, where j is between 0 and m
A Concrete Example: The Training Phase
• Let us apply logistic regression to the spam email recognition problem, assuming α = 0.5 and starting with θ = [0, 0, 0, 0, 0, 0]

x              y   θᵀx                             (h_θ(x) - y)·x₀
[1,1,1,0,1,1]  1   [0,0,0,0,0,0]×[1,1,1,0,1,1]=0   (0.5 - 1) × 1 = -0.5
[1,0,0,1,1,0]  0   [0,0,0,0,0,0]×[1,0,0,1,1,0]=0   (0.5 - 0) × 1 = 0.5
[1,0,1,1,0,0]  1   [0,0,0,0,0,0]×[1,0,1,1,0,0]=0   (0.5 - 1) × 1 = -0.5
[1,1,0,0,1,0]  0   [0,0,0,0,0,0]×[1,1,0,0,1,0]=0   (0.5 - 0) × 1 = 0.5
[1,1,0,1,0,1]  1   [0,0,0,0,0,0]×[1,1,0,1,0,1]=0   (0.5 - 1) × 1 = -0.5
[1,1,0,1,1,0]  0   [0,0,0,0,0,0]×[1,1,0,1,1,0]=0   (0.5 - 0) × 1 = 0.5

(x₀ = 1 for every email, so the last column starts with the j = 0 case.)
Recap: Gradient Descent For Logistic Regression
• Third, let us compute every new θⱼ using $\theta_j := \theta_j - \alpha \sum_{i=1}^{n} \left( \frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)} \right) x_j^{(i)}$
A Concrete Example: The Training Phase
• From the table above, $\sum_{i=1}^{6} \left( \frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)} \right) x_0^{(i)} = -0.5 + 0.5 - 0.5 + 0.5 - 0.5 + 0.5 = 0$
• Then, new θ₀ = old θ₀ − α × 0 = 0 − 0.5 × 0 = 0
• New parameter vector: θ = [0, θ₁, θ₂, θ₃, θ₄, θ₅]
A Concrete Example: The Training Phase
• Now for θ₁ (as before, θᵀx = 0, so h_θ(x) = 0.5 for every example):

x              y   (h_θ(x) - y)·x₁
[1,1,1,0,1,1]  1   (0.5 - 1) × 1 = -0.5
[1,0,0,1,1,0]  0   (0.5 - 0) × 0 = 0
[1,0,1,1,0,0]  1   (0.5 - 1) × 0 = 0
[1,1,0,0,1,0]  0   (0.5 - 0) × 1 = 0.5
[1,1,0,1,0,1]  1   (0.5 - 1) × 1 = -0.5
[1,1,0,1,1,0]  0   (0.5 - 0) × 1 = 0.5

• The sum is 0, so new θ₁ = 0 − 0.5 × 0 = 0
• New parameter vector: θ = [0, 0, θ₂, θ₃, θ₄, θ₅]
A Concrete Example: The Training Phase
• Now for θ₂:

x              y   (h_θ(x) - y)·x₂
[1,1,1,0,1,1]  1   (0.5 - 1) × 1 = -0.5
[1,0,0,1,1,0]  0   (0.5 - 0) × 0 = 0
[1,0,1,1,0,0]  1   (0.5 - 1) × 1 = -0.5
[1,1,0,0,1,0]  0   (0.5 - 0) × 0 = 0
[1,1,0,1,0,1]  1   (0.5 - 1) × 0 = 0
[1,1,0,1,1,0]  0   (0.5 - 0) × 0 = 0

• The sum is −1, so new θ₂ = 0 − 0.5 × (−1) = 0.5
• New parameter vector: θ = [0, 0, 0.5, θ₃, θ₄, θ₅]
A Concrete Example: The Training Phase
• Now for θ₃:

x              y   (h_θ(x) - y)·x₃
[1,1,1,0,1,1]  1   (0.5 - 1) × 0 = 0
[1,0,0,1,1,0]  0   (0.5 - 0) × 1 = 0.5
[1,0,1,1,0,0]  1   (0.5 - 1) × 1 = -0.5
[1,1,0,0,1,0]  0   (0.5 - 0) × 0 = 0
[1,1,0,1,0,1]  1   (0.5 - 1) × 1 = -0.5
[1,1,0,1,1,0]  0   (0.5 - 0) × 1 = 0.5

• The sum is 0, so new θ₃ = 0 − 0.5 × 0 = 0
• New parameter vector: θ = [0, 0, 0.5, 0, θ₄, θ₅]
A Concrete Example: The Training Phase
• Now for θ₄:

x              y   (h_θ(x) - y)·x₄
[1,1,1,0,1,1]  1   (0.5 - 1) × 1 = -0.5
[1,0,0,1,1,0]  0   (0.5 - 0) × 1 = 0.5
[1,0,1,1,0,0]  1   (0.5 - 1) × 0 = 0
[1,1,0,0,1,0]  0   (0.5 - 0) × 1 = 0.5
[1,1,0,1,0,1]  1   (0.5 - 1) × 0 = 0
[1,1,0,1,1,0]  0   (0.5 - 0) × 1 = 0.5

• The sum is 1, so new θ₄ = 0 − 0.5 × 1 = −0.5
• New parameter vector: θ = [0, 0, 0.5, 0, −0.5, θ₅]
A Concrete Example: The Training Phase
• Finally, for θ₅:

x              y   (h_θ(x) - y)·x₅
[1,1,1,0,1,1]  1   (0.5 - 1) × 1 = -0.5
[1,0,0,1,1,0]  0   (0.5 - 0) × 0 = 0
[1,0,1,1,0,0]  1   (0.5 - 1) × 0 = 0
[1,1,0,0,1,0]  0   (0.5 - 0) × 0 = 0
[1,1,0,1,0,1]  1   (0.5 - 1) × 1 = -0.5
[1,1,0,1,1,0]  0   (0.5 - 0) × 0 = 0

• The sum is −1, so new θ₅ = 0 − 0.5 × (−1) = 0.5
• New parameter vector: θ = [0, 0, 0.5, 0, −0.5, 0.5]
• This completes the first iteration of gradient descent
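Combining the three steps, a single vectorized gradient descent step (reusing `X`, `y`, `theta`, `alpha` from the earlier sketch) reproduces this parameter vector:

```python
h = 1.0 / (1.0 + np.exp(-X @ theta))   # step 1: h_theta(x) = 0.5 for every example
grad = X.T @ (h - y)                   # step 2: per-parameter sums [0, 0, -1, 0, 1, -1]
theta = theta - alpha * grad           # step 3: update all theta_j simultaneously
print(theta)                           # [ 0.   0.   0.5  0.  -0.5  0.5]
```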
A Concrete Example: Testing
• Let us now test logistic regression on the spam email recognition problem, using the just-learnt θ = [0, 0, 0.5, 0, −0.5, 0.5]
• Note: Testing is typically done over a portion of the dataset that is not used during training, but rather kept aside to assess the accuracy of the algorithm's predictions
• In this example, we will test over all the examples that we used during training, just for illustrative purposes
A Concrete Example: Testing
• Let us test logistic regression on the spam email recognition problem, using the just-learnt θ = [0, 0, 0.5, 0, −0.5, 0.5]
• Decision rule: if h_θ(x) ≥ 0.5, y′ = 1; else y′ = 0

x              y   θᵀx                                       h_θ(x) = 1/(1+e^(−θᵀx))   Predicted class (y′)
[1,1,1,0,1,1]  1   [0,0,0.5,0,-0.5,0.5]×[1,1,1,0,1,1]=0.5    0.622459331               1
[1,0,0,1,1,0]  0   [0,0,0.5,0,-0.5,0.5]×[1,0,0,1,1,0]=-0.5   0.377540669               0
[1,0,1,1,0,0]  1   [0,0,0.5,0,-0.5,0.5]×[1,0,1,1,0,0]=0.5    0.622459331               1
[1,1,0,0,1,0]  0   [0,0,0.5,0,-0.5,0.5]×[1,1,0,0,1,0]=-0.5   0.377540669               0
[1,1,0,1,0,1]  1   [0,0,0.5,0,-0.5,0.5]×[1,1,0,1,0,1]=0.5    0.622459331               1
[1,1,0,1,1,0]  0   [0,0,0.5,0,-0.5,0.5]×[1,1,0,1,1,0]=-0.5   0.377540669               0

• No mispredictions!
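The same check in code, reusing `X` and `y` from the earlier sketch:

```python
theta = np.array([0, 0, 0.5, 0, -0.5, 0.5])   # the just-learnt parameters
h = 1.0 / (1.0 + np.exp(-X @ theta))          # [0.6225 0.3775 0.6225 0.3775 0.6225 0.3775]
y_pred = (h >= 0.5).astype(int)               # decision rule: y' = 1 iff h >= 0.5
print(np.array_equal(y_pred, y))              # True -> no mispredictions
```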
A Concrete Example: Inference
• Let us infer whether a given new email, say, k = [1, 0, 1, 0, 0, 1], is spam or not, using logistic regression with the just-learnt parameter vector θ = [0, 0, 0.5, 0, −0.5, 0.5]

          x₀ = 1   x₁ = and   x₂ = vaccine   x₃ = the   x₄ = of   x₅ = nigeria   y
Email a   1        1          1              0          1         1              1
Email b   1        0          0              1          1         0              0
Email c   1        0          1              1          0         0              1
Email d   1        1          0              0          1         0              0
Email e   1        1          0              1          0         1              1
Email f   1        1          0              1          1         0              0
Email k   1        0          1              0          0         1              ?

Our Training Dataset, with the new Email k appended
A Concrete Example: Inference
• Computing the score for Email k:
$$\theta^T x = [0, 0, 0.5, 0, -0.5, 0.5] \times [1, 0, 1, 0, 0, 1] = 0.5 \times 1 + 0.5 \times 1 = 1$$
• Applying the sigmoid: $h_\theta(x) = \frac{1}{1+e^{-1}} \approx 0.73 \ge 0.5$
⇒ Class 1 (i.e., Spam)
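The same inference in code (a sketch; `x_k`, `z`, and `h_k` are our names):

```python
x_k = np.array([1, 0, 1, 0, 0, 1])                       # Email k, with the constant x0 = 1
z = np.dot(np.array([0, 0, 0.5, 0, -0.5, 0.5]), x_k)     # theta^T x = 0.5 + 0.5 = 1
h_k = 1.0 / (1.0 + np.exp(-z))                           # sigmoid(1) ~ 0.731 >= 0.5
print(1 if h_k >= 0.5 else 0)                            # 1 -> Class 1 (spam)
```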
A Concrete Example: Inference
• Email k is therefore labeled as spam (y = 1):

          x₀ = 1   x₁ = and   x₂ = vaccine   x₃ = the   x₄ = of   x₅ = nigeria   y
Email a   1        1          1              0          1         1              1
Email b   1        0          0              1          1         0              0
Email c   1        0          1              1          0         0              1
Email d   1        1          0              0          1         0              0
Email e   1        1          0              1          0         1              1
Email f   1        1          0              1          1         0              0
Email k   1        0          1              0          0         1              1

• Somewhat interesting, since the model considered "vaccine" and "nigeria" indicative of spam!
Logistic Regression
Sources:
https://www.kaggle.com/
http://research.cs.tamu.edu
http://web.iitd.ac.in
https://www3.nd.edu/