LOGISTIC REGRESSION
Spring 2023 CS6431 Natural Language Processing
B1:
Speech and Language Processing (Third Edition draft
– Jan 2022)
Daniel Jurafsky, James H. Martin
Credits
1. B1
Assignment
Read:
B1: Chapter 5
Problems:
Generative and Discriminative Classifiers
Generative model: models how a document's features would be generated if it belonged to a particular class, i.e. P(d|c) and P(c)
E.g. Naïve Bayes
Discriminative model: directly computes P(c|d), giving more importance to features that are better at discriminating between the output classes
Logistic Regression
Logistic Regressor
A single-layer neural network with Sigmoid/SoftMax as the activation function
Cross-entropy loss function
Uses stochastic gradient descent for optimization
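As a quick illustration (not from the slides), here is a minimal scikit-learn sketch of such a classifier on made-up toy feature vectors; note that LogisticRegression's default solver is L-BFGS rather than SGD, which is sketched separately later.

# Minimal sketch on made-up data: a binary logistic regression classifier.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[3.0, 2.0, 1.0],      # rows = documents, columns = features
              [0.0, 4.0, 2.0],
              [2.0, 0.0, 0.0],
              [1.0, 3.0, 3.0]])
y = np.array([1, 0, 1, 0])          # binary class labels

clf = LogisticRegression()          # sigmoid output, cross-entropy loss
clf.fit(X, y)                       # learns w and b from the training data
print(clf.coef_, clf.intercept_)    # learned weights w and bias b
print(clf.predict_proba(X[:1]))     # [P(y=0|x), P(y=1|x)] for the first document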
Sigmoid/SoftMax function
Inputs, weights, and bias: z = Σᵢ wᵢxᵢ + b
Or, in vector form: z = w·x + b
Sigmoid (logistic) function: σ(z) = 1 / (1 + e^(−z))
Output (in case of binary classification): P(y=1|x) = σ(w·x + b), P(y=0|x) = 1 − σ(w·x + b)
For sigmoid: let the corresponding six weights be w = [2.5, -5.0, -1.2, 0.5, 2.0, 0.7] and b = 0.1
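A small numpy sketch of this computation; the slide gives only the weights and the bias, so the feature vector x below is assumed for illustration (it mirrors the textbook's running sentiment example).

import numpy as np

def sigmoid(z):
    """Logistic function: squashes any real z into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

w = np.array([2.5, -5.0, -1.2, 0.5, 2.0, 0.7])   # weights from the slide
b = 0.1                                          # bias from the slide

# Illustrative feature vector (assumed here; the slide does not list it).
x = np.array([3.0, 2.0, 1.0, 3.0, 0.0, 4.19])

z = np.dot(w, x) + b                # z = w·x + b
print(z, sigmoid(z))                # P(y=1|x) = sigma(z); here sigma(0.833) ≈ 0.70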
Period disambiguation
What sort of features would you suggest?
Hand-crafted features
Feature interactions
Feature templates
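A hypothetical sketch of hand-crafted features and a feature template for period disambiguation (the feature names and the abbreviation list below are illustrative choices, not from the slides):

import re

def period_features(tokens, i):
    """Hand-crafted features (illustrative) for deciding whether the period
    at position i ends a sentence or marks an abbreviation."""
    prev_word = tokens[i - 1] if i > 0 else ""
    next_word = tokens[i + 1] if i + 1 < len(tokens) else ""
    return {
        "prev_is_abbrev": prev_word.lower() in {"dr", "mr", "mrs", "prof", "etc"},
        "prev_is_single_upper": bool(re.fullmatch(r"[A-Z]", prev_word)),   # initials like "J."
        "next_is_capitalized": next_word[:1].isupper(),
        "next_is_lowercase": next_word[:1].islower(),
        # feature template: one binary feature per (prev_word, next_word) pair
        f"pair={prev_word.lower()}_{next_word.lower()}": True,
    }

print(period_features(["He", "met", "Dr", ".", "Smith"], 3))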
Representation Learning
Scaling Input Features
Z-normalization: x′ = (x − μ) / σ
Zero mean
Unit variance
Or, simply rescale each feature into a fixed range such as [−1, +1]
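A short numpy sketch of both options on made-up feature values:

import numpy as np

X = np.array([[3.0, 200.0],
              [1.0,  50.0],
              [2.0, 125.0]])          # two features on very different scales

# Z-normalization: zero mean, unit variance per feature (column).
X_z = (X - X.mean(axis=0)) / X.std(axis=0)

# Alternative: rescale each feature into the range [-1, +1].
X_min, X_max = X.min(axis=0), X.max(axis=0)
X_scaled = 2 * (X - X_min) / (X_max - X_min) - 1

print(X_z)
print(X_scaled)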
Logistic Regression vs. Naïve Bayes
Naïve Bayes has an overly strong conditional independence assumption
Problem with correlated features
Logistic Regression is much more robust to correlated features
Naïve Bayes pluses
Works well on small datasets
Easy to implement and fast to train (no optimization step)
Multinomial logistic regression
Or SoftMax Regression
Exactly one among more than two classes can be true for each input
Both predicted output ŷ and actual output y are of size k
ŷᵢ estimates P(yᵢ = 1|x)
Probabilistically normalized version of the sigmoid: softmax(zᵢ) = exp(zᵢ) / Σⱼ exp(zⱼ), for 1 ≤ i ≤ k
E.g., input: a vector z of k scores; output: a probability distribution over the k classes
For (multinomial) logistic regression: P(yᵢ = 1|x) = exp(wᵢ·x + bᵢ) / Σⱼ exp(wⱼ·x + bⱼ)
Or, in matrix form: ŷ = softmax(Wx + b)
In multinomial logistic regression, a feature can be evidence for or against each individual class.
An exclamation mark ‘!’ may indicate positive or negative emotion, but not
neutral
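A numerically stable softmax sketch on an assumed score vector z (the values below are illustrative, one score per class):

import numpy as np

def softmax(z):
    """Probabilistically normalized scores: exp(z_i) / sum_j exp(z_j)."""
    z = z - np.max(z)               # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

z = np.array([0.6, 1.1, -1.5, 1.2, 3.2, -1.1])   # assumed example scores
p = softmax(z)
print(p, p.sum())                                # a probability distribution: sums to 1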
Cross-entropy loss function
Conditional maximum likelihood estimation: Choose 𝑤 and 𝑏 that
maximize the log 𝑝(𝑦|𝑥) in the training data given the observations 𝑥.
Only two possible outcomes: Bernoulli distribution
p(y|x) = ŷ^y (1 − ŷ)^(1−y)
Note: if y = 1, p(y|x) = ŷ; else if y = 0, p(y|x) = 1 − ŷ
Taking log: log p(y|x) = y log ŷ + (1 − y) log(1 − ŷ)
To make it a loss function, negate it (minimize rather than maximize): L_CE(ŷ, y) = −[y log ŷ + (1 − y) log(1 − ŷ)]
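A small sketch of the binary cross-entropy loss, showing that confident correct predictions are cheap and confident wrong ones expensive (the probability values are arbitrary illustrations):

import numpy as np

def cross_entropy_loss(y_hat, y):
    """Binary cross-entropy: L = -[y log y_hat + (1 - y) log(1 - y_hat)]."""
    return -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

# True class is 1: a confident correct prediction gives a small loss,
# a confident wrong prediction a large one.
print(cross_entropy_loss(0.9, 1))   # ≈ 0.105
print(cross_entropy_loss(0.1, 1))   # ≈ 2.303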
Stochastic Gradient Descent
Figuring out in which direction the function’s slope is rising the most
steeply, and moving in the opposite direction
Logistic regression: convex error function
Vs. Neural network: non-convex (multiple local minima)
The partial derivative gives how steeply the loss rises along that dimension; for logistic regression, ∂L_CE/∂wⱼ = [σ(w·x + b) − y] · xⱼ
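A plain-numpy SGD sketch for binary logistic regression (the toy data, learning rate, and epoch count are arbitrary illustrations):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sgd_logistic_regression(X, y, lr=0.1, epochs=100):
    """Stochastic gradient descent for binary logistic regression (sketch).
    Per-example gradients: dL/dw_j = (sigmoid(w·x + b) - y) * x_j, dL/db = sigmoid(w·x + b) - y.
    """
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        for i in np.random.permutation(len(X)):      # one example at a time
            err = sigmoid(np.dot(w, X[i]) + b) - y[i]
            w -= lr * err * X[i]                     # step against the gradient
            b -= lr * err
    return w, b

# Toy usage (assumed data): a larger feature value tends to mean class 1.
X = np.array([[0.0], [1.0], [0.2], [0.9]])
y = np.array([0, 1, 0, 1])
print(sgd_logistic_regression(X, y))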
Regularization
Large weights => the model fits the training data too closely and generalizes poorly (overfitting)
Add a penalty for large weights
L1 Regularization: linear function of the weights, R(w) = Σⱼ |wⱼ|
L2 Regularization: quadratic function of the weight values, R(w) = Σⱼ wⱼ²
L1 vs. L2
L1
Linear, but not differentiable at 0; more complex derivative
Corresponds to a Laplace prior on the weights
Prefers a sparse weight vector with a few large weights
L2
Simple derivative
Corresponds to a Gaussian prior with zero mean
Prefers weight vectors with many small weights
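A sketch of adding either penalty to the loss (the penalty weight alpha and the example numbers are arbitrary illustrations):

import numpy as np

def regularized_loss(ce_loss, w, alpha=0.01, kind="l2"):
    """Add a penalty for large weights to the (summed) cross-entropy loss.
    L1: alpha * sum |w_j|   (Laplace prior, encourages sparsity)
    L2: alpha * sum w_j**2  (Gaussian prior, encourages many small weights)
    """
    if kind == "l1":
        return ce_loss + alpha * np.sum(np.abs(w))
    return ce_loss + alpha * np.sum(w ** 2)

w = np.array([2.5, -5.0, -1.2, 0.5, 2.0, 0.7])
print(regularized_loss(1.0, w, kind="l1"))   # 1.0 + 0.01 * 11.9  = 1.119
print(regularized_loss(1.0, w, kind="l2"))   # 1.0 + 0.01 * 37.43 ≈ 1.374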