Predictive Analytics
Regression and Classification
Module 7
Sourish Das
Chennai Mathematical Institute
Linear Regression
mpg = β0 + β1 wt + ε
[Figure: scatter plot of mpg against wt]
Linear Regression
I mpg = β0 + β1 wt + ε
I We write the model in matrix form as a linear model
  y = Xβ + ε,
  where y = (mpg1, mpg2, ..., mpgn)^T;
      [ 1  wt1 ]
  X = [ 1  wt2 ]
      [ ⋮   ⋮  ]
      [ 1  wtn ]
  β = (β0, β1)^T and ε = (ε1, ε2, ..., εn)^T
Linear Regression
I Normal Equations:
  β̂ = (β̂0, β̂1)^T = (X^T X)^{-1} X^T y
    = [ n       Σ wti  ]^{-1} [ Σ mpgi     ]
      [ Σ wti   Σ wti² ]      [ Σ wti·mpgi ]
  where all sums run over i = 1, ..., n.
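The 2×2 normal equations above can be checked numerically. A minimal numpy sketch, assuming illustrative mtcars-style (wt, mpg) values rather than the full course dataset:

```python
import numpy as np

# Hypothetical mtcars-style data (wt in 1000 lbs, mpg); values are illustrative.
wt  = np.array([2.62, 2.88, 2.32, 3.21, 3.44, 3.46])
mpg = np.array([21.0, 21.0, 22.8, 21.4, 18.7, 18.1])
n = len(wt)

# Build X^T X and X^T y exactly as in the 2x2 normal equations above.
XtX = np.array([[n,        wt.sum()],
                [wt.sum(), (wt**2).sum()]])
Xty = np.array([mpg.sum(), (wt * mpg).sum()])

beta_hat = np.linalg.solve(XtX, Xty)   # (beta0_hat, beta1_hat)

# Cross-check against the design-matrix form beta_hat = (X^T X)^{-1} X^T y.
X = np.column_stack([np.ones(n), wt])
beta_check, *_ = np.linalg.lstsq(X, mpg, rcond=None)
```

Both routes give the same estimate; the explicit sums are just the entries of X^T X and X^T y written out.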
Regression Plane
mpg = β0 + β1 wt + β2 disp + ε
[Figure: 3-D scatter of mpg against wt and disp with the fitted regression plane]
Regression Plane
I mpg = β0 + β1 wt + β2 disp + ε
I We write the model in matrix form as a linear model
  y = Xβ + ε,
  where y = (mpg1, mpg2, ..., mpgn)^T;
      [ 1  wt1  disp1 ]
  X = [ 1  wt2  disp2 ]
      [ ⋮   ⋮     ⋮   ]
      [ 1  wtn  dispn ]
  β = (β0, β1, β2)^T and ε = (ε1, ε2, ..., εn)^T
Linear Plane
I mpg = β0 + β1 wt + β2 disp + ε
I Normal Equations:
  β̂ = (β̂0, β̂1, β̂2)^T = (X^T X)^{-1} X^T y
I Ask yourself: what do X^T X and X^T y look like now?
Linear Plane
I mpg = β0 + β1 wt + β2 disp + ε
I Normal Equations:
  β̂ = (β̂0, β̂1, β̂2)^T = (X^T X)^{-1} X^T y
    = [ n        Σ wti         Σ dispi      ]^{-1} [ Σ mpgi       ]
      [ Σ wti    Σ wti²        Σ wti·dispi  ]      [ Σ wti·mpgi   ]
      [ Σ dispi  Σ wti·dispi   Σ dispi²     ]      [ Σ dispi·mpgi ]
  where all sums run over i = 1, ..., n.
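Rather than typing out the 3×3 sums, the same estimate comes from the design matrix directly. A sketch with illustrative mtcars-style rows (wt, disp, mpg):

```python
import numpy as np

# Hypothetical mtcars-style rows (wt, disp, mpg); values are illustrative.
wt   = np.array([2.62, 2.88, 2.32, 3.21, 3.44, 3.46, 3.57, 3.19])
disp = np.array([160., 160., 108., 258., 360., 225., 360., 146.7])
mpg  = np.array([21.0, 21.0, 22.8, 21.4, 18.7, 18.1, 14.3, 24.4])

# Design matrix with an intercept column: row i is (1, wt_i, disp_i).
X = np.column_stack([np.ones(len(wt)), wt, disp])

# Solve the normal equations (X^T X) beta = X^T y without forming the inverse.
beta_hat = np.linalg.solve(X.T @ X, X.T @ mpg)
```

The entries of X.T @ X are exactly the sums in the 3×3 matrix on this slide.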
Quadratic Regression
mpg = β0 + β1 hp + β2 hp² + ε
[Figure: scatter plot of mpg against hp]
Feature Engineering
mpg = β0 + β1 hp + β2 hp² + ε
[Figure: 3-D scatter of mpg against hp and the engineered feature hp², with the fitted plane]
Quadratic Regression
I mpg = β0 + β1 hp + β2 hp² + ε
I We write the model in matrix form as a linear model
  y = Xβ + ε,
  where y = (mpg1, mpg2, ..., mpgn)^T;
      [ 1  hp1  hp1² ]
  X = [ 1  hp2  hp2² ]
      [ ⋮   ⋮    ⋮   ]
      [ 1  hpn  hpn² ]
  β = (β0, β1, β2)^T and ε = (ε1, ε2, ..., εn)^T
I The model is still linear in the parameters.
Quadratic Regression
I Normal Equations:
  β̂ = (β̂0, β̂1, β̂2)^T = (X^T X)^{-1} X^T y
    = [ n       Σ hpi   Σ hpi² ]^{-1} [ Σ mpgi      ]
      [ Σ hpi   Σ hpi²  Σ hpi³ ]      [ Σ hpi·mpgi  ]
      [ Σ hpi²  Σ hpi³  Σ hpi⁴ ]      [ Σ hpi²·mpgi ]
  where all sums run over i = 1, ..., n.
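The quadratic fit is ordinary OLS once hp² is added as an engineered column. A sketch assuming illustrative mtcars-style (hp, mpg) values:

```python
import numpy as np

# Hypothetical mtcars-style (hp, mpg) values; illustrative only.
hp  = np.array([110., 110., 93., 175., 245., 62., 150., 66., 335., 91.])
mpg = np.array([21.0, 21.0, 22.8, 18.7, 13.3, 24.4, 15.2, 32.4, 15.0, 26.0])

# Engineered feature: append hp^2; the model stays linear in beta.
X = np.column_stack([np.ones(len(hp)), hp, hp**2])
beta_hat = np.linalg.solve(X.T @ X, X.T @ mpg)

# Fitted curve on a small grid of hp values.
grid = np.linspace(hp.min(), hp.max(), 5)
fit = beta_hat[0] + beta_hat[1] * grid + beta_hat[2] * grid**2
```

The estimate agrees with a direct degree-2 polynomial fit, since both solve the same least-squares problem.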
Feature Engineering
mpg = β0 + β1 hp + β2 hp² + ε
[Figure: mpg plotted against hp and the engineered feature hp²]
Feature Engineering/ Variable Transformation
I We map the original data into a higher-dimensional space and
I hope to find a good linear hyper-plane fit in that higher dimension,
I which then explains the non-linear relationship between the feature space and the target variable.
Non-linear Regression Basis Functions
I Consider the i-th record:
  yi = f(xi) + εi,  i = 1, 2, ..., n
I Represent f(x) as
  f(x) = Σ_{j=1}^{K} βj φj(x) = φβ
  We say φ is a basis system for f(x).
Representing Functions with Basis Functions
I mpg = β0 + β1 hp + β2 hp² + ε
I Generic terms for curvature in linear regression:
  yi = β1 + β2 xi + β3 xi² + ... + εi
  implies
  f(x) = β1 + β2 x + β3 x² + ...
I In ML, φ is sometimes known as 'engineered features',
  and the process is known as 'feature engineering'.
Fourier Basis
I Sine and cosine functions of increasing frequencies:
  yi = β1 + β2 sin(ωxi) + β3 cos(ωxi) + β4 sin(2ωxi) + β5 cos(2ωxi) + ... + εi
I The constant ω = 2π/P defines the period P of oscillation of the first sine/cosine pair; P is assumed known.
I φ = {1, sin(ωx), cos(ωx), sin(2ωx), cos(2ωx), ...}
I β = (β1, β2, β3, ...)^T, so that
  y = φβ + ε
I Again, in ML φ is known as 'engineered features'.
I Example: mpg = β0 + β1 sin(ω hp) + ε
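The Fourier basis is just another engineered design matrix. A minimal sketch on synthetic periodic data, assuming the period P is known and using K sine/cosine pairs (both names are my own):

```python
import numpy as np

def fourier_design(x, P, K):
    """Columns {1, sin(k*omega*x), cos(k*omega*x)} for k = 1..K, omega = 2*pi/P."""
    omega = 2 * np.pi / P
    cols = [np.ones_like(x)]
    for k in range(1, K + 1):
        cols.append(np.sin(k * omega * x))
        cols.append(np.cos(k * omega * x))
    return np.column_stack(cols)

# Synthetic signal with period P = 5 plus small Gaussian noise.
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 200)
y = 2.0 + 1.5 * np.sin(2 * np.pi * x / 5) + 0.1 * rng.standard_normal(200)

Phi = fourier_design(x, P=5, K=2)   # phi = {1, sin, cos, sin 2w, cos 2w}
beta_hat = np.linalg.solve(Phi.T @ Phi, Phi.T @ y)
```

With P known, estimating β is plain OLS on Φ; the fitted coefficients recover the intercept and the first sine amplitude.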
Functional Estimation/Learning
I We write the function via its basis expansion:
  y = φβ + ε
I Let's assume the basis (or engineered features) φ is fully known.
I The problem is that β is unknown; hence we estimate β.
I OLS Estimator:
  β̂ = (φ^T φ)^{-1} φ^T y
Uncertainty associated with the OLS estimator
I How do we estimate the uncertainty (i.e., the margin of error) associated with the OLS estimator β̂?
I If x0 is a test point, then
  ŷ = φ(x0) β̂
  is the predicted value of the true but unknown y0.
I What is the margin of error of ŷ?
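The point prediction ŷ = φ(x0)β̂ can be sketched before any uncertainty machinery. A minimal example with a quadratic basis φ(x) = (1, x, x²) on synthetic training data (the function `phi` and the data are illustrative assumptions):

```python
import numpy as np

def phi(x):
    """Quadratic basis phi(x) = (1, x, x^2), one row per input point."""
    x = np.atleast_1d(x).astype(float)
    return np.column_stack([np.ones_like(x), x, x**2])

# Synthetic training data, roughly y = 1 + x^2 with small noise.
x_train = np.array([1., 2., 3., 4., 5., 6.])
y_train = np.array([2.1, 5.0, 10.2, 16.9, 26.1, 37.0])

# OLS: beta_hat = (phi^T phi)^{-1} phi^T y.
Phi = phi(x_train)
beta_hat = np.linalg.solve(Phi.T @ Phi, Phi.T @ y_train)

# Point prediction at a new test point x0.
x0 = 3.5
y_hat = float(phi(x0) @ beta_hat)
```

This gives only the point prediction; attaching a margin of error to ŷ needs the sampling distribution of β̂, which is the topic of the next module.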
Next ...
I We will discuss sampling distributions and inference of
regression coefficients!