Multiple Linear Regression
Sasadhar Bera, IIM Ranchi
Multiple Linear Regression Model
Multiple linear regression involves one dependent
variable and more than one independent variable. The
equation that describes the multiple linear regression
model is given below:

y = β0 + β1 x1 + β2 x2 + . . . + βk xk + ε

y is the dependent variable and x1, x2, . . ., xk are the
independent variables. These independent variables are
used to predict the dependent variable.

β0, β1, β2, . . ., βk are in total (k+1) unknown regression
coefficients (also called model parameters). These
regression coefficients are estimated from observed
sample data.

The term ε (pronounced "epsilon") is the random error.
Data for Multiple Regression
Suppose that n observations are collected on the response
variable (y) and on the k independent variables present in
the regression model. The data layout, with i = 1, 2, . . ., n
indexing observations and j = 1, 2, . . ., k indexing variables:

i     y     x1     x2     . . .   xj     . . .   xk
1     y1    x11    x12    . . .   x1j    . . .   x1k
2     y2    x21    x22    . . .   x2j    . . .   x2k
.     .     .      .              .              .
i     yi    xi1    xi2    . . .   xij    . . .   xik
.     .     .      .              .              .
n     yn    xn1    xn2    . . .   xnj    . . .   xnk
Scalar Notation: Multiple Linear Regression
The scalar notation of the regression model:

yi = β0 + β1 xi1 + β2 xi2 + . . . + βj xij + . . . + βk xik + εi,
i = 1, 2, . . ., n;  j = 1, 2, . . ., k

n = total number of observations
k = number of independent variables
βj's are the model parameters.
Matrix Notation: Multiple Linear Regression
In matrix notation, the regression model is

y(n×1) = X(n×(k+1)) β((k+1)×1) + ε(n×1)

where n is the total number of observations, k is the number
of independent variables, and β is the vector of model
parameters:

y = [y1, . . ., yi, . . ., yn]ᵀ

X = | 1  x11  . . .  x1j  . . .  x1k |
    | .  .          .           .   |
    | 1  xi1  . . .  xij  . . .  xik |
    | .  .          .           .   |
    | 1  xn1  . . .  xnj  . . .  xnk |

β = [β0, β1, . . ., βj, . . ., βk]ᵀ

ε = [ε1, . . ., εi, . . ., εn]ᵀ
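As a concrete illustration, the response vector and design matrix can be built with NumPy. The data values here are hypothetical, not from the slides:

```python
import numpy as np

# Hypothetical sample: n = 5 observations, k = 2 regressors.
y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])      # response vector
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 5.0])

# Design matrix X: a leading column of ones (for the intercept
# beta_0) followed by one column per regressor -> shape n x (k+1).
X = np.column_stack([np.ones(len(y)), x1, x2])

print(X.shape)   # (5, 3), i.e. n x (k + 1)
```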
Model Parameter Estimation
The error in a regression model is the difference between
the actual and predicted value. It may be positive or
negative.

The error is also known as the residual. The value predicted
by the regression equation is called the fitted value, or fit.

The sum of squared differences between the actual and
predicted values is known as the sum of squared errors. The
least squares method minimizes the sum of squared errors
to find the best-fitting plane.

Note that the regressor variables in a linear regression
model are non-random; that is, their values are fixed.
Model Parameter Estimation (Contd.)
In matrix notation, the regression equation is

y = Xβ + ε

The least squares estimator is the β̂ that minimizes

L = Σ(i = 1 to n) εi² = (y - Xβ)ᵀ (y - Xβ)

The least squares estimator must satisfy

∂L/∂β = -2 Xᵀ y + 2 Xᵀ X β̂ = 0

β̂ = (Xᵀ X)⁻¹ Xᵀ y,  the estimated model parameters.

The fitted regression line: ŷ = X β̂
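A minimal NumPy sketch of the least squares estimate, using the same hypothetical data as earlier. Solving the normal equations with `np.linalg.solve` avoids forming the inverse explicitly; `np.linalg.lstsq` serves as a cross-check:

```python
import numpy as np

# Hypothetical data: n = 5 observations, k = 2 regressors.
y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],   # x1
    [2.0, 1.0, 4.0, 3.0, 5.0],   # x2
])

# Normal equations: (X^T X) beta_hat = X^T y.
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Fitted values: y_hat = X beta_hat.
y_hat = X @ beta_hat

# Cross-check against NumPy's built-in least squares solver.
beta_ref, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.allclose(beta_hat, beta_ref))   # True
```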
Estimated Residual and Standard Error
For the ith observation (xi), the predicted value or fit is

ŷi = xiᵀ β̂

The error in the fit is called the residual:

ei = yi - ŷi

Mean square error = MSE = [ Σ(i = 1 to n) ei² ] / (n - k - 1)

where n is the total number of observations and k is the
number of regressors.

Standard error (SE) of estimate = σ̂ = √MSE

Var(β̂) = σ̂² (Xᵀ X)⁻¹
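Continuing the hypothetical example, the MSE, the standard error of estimate, and the coefficient covariance matrix follow directly from the formulas above:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 4.0, 3.0, 5.0],
])
n, p = X.shape              # p = k + 1
k = p - 1

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta_hat                  # residuals e_i = y_i - yhat_i

mse = (e @ e) / (n - k - 1)           # MSE = sum(e_i^2) / (n - k - 1)
se = np.sqrt(mse)                     # standard error of estimate

# Var(beta_hat) = MSE * (X^T X)^(-1); its diagonal holds the
# variance of each estimated coefficient.
cov_beta = mse * np.linalg.inv(X.T @ X)
print(np.sqrt(np.diag(cov_beta)))     # coefficient standard errors
```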
Testing Significance of Regression Model
The test for significance of regression determines whether
there is a linear relationship between the response variable
and the regressor variables.

H0: β1 = β2 = . . . = βk = 0
H1: at least one βj is not zero

The test procedure involves an analysis of variance
(ANOVA) partitioning of the total sum of squares into a sum
of squares due to regression and a sum of squares due to
error (or residual).

Total number of model parameters = p = number of
regression coefficients = k + 1
Testing Significance of Regression Model (Contd.)
ANOVA table:

Source of Variation | DF        | SS  | MS                  | Fcal
Regression          | k         | SSR | MSR = SSR / k       | MSR/MSE
Residual error      | n - k - 1 | SSE | MSE = SSE / (n-k-1) |
Total               | n - 1     | TSS |                     |

The sums of squares:

TSS = Σ(i = 1 to n) (yi - ȳ)²

SSR = Σ(i = 1 to n) (ŷi - ȳ)² = β̂ᵀ Xᵀ y - [ Σ(i = 1 to n) yi ]² / n

SSE = Σ(i = 1 to n) (yi - ŷi)² = yᵀ y - β̂ᵀ Xᵀ y

TSS = SSR + SSE
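The ANOVA quantities can be sketched numerically on the hypothetical data; the identity TSS = SSR + SSE serves as a built-in check:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 4.0, 3.0, 5.0],
])
n, p = X.shape
k = p - 1

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
y_hat = X @ beta_hat

tss = np.sum((y - y.mean()) ** 2)       # total sum of squares
ssr = np.sum((y_hat - y.mean()) ** 2)   # sum of squares, regression
sse = np.sum((y - y_hat) ** 2)          # sum of squares, error

msr = ssr / k
mse = sse / (n - k - 1)
f_cal = msr / mse                       # compare with F(k, n-k-1)

print(np.isclose(tss, ssr + sse))       # True: TSS = SSR + SSE
```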
Significance Test of Individual Regression
Coefficient
Adding an unimportant variable to the model can actually
increase the mean square error, thereby decreasing the
usefulness of the model.

The hypothesis for testing the significance of any
individual regression coefficient, say βj, is

H0: βj = 0
H1: βj ≠ 0

Test statistic: Tcal = β̂j / √(σ̂² Cjj)

where σ̂² is the mean square error (MSE) and Cjj is the jth
diagonal element of (Xᵀ X)⁻¹. Reject H0 if |Tcal| > tα/2, (n-k-1).
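A sketch of the per-coefficient t-statistics on the hypothetical data; the critical value tα/2, (n-k-1) would come from a t-table or `scipy.stats.t.ppf` and is not computed here:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 4.0, 3.0, 5.0],
])
n, p = X.shape
k = p - 1

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ y
e = y - X @ beta_hat
mse = (e @ e) / (n - k - 1)

# T_cal,j = beta_hat_j / sqrt(MSE * C_jj), where C_jj is the j-th
# diagonal element of (X^T X)^(-1).
c_jj = np.diag(XtX_inv)
t_cal = beta_hat / np.sqrt(mse * c_jj)
print(t_cal)    # one statistic per coefficient (incl. intercept)
```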
Confidence Interval of Mean Response
In matrix notation, the regression equation is

y = Xβ + ε,  where ε ~ Normal(0, σ²)

Consider a point x0 = [1, x01, x02, . . ., x0j, . . ., x0k]ᵀ.
Since E(ε) = 0, the mean response is E(y) = E(Xβ) + E(ε) = Xβ, so

ŷ|x0 = E(y | x0) = x0ᵀ β̂

var(ŷ | x0) = σ̂² x0ᵀ (Xᵀ X)⁻¹ x0

The 100(1-α)% confidence interval of the mean response at
point x0:

ŷ|x0 ± tα/2, (n-p) √[ σ̂² x0ᵀ (Xᵀ X)⁻¹ x0 ]
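A sketch of the interval computation at a hypothetical point x0. The critical value t0.025, 2 ≈ 4.303 is taken from a t-table, assuming α = 0.05 and n - p = 2 degrees of freedom for this small example:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 4.0, 3.0, 5.0],
])
n, p = X.shape

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ y
e = y - X @ beta_hat
mse = (e @ e) / (n - p)

x0 = np.array([1.0, 2.5, 3.0])     # [1, x01, x02], hypothetical point
y0_hat = x0 @ beta_hat             # estimated mean response at x0
se_mean = np.sqrt(mse * (x0 @ XtX_inv @ x0))

t_crit = 4.303                     # t_{0.025, n-p} from a t-table
lower = y0_hat - t_crit * se_mean
upper = y0_hat + t_crit * se_mean
print(lower, upper)
```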
Coefficient of Multiple Determination
Coefficient of multiple determination:

R² = SSR / TSS = (TSS - SSE) / TSS = 1 - SSE / TSS

SSR: sum of squares due to regression
SSE: sum of squares due to error
TSS: total sum of squares

The coefficient of multiple determination is the fraction of
the variation of the dependent variable explained by the
regressor variables.

R² measures the goodness of the linear fit: the better the
linear fit, the closer R² is to 1.
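A quick numerical check on the hypothetical example confirms that the two forms of R² agree:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 4.0, 3.0, 5.0],
])
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
y_hat = X @ beta_hat

tss = np.sum((y - y.mean()) ** 2)
ssr = np.sum((y_hat - y.mean()) ** 2)
sse = np.sum((y - y_hat) ** 2)

r2 = ssr / tss
print(np.isclose(r2, 1.0 - sse / tss))   # True: SSR/TSS = 1 - SSE/TSS
```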
Coefficient of Multiple Determination (Contd.)
The major drawback of the coefficient of multiple
determination (R²) is that adding a predictor variable to the
model always increases R², regardless of whether the
additional variable is significant or not. To avoid this,
regression model builders prefer the adjusted R² statistic:

R²adj = 1 - [ SSE / (n - p) ] / [ TSS / (n - 1) ]
      = 1 - [ (n - 1) / (n - p) ] (1 - R²)

In general, the adjusted R² statistic will not increase as
variables are added to the model.

When R² and adjusted R² differ dramatically, there is a good
chance that non-significant terms have been included in the
model.
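The drawback can be illustrated numerically: appending a regressor with arbitrary hypothetical "noise" values, unrelated to y, can never decrease R², while adjusted R² penalizes the extra parameter:

```python
import numpy as np

y = np.array([3.0, 5.0, 7.0, 6.0, 9.0])
X = np.column_stack([
    np.ones(5),
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 1.0, 4.0, 3.0, 5.0],
])

def r2_stats(X, y):
    """Return (R^2, adjusted R^2) for an OLS fit of y on X."""
    n, p = X.shape
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    sse = np.sum((y - X @ beta) ** 2)
    tss = np.sum((y - y.mean()) ** 2)
    r2 = 1.0 - sse / tss
    r2_adj = 1.0 - (sse / (n - p)) / (tss / (n - 1))
    return r2, r2_adj

r2_small, adj_small = r2_stats(X, y)

# Append an arbitrary "noise" column unrelated to y.
noise = np.array([0.7, -1.2, 0.3, 1.5, -0.4])
X_big = np.column_stack([X, noise])
r2_big, adj_big = r2_stats(X_big, y)

print(r2_big >= r2_small)   # True: R^2 never decreases
```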