Multiple Regression Analysis
Cautions About Linear Regression
• Correlation and regression describe only linear relations.
• The correlation and the least-squares regression line are not resistant to outliers.
• Predictions outside the range of observed data are often inaccurate.
• The relationship between two variables is often influenced by lurking variables not included in our model.
General Principles of Data Analysis
• Plot your data: to understand the data, always start with a series of graphs.
• Interpret what you see: look for overall patterns and for deviations from those patterns.
• Numerical summary? Choose appropriate measures to describe the pattern and the deviations.
• Mathematical model? If the pattern is regular, summarize the data in a compact mathematical model.
Analysis of Two Quantitative Variables
• Plot your data: for two quantitative variables, use a scatterplot.
• Interpret what you see: describe the direction, form and strength of the relationship.
• Numerical summary? If the pattern is roughly linear, summarize with the correlation, means and standard deviations.
• Mathematical model? Regression gives a compact model of the overall pattern if the relationship is roughly linear.
Analysis of Three or More Quantitative Variables
• Plot your data: to examine the relationships among all possible pairs, use a scatterplot matrix.
• Interpret what you see: describe the direction, form and strength of the relationships.
• Numerical summary? If the pattern is roughly linear, summarize with correlations, means and standard deviations.
• Mathematical model? Multiple regression gives a compact model of the relationship between the response variable and a set of predictors.
           Multiple Regression
• Can we predict job performance (Y) from overall school achievement (X1) and IQ scores (X2)?
   - How much variance in Y is explained by X1 and X2 in combination?
   - How important is each predictor of job performance?
• Two kinds of research questions in Multiple Regression:
   - Is the model significant and important?
   - Are the individual predictors significant and important?
The Structural Model
Y = c + b1X1 + b2X2 + ... + bpXp + e

Y      – any dependent variable score is predicted according to:
c      – an intercept on the Y axis, plus
b1X1   – a weighted effect of predictor X1
b2X2   – a weighted effect of predictor X2
bpXp   – a weighted effect of predictor Xp
e      – error
The Structural Model
Y = c + b1X1 + b2X2 + ... + bpXp + e
DATA = MODEL + RESIDUAL
The Regression Plane – Two Predictors (3D space)
Unstandardized Partial Regression Coefficients - b
• Ŷ is calculated according to the least-squares criterion (LSC)
• solved by finding the set of weights (b) that minimises the errors of prediction (around the plane); see the sketch below
   - b1 indicates the change in Y for a unit change in X1, holding X2 … Xp constant
   - when standardised, it indicates the SD change in Y for an SD change in X, and is denoted by β
• c is the Y intercept
• Ŷ is therefore a weighted combination of the predictors (and intercept), called a linear composite (LC)
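As a rough illustration of the least-squares criterion (outside SPSS, and with entirely hypothetical data and variable names), the weights b and the intercept c can be found by minimising the squared errors of prediction, for example with NumPy:

```python
# Minimal sketch (hypothetical data): solving for the weights b and intercept c
# by the least-squares criterion, i.e. minimising squared errors of prediction.
import numpy as np

rng = np.random.default_rng(0)
n = 100
X1 = rng.normal(50, 10, n)                       # e.g. school achievement (hypothetical)
X2 = rng.normal(100, 15, n)                      # e.g. IQ scores (hypothetical)
Y = 0.4 * X1 + 0.2 * X2 + rng.normal(0, 5, n)    # criterion with random error

# Design matrix with a leading column of 1s for the intercept c
X = np.column_stack([np.ones(n), X1, X2])
coefs, *_ = np.linalg.lstsq(X, Y, rcond=None)    # [c, b1, b2]
c, b1, b2 = coefs

Y_hat = X @ coefs                                # the linear composite (predicted scores)
residuals = Y - Y_hat                            # DATA = MODEL + RESIDUAL
print(f"c = {c:.2f}, b1 = {b1:.2f}, b2 = {b2:.2f}")
```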
Bivariate regression
Multiple Regression
Variance Explained – R²
R² is simply the r² representing the proportion of variance in Y that is explained by Ŷ, the linear composite:
r² = SS_regression / SS_total = SS_Ŷ / SS_Y = Σ(Ŷ - Ȳ)² / Σ(Y - Ȳ)²
This is a ratio reflecting the proportion of variance captured by our model relative to the overall variance in our data.
R² = .50 means that 50% of the variance in Y is explained by the combination of X1, X2 … Xp (see the sketch below).
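A minimal sketch of this ratio, assuming observed scores Y and model predictions Ŷ are already available (the values below are toy numbers, not from the worked example):

```python
# Sketch: R^2 as the ratio SS_regression / SS_total, given observed y and
# predicted y_hat from a fitted regression model.
import numpy as np

def r_squared(y, y_hat):
    ss_regression = np.sum((y_hat - y.mean()) ** 2)   # SS of Y-hat around the mean of Y
    ss_total = np.sum((y - y.mean()) ** 2)            # SS of Y around the mean of Y
    return ss_regression / ss_total

y = np.array([3.0, 5.0, 7.0, 9.0, 11.0])              # hypothetical observed scores
y_hat = np.array([3.5, 4.8, 7.1, 8.9, 10.7])          # hypothetical predicted scores
print(r_squared(y, y_hat))                            # proportion of variance explained
```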
R² vs r²
      Significance of the Model
• R² tells us how important the model is
• the model can also be tested for statistical significance
• the test is conducted on R, the multiple correlation coefficient, with df = p and N - p - 1
F = [(N - p - 1) R²] / [p (1 - R²)] = MS_regression / MS_residual
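A small sketch of this F test computed from R², N and p (the numbers passed in are illustrative only); the SciPy F distribution supplies the p-value:

```python
# Sketch: testing the model, F = [(N - p - 1) R^2] / [p (1 - R^2)],
# evaluated against an F distribution with df = (p, N - p - 1).
from scipy import stats

def model_f_test(r2, n, p):
    f = ((n - p - 1) * r2) / (p * (1 - r2))
    p_value = stats.f.sf(f, p, n - p - 1)   # upper-tail probability
    return f, p_value

# e.g. R^2 = .50 with N = 100 cases and p = 2 predictors (hypothetical numbers)
print(model_f_test(0.50, 100, 2))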
 Importance of Individual Predictors
r    – simple correlation coefficient
b    – partial regression coefficient
β    – standardized partial regression coefficient
pr   – partial correlation coefficient
sr   – semi-partial correlation coefficient
r – simple correlation coefficient
  • indicates importance of predictor in terms of its
    direct relationship with the criterion
  • not very useful in Multiple Regression as it does
    not take into account inter-correlations with other
    predictors.
b – Partial Regression Coefficient
  • indication of the importance of a predictor in
    terms of the model (not the data).
  • scale-bound so can’t compare magnitude.
  • can, however, compare significance – each b is tested by dividing it by its standard error to give a t-value: t = b / SE_b, with df = N - p - 1 (sketched below)
β – standardized partial regression coefficient
    • indication of the importance of a predictor in
      terms of the model (not the data).
    • standardized (scale free) so you can compare
      magnitude
    • test of significance is same as for b
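A brief sketch of the t-test for a single coefficient described above, t = b / SE_b with df = N - p - 1 (all numbers below are illustrative, not taken from any output):

```python
# Sketch: each partial regression coefficient b is tested by dividing it by its
# standard error, giving a t statistic with df = N - p - 1.
from scipy import stats

def coefficient_t_test(b, se_b, n, p):
    t = b / se_b
    p_value = 2 * stats.t.sf(abs(t), n - p - 1)   # two-tailed p-value
    return t, p_value

print(coefficient_t_test(b=0.40, se_b=0.15, n=100, p=2))  # hypothetical values
```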
pr – Partial Correlation Coefficient
sr – Semi-Partial Correlation Coefficient
Unique, Shared and Total Variance
Assumptions of Multiple Regression
   • Scale (predictor and criterion scores)
      • measured using a continuous scale (interval
        or ratio)
      • normality (variables are normally distributed)
      • linearity (there is a straight line relationship
        between predictors and criterion)
      • predictors are not multicollinear or singular
        (extremely highly correlated)
Assumptions of Multiple Regression
  • Residuals
     • normality: the arrays of Y values are normally distributed around Ŷ (assumption of normality in arrays)
     • homoscedasticity: the variance of the Y values is constant across the full range of predicted values (assumption of homogeneity of variance in arrays)
     • linearity: straight-line relationship between Ŷ and the residuals (with mean = 0 and slope = 0)
     • independence: residuals are uncorrelated
     (Simple graphical checks of these assumptions are sketched below.)
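One simple way to eyeball the residual assumptions, assuming predictions Ŷ from some fitted model are available (hypothetical values generated below), is a histogram of the residuals together with a residuals-versus-predicted plot:

```python
# Sketch: quick residual checks for normality, homoscedasticity and linearity
# (hypothetical y and y_hat standing in for a fitted regression model).
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(1)
y_hat = rng.normal(50, 10, 200)            # hypothetical predicted values
y = y_hat + rng.normal(0, 5, 200)          # hypothetical observed values
residuals = y - y_hat

fig, axes = plt.subplots(1, 2, figsize=(10, 4))
axes[0].hist(residuals, bins=20)           # normality: roughly bell-shaped?
axes[0].set_title("Residuals histogram")
axes[1].scatter(y_hat, residuals, s=10)    # homoscedasticity / linearity:
axes[1].axhline(0, color="grey")           # even band around 0, no curvature
axes[1].set_title("Residuals vs predicted")
plt.show()
```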
Multicollinearity and Singularity
• occurs when predictors are highly correlated (>.90)
• causes unstable calculation of regression weights (b)
• diagnosed with inter-correlations, tolerance and VIF
Tolerance = 1 - R²x
       • where R²x is the overlap between a particular predictor and all the other predictors
       • values below .10 are considered problematic
Variance Inflation Factor (VIF) = 1 / Tolerance
       • values above 4 are considered problematic (sketched below)
• best solution is to remove or combine collinear predictors
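A sketch of how tolerance and VIF can be computed for each predictor by regressing it on the remaining predictors (the data here are hypothetical and deliberately collinear):

```python
# Sketch: tolerance = 1 - R^2_x and VIF = 1 / tolerance for each predictor,
# where R^2_x comes from regressing that predictor on all the others.
import numpy as np

rng = np.random.default_rng(2)
X1 = rng.normal(size=200)
X2 = 0.8 * X1 + 0.6 * rng.normal(size=200)    # deliberately correlated with X1
X3 = rng.normal(size=200)
predictors = np.column_stack([X1, X2, X3])

def tolerance_and_vif(predictors, j):
    y = predictors[:, j]
    others = np.delete(predictors, j, axis=1)
    design = np.column_stack([np.ones(len(others)), others])
    coefs, *_ = np.linalg.lstsq(design, y, rcond=None)
    fitted = design @ coefs
    r2_x = np.sum((fitted - y.mean()) ** 2) / np.sum((y - y.mean()) ** 2)
    tol = 1 - r2_x
    return tol, 1 / tol                        # (tolerance, VIF)

for j in range(predictors.shape[1]):
    print(j, tolerance_and_vif(predictors, j))
```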
Outliers – Extreme Cases
 • distort solution and inflate standard error
 • univariate outliers
        • cases beyond 3 SD on any variable
 • multivariate outliers
        • described in terms of:
            • leverage (h) – distance of case from group
              centroid along line/plane of best fit
            • discrepancy – extent to which case
              deviates from line/plane of best fit
            • influence – combined effect of leverage
              and discrepancy: effect of the outlier on
              the solution
Multivariate Outliers – high influence, high discrepancy
Multivariate Outliers – low influence, high discrepancy
Multivariate Outliers – Testing
Leverage
• leverage statistic (h): varies from 0 to 1; values > .50 are problematic
• Mahalanobis distance = h × (n - 1), distributed as chi-square and tested as such (df = p, α < .001)
Discrepancy
• not directly tested
Influence
• assesses the change in the solution when the case is removed
• Cook's distance: values > 1 are problematic (see the sketch below)
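A sketch of these diagnostics using the influence measures in statsmodels on hypothetical data; the Mahalanobis values follow the h × (n - 1) rule of thumb given above:

```python
# Sketch: leverage, Mahalanobis distance and Cook's distance for a fitted model
# (hypothetical data; thresholds follow the rules of thumb on the slide above).
import numpy as np
import statsmodels.api as sm
from scipy import stats

rng = np.random.default_rng(3)
X = rng.normal(size=(50, 2))
y = X @ np.array([1.0, 0.5]) + rng.normal(size=50)

results = sm.OLS(y, sm.add_constant(X)).fit()
influence = results.get_influence()

h = influence.hat_matrix_diag              # leverage, ranges 0 to 1; > .50 problematic
mahal = h * (len(y) - 1)                   # Mahalanobis distance via h x (n - 1)
cooks_d, _ = influence.cooks_distance      # Cook's distance; values > 1 problematic
crit = stats.chi2.ppf(0.999, df=2)         # chi-square cut-off, df = p = 2, alpha = .001

print(np.where(mahal > crit)[0])           # cases flagged as multivariate outliers
print(np.where(cooks_d > 1)[0])            # cases with problematic influence
```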
Working Example
A marketing manager of a large supermarket chain wanted to determine the effect of shelf space and price on the sales of pet food. A random sample of 15 equal-sized shops was selected, and the sales, shelf space in square metres and price per kilogram were recorded.
 1. What contribution do shelf space and price together make to the prediction of sales of pet food?
 2. Which is the better predictor of sales of pet food?
 3. Do a residual analysis.
The data file can be found in Work17.sav.
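Although the worked example below uses SPSS, an equivalent analysis could be sketched with pandas and statsmodels, assuming Work17.sav contains variables named space, price and sales as described in the SPSS steps (pandas.read_spss requires the pyreadstat package to be installed):

```python
# Sketch of the same analysis outside SPSS, assuming the variable names
# space, price and sales in Work17.sav.
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

data = pd.read_spss("Work17.sav")

X = sm.add_constant(data[["space", "price"]])   # predictors plus intercept
y = data["sales"]
results = sm.OLS(y, X).fit()

print(results.summary())                # R Square, ANOVA F-test, b, t-values, p-values
print(durbin_watson(results.resid))     # independence of residuals
```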
Using SPSS
   Graphs
    Scatter/Dot
      Matrix Scatter
Using SPSS
  Graph
  [DataSet1] C:\Users\demo\Desktop\Corr&RegressPresentation\SPSS DataFile\Work17.sav
Using SPSS
  Multiple Linear Regression:
  Starting the Procedure
  • In the menu, click on Analyze
  • Point to Regression
  • Point to Linear… and click
Using SPSS
 Multiple Linear Regression:
 Selecting Variables
Choose the variables for analysis from the list in the variable box. To select multiple variables, hold down the Ctrl key and choose the variables that you want.
Using SPSS
 Multiple Linear Regression:
 Selecting Variables
Move shelf space (space) and price per kg (price), which are already highlighted, to the box labeled Independent(s) by clicking the arrow. Move sales of pet food (sales) to the box labeled Dependent by clicking the arrow.
Using SPSS
 Multiple Linear Regression:
 Requesting Statistics
Request descriptive statistics by clicking the button labeled Statistics…
Using SPSS
 Multiple Linear Regression:
 Requesting Statistics
Statistics for the Model fit and Estimates for the Regression Coefficients will be produced by default. Click the checkbox for Descriptives. Also, click the checkbox for Durbin-Watson under Residuals. Click the Continue button.
Using SPSS
 Multiple Linear Regression:
 Standardized Residual Plots
You can also request several different plots. Click the Plots… button. In the box labeled Standardized Residual Plots, first click the checkbox for Histogram, then click the box for Normal probability plot. Click the Continue button.
Using SPSS
 Multiple Linear Regression:
 Enter Method
The independent variables can be entered into the analysis using five different methods.
Enter Method: a procedure for variable selection in which all variables in a block are entered in a single step.
Using SPSS
 Multiple Linear Regression:
 Enter Method
Enter is the default method of variable entry. Click the OK button to run the Multiple Linear Regression procedure.
Using SPSS
  Multiple Linear Regression Output:
  Descriptive Statistics
  Regression
  [DataSet1] C:\Users\demo\Desktop\Corr&RegressPresentation\SPSS DataFile\Work17.sav
Using SPSS
  Multiple Linear Regression Output:
  Correlations
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Variables Entered
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Model Summary
Callouts in the output: the correlation (R), the coefficient of determination (R Square), the standard deviation around the regression line (Std. Error of the Estimate), and the Durbin-Watson statistic.
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Model Summary
Independence
The Durbin-Watson statistic is defined as:
D = Σ (e_t - e_{t-1})² / Σ e_t²
Another way to look at the Durbin-Watson statistic is:
D = 2(1 - ρ)
where ρ is the correlation between consecutive errors.
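A small sketch computing D directly from a vector of residuals and comparing it with the 2(1 - ρ) form (the residuals here are hypothetical, not taken from the SPSS output):

```python
# Sketch: Durbin-Watson statistic, D = sum_{t=2..N}(e_t - e_{t-1})^2 / sum_{t=1..N} e_t^2.
# Values near 2 suggest independent (uncorrelated) residuals.
import numpy as np

def durbin_watson_stat(e):
    return np.sum(np.diff(e) ** 2) / np.sum(e ** 2)

e = np.random.default_rng(4).normal(size=100)    # hypothetical residuals
rho = np.corrcoef(e[:-1], e[1:])[0, 1]           # correlation between consecutive errors
print(durbin_watson_stat(e), 2 * (1 - rho))      # the two forms agree closely
```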
Using SPSS
  Multiple Linear Regression Enter Method Output:
  ANOVA
Callout in the output: measures of variation.
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Coefficients
    Regression Equation:
    ŷi = 10.50x1 + 0.057x2 + 2.029
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Residuals Statistics
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Residuals Histogram
Normality
Normality of residuals is required only for valid hypothesis testing; that is, the normality assumption assures that the p-values for the t-tests and the F-test will be valid. Normality is not required in order to obtain unbiased estimates of the regression coefficients.
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Plot of Standardized Residuals
Normality
A standardized normal probability (P-P) plot is sensitive to non-normality in the middle range of the data rather than in the tails.
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Interpretation of Output
  1. What contribution do both shelf space and price make to the
  prediction of sales of pet food?
Both independent variables (shelf space and price) together explain 85 per cent of the variance (R Square) in sales of pet food, which is highly significant, as indicated by the F-value of 34.08.
Using SPSS
  Multiple Linear Regression Enter Method Output:
  Interpretation of Output
2. Which of the two variables is the better predictor of sales of pet food?
An examination of the t-values and Beta values indicates that price contributes more to the prediction of sales. Therefore, you can say that price significantly predicts sales of pet food, t = 3.22, p < .05. However, the shelf space allocated is not a significant predictor.