ENGINEERING DATA ANALYSIS
Prepared by: Engr. Marlo Dexie Dale G. Malaluan

Module 8: Multiple Linear Regression
MODULE OUTLINE
12.1 Multiple Linear Regression Model
  12.1.1 Introduction
  12.1.2 Least squares estimation of the parameters
  12.1.3 Matrix approach to multiple linear regression
  12.1.4 Properties of the least squares estimators
12.2 Hypothesis Tests in Multiple Linear Regression
  12.2.1 Test for significance of regression
  12.2.2 Tests on individual regression coefficients & subsets of coefficients
Learning Objectives for Module 8
After careful study of this module, you should be able to do the following:
1. Use multiple regression techniques to build empirical models from engineering and scientific data.
2. Understand how the method of least squares extends to fitting multiple regression models.
3. Assess regression model adequacy.
4. Test hypotheses and construct confidence intervals on the regression coefficients.
Multiple Linear Regression Model
• Many applications of regression analysis involve situations in which there is more than one regressor variable.
• A regression model that contains more than one regressor variable is called a multiple regression model.
Multiple Linear Regression Model
• For example, suppose that the gasoline mileage performance of a vehicle depends on the
vehicle weight and the engine displacement. A multiple regression model that might
describe this relationship is
Y = β0 + β1x1 + β2x2 + ε
where Y = mileage, x1 = weight, and x2 = engine displacement.
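Once the coefficients are estimated, a model of this form is evaluated by simple substitution. A minimal sketch (the coefficient values below are purely hypothetical, chosen only to illustrate the model form, not estimated from any data):

```python
# Hypothetical coefficients for the mileage model Y = b0 + b1*x1 + b2*x2
# (illustrative values only; real coefficients come from least squares fitting)
b0, b1, b2 = 40.0, -0.005, -0.02

def predicted_mileage(weight, displacement):
    """Evaluate the regression equation at (x1, x2) = (weight, displacement)."""
    return b0 + b1 * weight + b2 * displacement

print(predicted_mileage(3000, 150))  # 40 - 0.005*3000 - 0.02*150 = 22.0
```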
Least Squares Estimation of Parameters
The method of least squares may be used to estimate the regression coefficients in the multiple regression model, Y = β0 + β1x1 + β2x2 + ⋯ + βkxk + ε.
Suppose that n > k observations are available, and let xij denote the ith observation or level of variable xj. The observations are (xi1, xi2, …, xik, yi), i = 1, 2, …, n, with n > k.
It is customary to present the data for multiple regression in a table such as Table 12.1.
Least Squares Estimation of Parameters (ctd)
Minimizing the least squares function with respect to β0, β1, …, βk gives the least squares normal equations. The solution to the normal equations is the set of least squares estimators of the regression coefficients.
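Forming and solving the normal equations X′Xβ̂ = X′y can be sketched numerically. The synthetic data set below is an assumption for illustration (not from the slides); the estimates should land close to the true coefficients used to generate the data:

```python
import numpy as np

rng = np.random.default_rng(42)

# Synthetic data from a known model y = 3 + 2*x1 - 1*x2 + noise (illustrative only)
n = 50
x1 = rng.uniform(0, 10, n)
x2 = rng.uniform(0, 5, n)
y = 3 + 2 * x1 - 1 * x2 + rng.normal(0, 0.1, n)

# Model matrix with a column of ones for the intercept
X = np.column_stack([np.ones(n), x1, x2])

# Form and solve the least squares normal equations (X'X) beta = X'y
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
print(beta_hat)
```

`np.linalg.solve` plays the role of "solving the normal equations"; it gives the same answer as a general least squares routine such as `np.linalg.lstsq`.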
Example 12.1 | Wire Bond Strength
• In Chapter 1, we used data on pull strength of a wire bond in a semiconductor
manufacturing process, wire length, and die height to illustrate building an empirical
model.
• We will use the same data, repeated for convenience in Table 12.2, and show the
details of estimating the model parameters.
• A three-dimensional scatter plot of the data is presented in Fig. 1.15. Figure 12.4
shows a matrix of two-dimensional scatter plots of the data.
• These displays can be helpful in visualizing the relationships among variables in a
multivariable data set.
• For example, the plot indicates that there is a strong linear relationship between
strength and wire length.
Example 12.1 | Wire Bond Strength (ctd)
Practical Interpretation: This equation can be used to predict pull strength for pairs of values of the regressor variables wire length (x1) and die height (x2). This is essentially the same regression model given in Section 1.3. Figure 1.16 shows a three-dimensional plot of the plane of predicted values generated from this equation.
Matrix Approach to Multiple Linear Regression
In matrix notation, the multiple regression model is y = Xβ + ε, where y is an (n × 1) vector of observations, X is an (n × p) model matrix of the levels of the regressor variables, β is a (p × 1) vector of regression coefficients, and ε is an (n × 1) vector of random errors. The least squares normal equations are X′Xβ̂ = X′y, and the resulting least squares estimate is
β̂ = (X′X)⁻¹X′y
Example 12.2 | Wire Bond Strength
The model matrix X and y vector for this model are

X = [1   2   50]    y = [ 9.95]
    [1   8  110]        [24.45]
    [1  11  120]        [31.75]
    [1  10  550]        [35.00]
    [1   8  295]        [25.02]
    [1   4  200]        [16.86]
    [1   2  375]        [14.38]
    [1   2   52]        [ 9.60]
    [1   9  100]        [24.35]
    [1   8  300]        [27.50]
    [1   4  412]        [17.08]
    [1  11  400]        [37.00]
    [1  12  500]        [41.95]
    [1   2  360]        [11.66]
    [1   4  205]        [21.65]
    [1   4  400]        [17.89]
    [1  20  600]        [69.00]
    [1   1  585]        [10.30]
    [1  10  540]        [34.93]
    [1  15  250]        [46.59]
    [1  15  290]        [44.88]
    [1  16  510]        [54.12]
    [1  17  590]        [56.63]
    [1   6  100]        [22.13]
    [1   5  400]        [21.15]
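The least squares calculation for this example can be reproduced numerically. The sketch below (not part of the original slides) applies β̂ = (X′X)⁻¹X′y to the Table 12.2 data; the estimate β̂2 should match the value 0.01253 used later in Example 12.4, and σ̂² should match the 5.2352 reported in Table 12.6:

```python
import numpy as np

# Wire bond data: columns are x1 (wire length), x2 (die height), y (pull strength)
data = np.array([
    [2, 50, 9.95], [8, 110, 24.45], [11, 120, 31.75], [10, 550, 35.00],
    [8, 295, 25.02], [4, 200, 16.86], [2, 375, 14.38], [2, 52, 9.60],
    [9, 100, 24.35], [8, 300, 27.50], [4, 412, 17.08], [11, 400, 37.00],
    [12, 500, 41.95], [2, 360, 11.66], [4, 205, 21.65], [4, 400, 17.89],
    [20, 600, 69.00], [1, 585, 10.30], [10, 540, 34.93], [15, 250, 46.59],
    [15, 290, 44.88], [16, 510, 54.12], [17, 590, 56.63], [6, 100, 22.13],
    [5, 400, 21.15],
])
X = np.column_stack([np.ones(len(data)), data[:, 0], data[:, 1]])  # model matrix
y = data[:, 2]

# Least squares estimate: beta_hat = (X'X)^{-1} X'y
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Unbiased estimate of sigma^2: SSE / (n - p)
sse = y @ y - beta_hat @ (X.T @ y)
mse = sse / (len(y) - X.shape[1])
print(beta_hat, mse)
```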
Estimator of Variance
An unbiased estimator of σ² is
σ̂² = SSE / (n − p)
where SSE is the error sum of squares and p = k + 1 is the number of parameters in the model.
Properties of the Least Squares Estimators
The least squares estimators are unbiased: E(β̂) = β. The covariance matrix of β̂ is C = σ²(X′X)⁻¹. In general, the individual variances and covariances are V(β̂j) = σ²Cjj and cov(β̂i, β̂j) = σ²Cij.
Test for Significance of Regression
The test for significance of regression determines whether a linear relationship exists between the response y and any of the regressor variables. The hypotheses are H0: β1 = β2 = ⋯ = βk = 0 versus H1: βj ≠ 0 for at least one j. The test statistic is
f0 = MSR / MSE = (SSR / k) / (SSE / (n − p))
and H0 is rejected if f0 > fα,k,n−p.
Example 12.3 | Wire Bond Strength ANOVA
We will test for significance of regression (with α = 0.05) using the wire bond pull strength data from Example 12.1. The total sum of squares is
SST = y′y − (Σ yi)²/n = 27,178.5316 − (725.82)²/25 = 6105.9447
The regression or model sum of squares is computed as follows:
SSR = β̂′X′y − (Σ yi)²/n = 27,063.3581 − (725.82)²/25 = 5990.7712
and by subtraction, SSE = SST − SSR = 115.1735.
Example 12.3b | Wire Bond Strength ANOVA
• The analysis of variance is shown in Table 12.6. To test H0: β1 = β2 = 0, we calculate the statistic
f0 = MSR / MSE = 2995.3856 / 5.2352 = 572.17
• Since f0 > f0.05,2,22 = 3.44 (or since the P-value is considerably smaller than α = 0.05), we reject the null hypothesis and conclude that pull strength is linearly related to either wire length or die height, or both.
• Practical Interpretation: Rejection of H0 does not necessarily imply that the relationship found is an appropriate model for predicting pull strength as a function of wire length and die height. Further tests of model adequacy are required before we can be comfortable using this model in practice.
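The ANOVA arithmetic above can be checked directly from the sums of squares reported in the example:

```python
# ANOVA quantities from Example 12.3 (values taken from the text)
sst = 6105.9447          # total sum of squares
ssr = 5990.7712          # regression (model) sum of squares
sse = sst - ssr          # error sum of squares, obtained by subtraction
k, n = 2, 25             # number of regressors, number of observations

msr = ssr / k            # regression mean square
mse = sse / (n - k - 1)  # error mean square (n - p degrees of freedom)
f0 = msr / mse
print(sse, msr, mse, f0)
```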
Tests on Individual Regression Coefficients and Subsets of Coefficients
• The hypothesis H0: βj = 0 versus H1: βj ≠ 0 is tested with the statistic t0 = β̂j / √(σ̂²Cjj), where Cjj is the diagonal element of (X′X)⁻¹ corresponding to β̂j.
• Reject H0 if |t0| > tα/2,n−p.
• This is called a partial or marginal test because it measures the contribution of xj given that the other regressors are in the model.
Example 12.4 | Wire Bond Strength Coefficient Test
Consider the wire bond pull strength data, and suppose that we want to test the hypothesis that the regression coefficient for x2 (die height) is zero. The hypotheses are
H0: β2 = 0
H1: β2 ≠ 0
The main diagonal element of the (X′X)⁻¹ matrix corresponding to β̂2 is C22 = 0.0000015, so the t-statistic in Equation 12.25 is
t0 = β̂2 / √(σ̂²C22) = 0.01253 / √((5.2352)(0.0000015)) = 4.477
Example 12.4b | Wire Bond Strength Coefficient Test
Note that we have used the estimate of σ² reported to four decimal places in Table 12.6. Since t0.025,22 = 2.074, we reject H0: β2 = 0 and conclude that the variable x2 (die height) contributes significantly to the model. We could also have used a P-value to draw conclusions. The P-value for t0 = 4.477 is P = 0.0002, so with α = 0.05 we would reject the null hypothesis.
Practical Interpretation: Note that this test measures the marginal or partial contribution of x2 given that x1 is in the model. That is, the t-test measures the contribution of adding the variable x2 = die height to a model that already contains x1 = wire length. Table 12.4 shows the computer-generated value of the t-test, reported to two decimal places. Note that the computer produces a t-test for each regression coefficient in the model. These t-tests indicate that both regressors contribute to the model.
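The t-statistic can be recomputed from the values reported in the example (the small difference from the reported 4.477 comes from C22 being rounded to 0.0000015):

```python
import math

# t-test on beta_2 using the values reported in Example 12.4
beta2_hat = 0.01253
mse = 5.2352             # sigma^2 estimate from Table 12.6
c22 = 0.0000015          # diagonal element of (X'X)^{-1} for beta_2

t0 = beta2_hat / math.sqrt(mse * c22)
print(t0)                # close to the reported 4.477
```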
Tests on Individual Regression Coefficients and Subsets of Coefficients (ctd)
The general regression significance test, or the extra sum of squares method, tests whether a subset of r of the regression coefficients is zero. Partition the coefficient vector as β′ = [β1′ β2′], where β1 contains the r coefficients to be tested. The model may be written as
y = Xβ + ε = X1β1 + X2β2 + ε
and the hypotheses are H0: β1 = 0 versus H1: β1 ≠ 0.
Tests on Individual Regression Coefficients and Subsets of Coefficients (ctd)
For the full model:
SSR(β) = β̂′X′y   (p = k + 1 degrees of freedom)
MSE = (y′y − β̂′X′y) / (n − p)
If H0 is true, the reduced model is y = X2β2 + ε, and
SSR(β2) = β̂2′X2′y   (p − r degrees of freedom)
Tests on Individual Regression Coefficients and Subsets of Coefficients (ctd)
The extra sum of squares due to β1, given that β2 is in the model, is
SSR(β1 | β2) = SSR(β) − SSR(β2)   (r degrees of freedom)
and the test statistic is
f0 = [SSR(β1 | β2) / r] / MSE
Reject H0 if f0 > fα,r,n−p. The test in Equation (12.33) is often referred to as a partial F-test.
Example 12.6 | Wire Bond Strength General Regression Test
Consider the wire bond pull-strength data in Example 12.1. We investigate the
contribution of two new variables, x3 and x4, to the model using the partial F-test
approach. The new variables are explained at the end of this example. That is,
we wish to test
H0: β3 = β4 = 0 H1: β3 ≠ 0 or β4 ≠ 0
To test this hypothesis, we need the extra sum of squares due to β3 and β4 or
SSR(β4, β3| β2, β1, β0) = SSR (β4, β3, β2, β1, β0) − SSR(β2, β1, β0)
= SSR(β4, β3, β2, β1|β0) − SSR (β2, β1|β0)
Example 12.6b | Wire Bond Strength General Regression Test
• In Example 12.3 we calculated
SSR(β2, β1 | β0) = β̂′X′y − (Σ yi)²/n = 5990.7712   (two degrees of freedom)
• Also, Table 12.4 shows the Minitab output for the model with only x1 and x2 as predictors. In the analysis of variance table, we can see that SSR = 5990.8 and this agrees with our calculation. In practice, the computer output would be used to obtain this sum of squares.
• If we fit the model Y = β0 + β1x1 + β2x2 + β3x3 + β4x4, we can use the same matrix formula. Alternatively, we can look at SSR from computer output for this model. The analysis of variance table for this model is shown in Table 12.7 and we see that
SSR(β4, β3, β2, β1 | β0) = 6024.0   (four degrees of freedom)
Therefore,
SSR(β4, β3 | β2, β1, β0) = 6024.0 − 5990.8 = 33.2   (two degrees of freedom)
Example 12.6c | Wire Bond Strength General Regression Test
• This is the increase in the regression sum of squares due to adding x3 and x4 to a model already containing x1 and x2. To test H0, calculate the test statistic
f0 = [SSR(β4, β3 | β2, β1, β0) / 2] / MSE = (33.2 / 2) / 4.1 = 4.05
• Note that MSE from the full model using x1, x2, x3, and x4 is used in the denominator of the test statistic. Because f0.05,2,20 = 3.49, we reject H0 and conclude that at least one of the new variables contributes significantly to the model. Further analysis and tests will be needed to refine the model and determine whether one or both of x3 and x4 are important.
• The mystery of the new variables can now be explained. These are quadratic powers of the original predictors wire length and die height. That is, x3 = x1² and x4 = x2². A test for quadratic terms is a common use of partial F-tests. With this information and the original data for x1 and x2, you can use computer software to reproduce these calculations. Multiple regression allows models to be extended in such a simple manner that the real meaning of x3 and x4 did not even enter into the test procedure.
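The arithmetic in this example can be verified from the sums of squares quoted in the text (SST comes from Example 12.3; the full-model MSE is recovered by subtraction):

```python
# Check Example 12.6 using the sums of squares reported in the text
sst = 6105.9447                      # total SS, from Example 12.3
ssr_full = 6024.0                    # SSR(b4, b3, b2, b1 | b0), four degrees of freedom
ssr_reduced = 5990.8                 # SSR(b2, b1 | b0), two degrees of freedom

extra_ss = ssr_full - ssr_reduced    # SSR(b4, b3 | b2, b1, b0) = 33.2
sse_full = sst - ssr_full            # error SS for the full model
mse_full = sse_full / 20             # n - p = 25 - 5 degrees of freedom
f0 = (extra_ss / 2) / mse_full
print(extra_ss, f0)
```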
Important Terms and Concepts
• All possible regressions
• Analysis of variance test in multiple regression
• Backward elimination
• Categorical variables
• Confidence interval on the mean response
• Confidence interval on the regression coefficient
• Cook's distance measure
• Cp statistic
• Extra sum of squares method
• Forward selection
• Full model
• Hat matrix
• Hidden extrapolation
• Indicator variables
• Inference (tests and intervals) on individual model parameters
• Influential observations
• Model parameters and their interpretation in multiple regression
• Multicollinearity
• Multiple regression model
• Outliers
• Partial or marginal test
• Polynomial regression model
• Prediction interval on a future observation
• PRESS statistic
• R²
• Residual analysis and model adequacy checking
• Significance of regression
• Standardized residuals
• Stepwise regression
• Studentized residuals
• Variable selection
• Variance inflation factor (VIF)
END OF PRESENTATION