0% found this document useful (0 votes)

9 views36 pages

Simple Linear Regression

The document discusses simple linear regression analysis, a statistical method used to estimate the relationship between a dependent variable and an independent variable. It covers the construction of regression models, the interpretation of regression coefficients, and the use of regression for description, control, and prediction. Additionally, it addresses the assumptions necessary for regression inference and the analysis of residuals to validate the model's appropriateness.

Uploaded by

Michael Arieh Medina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views36 pages

Simple Linear Regression

Uploaded by

Michael Arieh Medina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 36

Applied Business Forecasting

and Planning

Simple Linear Regression

Simple Regression
 Simple regression analysis is a statistical tool That
gives us the ability to estimate the mathematical
relationship between a dependent variable (usually
called y) and an independent variable (usually
called x).
 The dependent variable is the variable for which
we want to make a prediction.
 While various non-linear forms may be used,
simple linear regression models are the most
common.
Introduction
• The primary goal of quantitative lot size Man-hours
analysis is to use current
information about a phenomenon
30 73
to predict its future behavior. 20 50
• Current information is usually in 60 128
the form of a set of data. 80 170
• In a simple case, when the data 40 87
form a set of pairs of numbers,
we may interpret them as
50 108
representing the observed values 60 135
of an independent (or predictor ) 30 69
variable X and a dependent ( or
response) variable Y. 70 148
60 132
Introduction
 The goal of the analyst Statistical relation between Lot size and Man-Hour

who studies the data is to

180

160

find a functional relation 140

y  f (x) 120

between the response

100

Man-Hour
80

variable y and the 60

predictor variable x. 40

0
0 10 20 30 40 50 60 70 80 90
Lot size
Regression Function
 The statement that the relation
between X and Y is statistical
should be interpreted as providing
the following guidelines:
1. Regard Y as a random variable.
2. For each X, take f (x) to be the
expected value (i.e., mean value) of
y.
3. Given that E (Y) denotes the
expected value of Y, call the
equation
E (Y )  f ( x)
the regression function.
Pictorial Presentation of Linear Regression
Model
Historical Origin of Regression
 Regression Analysis was
first developed by Sir
Francis Galton, who
studied the relation
between heights of sons
and fathers.
 Heights of sons of both
tall and short fathers
appeared to “revert” or
“regress” to the mean of
the group.
Construction of Regression Models
 Selection of independent variables
• Since reality must be reduced to manageable proportions whenever we
construct models, only a limited number of independent or predictor
variables can or should be included in a regression model. Therefore a
central problem is that of choosing the most important predictor variables.
 Functional form of regression relation
• Sometimes, relevant theory may indicate the appropriate functional form.
More frequently, however, the functional form is not known in advance
and must be decided once the data have been collected and analyzed.
 Scope of model
 In formulating a regression model, we usually need to restrict the
coverage of model to some interval or region of values of the independent
variables.
Uses of Regression Analysis
 Regression analysis serves Three major
purposes.
1. Description

2. Control

3. Prediction

 The several purposes of regression analysis

frequently overlap in practice
Formal Statement of the Model
 General regression model
Y  0  1 X  
1. 0, and 1 are parameters

2. X is a known constant
3. Deviations  are independent N(o, 2)
Meaning of Regression Coefficients
 The values of the regression parameters 0,
and 1 are not known.We estimate them
from data.
 1 indicates the change in the mean
response per unit increase in X.
Regression Line
 If the scatter plot of our sample data
suggests a linear relationship between two
variables i.e.
y  0  1 x

we can summarize the relationship by

drawing a straight line on the plot.
 Least squares method give us the “best”
estimated line for our set of sample data.
Regression Line
 We will write an estimated regression line
based on sample data as
yˆ b0  b1 x

 The method of least squares chooses the

values for b0, and b1 to minimize the sum of
squared errors
n n 2

SSE  ( yi  yˆ i ) 2  y  b0  b1 x 
i 1 i 1
Regression Line
 Using calculus, we obtain estimating
formulas: n n n n

 (x i  x )( yi  y ) n  xi yi  x y i i
b1  i 1
n
 i 1
n
i 1
n
i 1

 (x i  x) 2
n xi2  ( xi ) 2
or i 1 i 1 i 1

Sy
b1 r
Sx

b0  y  b1 x
Estimation of Mean Response
 Fitted regression line can be used to estimate the
mean value of y for a given value of x.
 Example
 The weekly advertising expenditure (x) and weekly
sales (y) are presented in the following table.
y x
1250 41
1380 54
1425 63
1425 54
1450 48
1300 46
1400 62
1510 61
1575 64
1650 71
Point Estimation of Mean Response
 From previous table we have:
n 10  x 564  x 326042

 y 14365  xy 818755
 The least squares estimates of the regression
coefficients are:
n xy  x y 10(818755)  (564)(14365) 10.8
b1 
n  x  ( x )
2 2
10(32604)  (564)2

b0 1436.5  10.8(56.4) 828

Point Estimation of Mean Response
 The estimated regression function is:
ŷ 828  10.8x
Sales 828  10.8 Expenditure

 This means that if the weekly advertising

expenditure is increased by $1 we would expect
the weekly sales to increase by $10.8.
Point Estimation of Mean Response
 Fitted values for the sample data are
obtained by substituting the x value into the
estimated regression function.
 For example if the advertising expenditure
is $50, then the estimated Sales is:
Sales 828  10.8(50) 1368
 This is called the point estimate (forecast)
of the mean response (sales).
Example:Retail sales and floor space
 It is customary in retail operations to asses the
performance of stores partly in terms of their
annual sales relative to their floor area (square
feet). We might expect sales to increase linearly as
stores get larger, with of course individual
variation among stores of the same size. The
regression model for a population of stores says
that
SALES = 0 + 1 AREA + 
Example:Retail sales and floor space
 The slope 1 is as usual a rate of change: it is the
expected increase in annual sales associated with
each additional square foot of floor space.
 The intercept 0 is needed to describe the line but
has no statistical importance because no stores
have area close to zero.
 Floor space does not completely determine sales.
The term  in the model accounts for difference
among individual stores with the same floor space.
A store’s location, for example, is important.
Residual
 The difference between the observed value
yi and the corresponding fitted value ŷi .
ˆi
ei  yi  y

 Residuals are highly useful for studying

whether a given regression model is
appropriate for the data at hand.
Example: weekly advertising expenditure
y x y-hat Residual (e)
1250 41 1270.8 -20.8
1380 54 1411.2 -31.2
1425 63 1508.4 -83.4
1425 54 1411.2 13.8
1450 48 1346.4 103.6
1300 46 1324.8 -24.8
1400 62 1497.6 -97.6
1510 61 1486.8 23.2
1575 64 1519.2 55.8
1650 71 1594.8 55.2
Estimation of the variance of the error
terms, 2
 The variance 2 of the error terms i in the
regression model needs to be estimated for
a variety of purposes.
 It gives an indication of the variability of the
probability distributions of y.
 It is needed for making inference concerning
regression function and the prediction of y.
Regression Standard Error
 To estimate  we work with the variance and take the
square root to obtain the standard deviation.
 For simple linear regression the estimate of 2 is the
average squared residual.

1 1
 i n 2 i i
2 2 2
s y. x  e  ( y  ˆ
y )
n 2
 To estimate  , use
2
 s 
s estimates the standard deviation
y. x s y . x  of the error term  in
the statistical model for simple linear regression.
Regression Standard Error
y x y-hat Residual (e) square(e)
1250 41 1270.8 -20.8 432.64
1380 54 1411.2 -31.2 973.44
1425 63 1508.4 -83.4 6955.56
1425 54 1411.2 13.8 190.44
1450 48 1346.4 103.6 10732.96
1300 46 1324.8 -24.8 615.04
1400 62 1497.6 -97.6 9525.76
1510 61 1486.8 23.2 538.24
1575 64 1519.2 55.8 3113.64
1650 71 1594.8 55.2 3047.04

y-hat = 828+10.8X total 36124.76

Sy.x 67.19818
Basic Assumptions of a Regression Model
 A regression model is based on the following
assumptions:
1. There is a probability distribution of Y for each
level of X.
2. Given that µy is the mean value of Y, the
standard form of the model is
 y  f (x)  
where  is a random variable with a normal
distribution with mean 0 and standard deviation .
Conditions for Regression Inference
 You can fit a least-squares line to any set of
explanatory-response data when both variables are
quantitative.
 If the scatter plot doesn’t show an approximately
linear pattern, the fitted line may be almost
useless.
Conditions for Regression Inference
 The simple linear regression model, which
is the basis for inference, imposes several
conditions.
 We should verify these conditions before
proceeding with inference.
 The conditions concern the population, but
we can observe only our sample.
Conditions for Regression Inference
 In doing Inference, we assume:
1. The sample is an SRS from the population.
2. There is a linear relationship in the population.
1. We can not observe the population , so we check the
scatter plot of the sample data.
3. The standard deviation of the responses about the
population line is the same for all values of the
explanatory variable.
1. The spread of observations above and below the least-
squares line should be roughly uniform as x varies.
Conditions for Regression Inference
 Plotting the residuals against the
explanatory variable is helpful in checking
these conditions because a residual plot
magnifies patterns.
Analysis of Residual
 To examine whether the regression model is
appropriate for the data being analyzed, we can
check the residual plots.
 Residual plots are:
 Plot a histogram of the residuals
 Plot residuals against the fitted values.
 Plot residuals against the independent variable.
 Plot residuals over time if the data are chronological.
Analysis of Residual
 A histogram of the residuals provides a check on the
normality assumption. A Normal quantile plot of the
residuals can also be used to check the Normality
assumptions.
 Regression Inference is robust against moderate lack of
Normality. On the other hand, outliers and influential
observations can invalidate the results of inference for
regression
 Plot of residuals against fitted values or the independent
variable can be used to check the assumption of
constant variance and the aptness of the model.
Analysis of Residual
 Plot of residuals against time provides a
check on the independence of the error
terms assumption.
 Assumption of independence is the most
critical one.
Residual plots
 The residuals should
have no systematic Degree Days Residual Plot
pattern.
1
 The residual plot to 0.5
right shows a scatter

Residuals
0
of the points with no 0 20 40 60
individual -0.5

observations or -1
Degree Days
systematic change as x
increases.
Residual plots
 The points in this
residual plot have a
curve pattern, so a
straight line fits poorly
Residual plots
 The points in this plot
show more spread for
larger values of the
explanatory variable x,
so prediction will be
less accurate when x is
large.

Simple Linear Regression
No ratings yet
Simple Linear Regression
95 pages
Lecture 6 Simple Linear Regression
No ratings yet
Lecture 6 Simple Linear Regression
36 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
27 pages
C6 Regression
No ratings yet
C6 Regression
27 pages
Chapter 2 Simple Linear Regression - Jan2023
No ratings yet
Chapter 2 Simple Linear Regression - Jan2023
66 pages
Simple Linear Regression and Correlation
No ratings yet
Simple Linear Regression and Correlation
39 pages
Chap01-3 (Autosaved)
No ratings yet
Chap01-3 (Autosaved)
51 pages
Simple Linear Regression Guide
100% (1)
Simple Linear Regression Guide
76 pages
1 - Stat-701 Regression
No ratings yet
1 - Stat-701 Regression
18 pages
Session 18 Regression
No ratings yet
Session 18 Regression
16 pages
Lecture6 Regression
No ratings yet
Lecture6 Regression
42 pages
Course 10-Part 1
No ratings yet
Course 10-Part 1
32 pages
F Regression
No ratings yet
F Regression
65 pages
PE Civil: Transportation Ebook Practice Exam
No ratings yet
PE Civil: Transportation Ebook Practice Exam
41 pages
Linear Regression
No ratings yet
Linear Regression
22 pages
Linear Regression Essentials
No ratings yet
Linear Regression Essentials
14 pages
Lecture-4 Simple Linear Regression
No ratings yet
Lecture-4 Simple Linear Regression
58 pages
ch12 0
No ratings yet
ch12 0
43 pages
BST 32202 Linear Regression 6 SLR Assumptions Lse
No ratings yet
BST 32202 Linear Regression 6 SLR Assumptions Lse
20 pages
1.linear Regression PSP
No ratings yet
1.linear Regression PSP
92 pages
06 Least Squar Regression
No ratings yet
06 Least Squar Regression
25 pages
STAT22209 - Chapter 02-Regression Analyisis - 2022
No ratings yet
STAT22209 - Chapter 02-Regression Analyisis - 2022
41 pages
Simple Linear Regression Guide
No ratings yet
Simple Linear Regression Guide
68 pages
Raw Introduction to Linear Regression (서울대 회귀분석 강의노트)
No ratings yet
Raw Introduction to Linear Regression (서울대 회귀분석 강의노트)
226 pages
Regression (Hrishikesh)
No ratings yet
Regression (Hrishikesh)
30 pages
Regression Analysis
No ratings yet
Regression Analysis
21 pages
Lecture 6 - Regression Analysis
No ratings yet
Lecture 6 - Regression Analysis
34 pages
325unit 1 Simple Regression Analysis
No ratings yet
325unit 1 Simple Regression Analysis
10 pages
Complete Business Statistics: Simple Linear Regression and Correlation
No ratings yet
Complete Business Statistics: Simple Linear Regression and Correlation
50 pages
15.simple Linear Regression-530
No ratings yet
15.simple Linear Regression-530
54 pages
Lect5 Math231
No ratings yet
Lect5 Math231
31 pages
Regression Coeffient
No ratings yet
Regression Coeffient
52 pages
03 Revisions L Regression
No ratings yet
03 Revisions L Regression
25 pages
Regression Models - Follow
No ratings yet
Regression Models - Follow
7 pages
Regression Kann Ur 14
No ratings yet
Regression Kann Ur 14
43 pages
Simple Regression Analysis Guide
No ratings yet
Simple Regression Analysis Guide
58 pages
Simple Linear Regression Guide
No ratings yet
Simple Linear Regression Guide
46 pages
Regression Analysis and Multiple Regression: Session 7
No ratings yet
Regression Analysis and Multiple Regression: Session 7
100 pages
Simple Linear Regression Sample
No ratings yet
Simple Linear Regression Sample
55 pages
1 - Simple Linear Regression
No ratings yet
1 - Simple Linear Regression
43 pages
Basic Economterics - I
No ratings yet
Basic Economterics - I
17 pages
Business Stats: Regression Basics
No ratings yet
Business Stats: Regression Basics
55 pages
Mungadze Linear
No ratings yet
Mungadze Linear
21 pages
Simple Linear Regression Analysis - ReliaWiki
No ratings yet
Simple Linear Regression Analysis - ReliaWiki
29 pages
Regression Analysis Using Excel
100% (1)
Regression Analysis Using Excel
85 pages
AI & ML: Linear Regression Guide
No ratings yet
AI & ML: Linear Regression Guide
55 pages
SEHH231320Chapter200120Simple20Linear20Regression20Part20I For20students1
No ratings yet
SEHH231320Chapter200120Simple20Linear20Regression20Part20I For20students1
29 pages
Linear Regression for Managers
No ratings yet
Linear Regression for Managers
9 pages
Topic Simple Linear Regression
No ratings yet
Topic Simple Linear Regression
38 pages
Simple Regression
No ratings yet
Simple Regression
35 pages
Regression Analysis for Students
No ratings yet
Regression Analysis for Students
10 pages
Unit 3 Da
No ratings yet
Unit 3 Da
20 pages
14 Statistics and Probability
No ratings yet
14 Statistics and Probability
37 pages
Module05 Notes
No ratings yet
Module05 Notes
19 pages
Mda-Session-7 Simple Linear Regression
No ratings yet
Mda-Session-7 Simple Linear Regression
75 pages
10 - Regression 1
No ratings yet
10 - Regression 1
58 pages
Module 3 - Regression and Correlation Analysis
No ratings yet
Module 3 - Regression and Correlation Analysis
54 pages
Simple Linear Regression Guide
100% (1)
Simple Linear Regression Guide
23 pages
Grambling State University - Department-Level Strategic Plan
No ratings yet
Grambling State University - Department-Level Strategic Plan
4 pages
Department Chair Supervisory Plan
No ratings yet
Department Chair Supervisory Plan
2 pages
Quantitative Research Thesis With Visuals
No ratings yet
Quantitative Research Thesis With Visuals
13 pages
Relevance of Research in Environmental Science
No ratings yet
Relevance of Research in Environmental Science
7 pages
Research Process Water Pollution
No ratings yet
Research Process Water Pollution
8 pages
Research Process Climate Change
No ratings yet
Research Process Climate Change
1 page
Impactofconst activitiesonEnvironmentJan2023
No ratings yet
Impactofconst activitiesonEnvironmentJan2023
9 pages
Philippine Regional Temperature Data
No ratings yet
Philippine Regional Temperature Data
2 pages
Nike Marketing Dissertation Help
100% (2)
Nike Marketing Dissertation Help
8 pages
Dorian Armstrong Resume-2022
No ratings yet
Dorian Armstrong Resume-2022
3 pages
State of Connecticut Department of Revenue Services 450 COLUMBUS BLVD. HARTFORD, CT 06103-1837 Mark Boughton, Commissioner
No ratings yet
State of Connecticut Department of Revenue Services 450 COLUMBUS BLVD. HARTFORD, CT 06103-1837 Mark Boughton, Commissioner
1 page
Inflation's Impact on Worker Performance
No ratings yet
Inflation's Impact on Worker Performance
6 pages
WZKat 0811 E
No ratings yet
WZKat 0811 E
70 pages
Freya Ponce
No ratings yet
Freya Ponce
10 pages
Nimbin Radio Sponsors & Programs
No ratings yet
Nimbin Radio Sponsors & Programs
1 page
Unit 3 Clutches
No ratings yet
Unit 3 Clutches
54 pages
SayreMicro11e - PPT - CH - 1 2
No ratings yet
SayreMicro11e - PPT - CH - 1 2
63 pages
Welfare Measrtes Literatrure
No ratings yet
Welfare Measrtes Literatrure
4 pages
Machine Learning: The Hundred-Page Book
No ratings yet
Machine Learning: The Hundred-Page Book
4 pages
Flowline Integrity & Risk Analysis
No ratings yet
Flowline Integrity & Risk Analysis
10 pages
OM - Sport - BSIV - Rev 03 - SBT PDF
No ratings yet
OM - Sport - BSIV - Rev 03 - SBT PDF
67 pages
For Telling Great Stories With Data: 5 Best Practices
No ratings yet
For Telling Great Stories With Data: 5 Best Practices
7 pages
OD432056579686600100
No ratings yet
OD432056579686600100
7 pages
04 Data Kids Activity Guide - What Are Your Favorite Songs
No ratings yet
04 Data Kids Activity Guide - What Are Your Favorite Songs
22 pages
Dynamic Modelling and Simulation of Gear Transmission Error For Gearbox Vibration Analysis
No ratings yet
Dynamic Modelling and Simulation of Gear Transmission Error For Gearbox Vibration Analysis
227 pages
Community Safety Plan
No ratings yet
Community Safety Plan
21 pages
WAUBNCF55JA073744 Min
No ratings yet
WAUBNCF55JA073744 Min
9 pages
Functions MTU DiaSys V2.73 PRO
No ratings yet
Functions MTU DiaSys V2.73 PRO
8 pages
Developer Log Analysis
No ratings yet
Developer Log Analysis
33 pages
Pengaruh Persepsi Atas Task Commitment Dan Sikap Terhadap Pemahaman Konsep Matematika Siswa
No ratings yet
Pengaruh Persepsi Atas Task Commitment Dan Sikap Terhadap Pemahaman Konsep Matematika Siswa
5 pages
Academic Profile: Eugene Agichtein
No ratings yet
Academic Profile: Eugene Agichtein
17 pages
Inkjet Print Speed Methodology
No ratings yet
Inkjet Print Speed Methodology
6 pages
Edmeston SX Welding Recommendations - Rev - 02-MO Okt 2020
No ratings yet
Edmeston SX Welding Recommendations - Rev - 02-MO Okt 2020
4 pages
Fillers - Riempitivi
No ratings yet
Fillers - Riempitivi
6 pages
Chapter 1 & 2 Businees
No ratings yet
Chapter 1 & 2 Businees
4 pages
Power Transformer Fire and Explosion - Causes and Control
0% (1)
Power Transformer Fire and Explosion - Causes and Control
10 pages
Art 1193-1206 Case Digest - From GOOGLE DRIVE
No ratings yet
Art 1193-1206 Case Digest - From GOOGLE DRIVE
3 pages
Consolidation Q35
No ratings yet
Consolidation Q35
2 pages

Simple Linear Regression

Uploaded by

Simple Linear Regression

Uploaded by

Applied Business Forecasting

Simple Linear Regression

who studies the data is to

find a functional relation 140

between the response

variable y and the 60

 The several purposes of regression analysis

we can summarize the relationship by

 The method of least squares chooses the

b0 1436.5  10.8(56.4) 828

 This means that if the weekly advertising

 Residuals are highly useful for studying

y-hat = 828+10.8X total 36124.76

You might also like