University of Padua
Statistics for Management
Simple Linear Regression (1)
Omar Paccagnella
Department of Statistical Sciences
University of Padua
omar.paccagnella@unipd.it
http://www.stat.unipd.it/~paccagnella
Statistics for Management, a.y. 2018/19 - Simple linear regression (1)
Introduction
What happens if:
• Data are time-oriented
• There is more than 1 variable
Unit   Shop surface (m²)   Weekly sales (1000 €)
 1      95                  43.2
 2     144                 132.0
 3     210                 155.0
 4     156                  76.0
 5     188                 100.9
 6     321                 187.4
 7     250                 185.0
 8     115                  60.7
 9     178                  82.9
10     105                  61.3
• A scatter diagram or scatter plot (that is, a two-dimensional graph)
may help to show a relationship between two variables
• Is this relationship linear? Is it positive or negative? If linear, could
we summarise such a relationship by fitting a straight line through the
data points (like a trend for a time series)?
The Correlation Coefficient
It measures the extent to which two variables (usually called X and Y) are
linearly related to each other
(in other words, the strength of such a linear relationship)
• In the population (which contains all possible values of the pair (X, Y)
of interest): ρ
• In the (random) sample drawn from this population: r
Often the two variables are measured in different units (in the example,
square metres & €); nevertheless it is important to measure the extent to
which X and Y are related
• Standardize the variables (constructing the Z-scores):

Z_X = (X − X̄) / S_X        Z_Y = (Y − Ȳ) / S_Y

where
– X̄: average value (mean) of X
– S_X: standard deviation of X
– n: number of units
• Calculate the mean cross product of the Z-scores:

r = (1/(n−1)) Σ Z_X Z_Y = Σ(X − X̄)(Y − Ȳ) / [√Σ(X − X̄)² · √Σ(Y − Ȳ)²]

– Correlation, not causation, is measured
– −1 ≤ r ≤ 1 (check the value of r = 0.8927 in the previous example)
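As a check, r for the shop-surface/sales data in the introductory table can be computed exactly as above, via the mean cross product of the Z-scores (a minimal Python sketch; the variable names are illustrative):

```python
from math import sqrt

# Shop surface (m²) and weekly sales (1000 €) from the introductory table
x = [95, 144, 210, 156, 188, 321, 250, 115, 178, 105]
y = [43.2, 132.0, 155.0, 76.0, 100.9, 187.4, 185.0, 60.7, 82.9, 61.3]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
# Sample standard deviations (n − 1 in the denominator)
sx = sqrt(sum((xi - mx) ** 2 for xi in x) / (n - 1))
sy = sqrt(sum((yi - my) ** 2 for yi in y) / (n - 1))

# Z-scores and their mean cross product
zx = [(xi - mx) / sx for xi in x]
zy = [(yi - my) / sy for yi in y]
r = sum(a * b for a, b in zip(zx, zy)) / (n - 1)

print(round(r, 4))  # -> 0.8927, matching the value quoted above
```

The same value comes out of the second (deviation-product) form of the formula, since the two expressions are algebraically identical.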
Fitting a Straight Line
• Could we find a straight line that is able to summarise the pattern of
all X − Y data points?
• Could we fit the best straight line?
• Could we exploit this best straight line to forecast unknown (future?)
values of the variable of interest (Y )?
• We may introduce a mathematical procedure to calculate both the
Y -intercept and the slope of the best-fitting straight line.
• Since many straight lines can be calculated, the most common
approach to determine the best fit is the method of least squares
(OLS - Ordinary Least Squares)
The best fitting line is the one that minimises the sum of
the squared distances between the data points and the line itself,
as measured in the vertical (Y ) direction
• In the population the straight line may be mathematically defined as:

Y = β0 + β1 X

• In the sample the straight line may be mathematically defined as:

Y = b0 + b1 X

where b0 and b1 are estimates of the true (but unknown) population
intercept and slope.
Given the sample values, we can predict the Y values on the fitted line:

Ŷ = b̂0 + b̂1 X

Ŷ is the value of Y we would observe if the data points lay exactly on the line
Least Squares
The idea behind this method is that the line will be appropriate to
describe the relationship under investigation if the observed values are
close to the straight line.
The distance between the observed and fitted values is the residual:

e_i = Y_i − Ŷ_i = Y_i − b0 − b1 X_i

According to the OLS criterion, the values of b0 and b1 are chosen in
order to minimise the sum of squared errors (residuals):

SSE = f(b0, b1) = Σ_{i=1}^n e_i² = Σ_{i=1}^n (Y_i − b0 − b1 X_i)²
First order conditions (that is, the derivatives of f(b0, b1) with respect to b0
and b1) are applied to minimise SSE. After a little algebra:

b̂1 = Σ_{i=1}^n (X_i − X̄)(Y_i − Ȳ) / Σ_{i=1}^n (X_i − X̄)²

b̂0 = Ȳ − b̂1 X̄
The least squares slope is related to the sample correlation coefficient:

b̂1 = [√Σ_{i=1}^n (Y_i − Ȳ)² / √Σ_{i=1}^n (X_i − X̄)²] · r

Hence, b̂1 and r are proportional to one another and have the same sign.
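The OLS formulas, and the proportionality between b̂1 and r, can be verified on the shop data from the introduction (a Python sketch; the rounded estimates printed here are a numerical check of the formulas, not values quoted in the slides):

```python
from math import sqrt

# Shop data from the introductory table
x = [95, 144, 210, 156, 188, 321, 250, 115, 178, 105]
y = [43.2, 132.0, 155.0, 76.0, 100.9, 187.4, 185.0, 60.7, 82.9, 61.3]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
sxx = sum((xi - mx) ** 2 for xi in x)            # Σ(X − X̄)²
syy = sum((yi - my) ** 2 for yi in y)            # Σ(Y − Ȳ)²
sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))  # Σ(X − X̄)(Y − Ȳ)

b1 = sxy / sxx          # least squares slope
b0 = my - b1 * mx       # intercept: Ȳ − b̂1 X̄

# Sample correlation, and the identity b̂1 = √Σ(Y − Ȳ)² / √Σ(X − X̄)² · r
r = sxy / (sqrt(sxx) * sqrt(syy))
assert abs(b1 - sqrt(syy) / sqrt(sxx) * r) < 1e-12

print(round(b0, 2), round(b1, 4))  # -> -10.19 0.6733
```

Since r > 0 here, the slope is positive too, as the identity requires.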
The linear regression model
According to the least squares criterion, we have the identity
Observation = Fit + Residual
formally
Y = Ŷ + (Y − Ŷ)
• The fit represents the overall pattern in the data
• The residuals represent deviations from the pattern
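This decomposition can be checked numerically on the shop data (a sketch; a standard property of OLS, used here, is that the residuals sum to zero whenever an intercept is included):

```python
# Shop data from the introductory table
x = [95, 144, 210, 156, 188, 321, 250, 115, 178, 105]
y = [43.2, 132.0, 155.0, 76.0, 100.9, 187.4, 185.0, 60.7, 82.9, 61.3]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / \
     sum((xi - mx) ** 2 for xi in x)
b0 = my - b1 * mx

fit = [b0 + b1 * xi for xi in x]                # Ŷ: the overall pattern
res = [yi - fi for yi, fi in zip(y, fit)]       # e = Y − Ŷ: deviations from it

# Observation = Fit + Residual holds exactly for every unit
assert all(abs(yi - (fi + ei)) < 1e-9 for yi, fi, ei in zip(y, fit, res))
# and the OLS residuals sum to zero (up to floating-point rounding)
assert abs(sum(res)) < 1e-6
```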
Observed data is a sample of observations on an underlying relation that
holds in the population.
For each value of X, the observed values of Y are identically distributed
around a mean µ_Y that depends linearly on X:

µ_Y = β0 + β1 X
As X changes, the means of the distributions of the possible values of Y
lie along a straight line. This is the so-called
population regression line
• Observed values of Y vary because of the presence of unknown
(and unmeasured) factors.
• This variation is the same for all values of X and is measured by the
standard deviation σ.
• The distance between a Y value and its mean is called the error (ε).
In the simple linear regression model:
• Y is the response or dependent variable.
• X is the controlled or explanatory (independent) variable.
• The dependent variable is the sum of its mean and a random
deviation (ε) from this mean.
• Deviations represent variation in Y due to unobserved factors that
prevent the pairs (X, Y) from lying exactly on the straight line.
The population regression model may be defined as:

Y = β0 + β1 X + ε
The sample regression line may be regarded as an estimate of the
population regression line,
µ_Y = β0 + β1 X
and the residuals e = Y − Ŷ may be regarded as estimates of the error
components
Therefore:
Y = b0 + b1 X + e
Some notes
• We may also write

b1 = Cov(X, Y) / Var(X)

if Var(X) ≠ 0
• b1 = 0 if and only if Cov(X, Y) = 0, that is, the two variables are
linearly independent
• Cov(X, Y) provides the sign of the b1 estimate
• The regression line always passes through the point of means (X̄, Ȳ)
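Both notes can be checked on the shop data (a sketch; the covariance form of the slope and the pass-through-the-means property follow directly from the OLS formulas):

```python
# Shop data from the introductory table
x = [95, 144, 210, 156, 188, 321, 250, 115, 178, 105]
y = [43.2, 132.0, 155.0, 76.0, 100.9, 187.4, 185.0, 60.7, 82.9, 61.3]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
# Sample covariance and variance (n − 1 in the denominator; it cancels in the ratio)
cov_xy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / (n - 1)
var_x = sum((xi - mx) ** 2 for xi in x) / (n - 1)

b1 = cov_xy / var_x     # slope as Cov(X, Y) / Var(X)
b0 = my - b1 * mx

# The fitted line always passes through the point of means (X̄, Ȳ)
assert abs((b0 + b1 * mx) - my) < 1e-9
```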
Steps of a Linear Regression Analysis
• Hypothesis on the linear functional relationship between the variable
of interest and the other variable(s)
• Estimation of the parameters of this functional relationship,
based on the available sample data
• Statistical testing of model estimates and goodness of fit
• Robustness checks on the main assumptions of the linear
regression model