Correlation and Regression
Lecture 17
By
Amir Iqbal
1
Correlation
Finding the relationship between two quantitative variables
without being able to infer causal relationships
Correlation is a statistical technique used to determine the
degree to which two variables are related
Scatter plots
The pattern of data is indicative of the type of relationship
between your two variables:
positive relationship
negative relationship
no relationship
Positive relationship
Negative relationship
Reliability
AgeofCar
No relation
Correlation Coefficient
Statistic showing the degree of relation between two variables
Simple Correlation coefficient (r)
It is also called Pearson's correlation or product moment correlation
coefficient.
It measures the nature and strength between two variables of the
quantitative type.
The sign of r denotes the nature of association
while the value of r denotes the strength of association.
If the sign is +ve this means the relation is direct (an increase in one
variable is associated with an increase in the other variable and a decrease
in one variable is associated with a decrease in the other variable).
While if the sign is -ve this means an inverse or indirect relationship
(which means an increase in one variable is associated with a decrease in
the other).
Cont
The value of r ranges between ( -1) and ( +1)
The value of r denotes the strength of the association as illustrated
by the following diagram.
strong
-1
intermediate
-0.75
-0.25
weak
weak
indirect
perfect
correlation
intermediate
0.25
strong
0.75
Direct
no relation
perfect
correlation
Cont.
If r = Zero this means no association or correlation between the
two variables.
If 0 < r < 0.25 = weak correlation.
If 0.25 r < 0.75 = intermediate correlation.
If 0.75 r < 1 = strong correlation.
If r = l = perfect correlation.
Exercise
Regression Analyses
Regression: technique concerned with predicting some
variables by knowing others
The process of predicting variable Y using variable X
Uses a variable (x) to predict some outcome variable (y)
Tells you how values in y change as a function of changes in
values of x
Correlation and Regression
Correlation describes the strength of a linear relationship
between two variables
Linear means straight line
Regression tells us how to draw the straight line described by
the correlation
Regression Equation
Regression equation
describes the regression line
mathematically
Intercept
Slope
Types of Regression
1. Simple Regression
.The simple regression model contains two variables (one
independent variable X and one dependent variable Y); this model is
therefore called Two-variables or Simple regression analysis model.
.For Example Y = 0 + 1X + e
2. Multiple regression
.Multiple regression analysis is a straightforward extension of
simple regression analysis which allows more than one
independent variable.
.For Example
Y = 0 + 1X1 + 2X2 + 3X3 + ... + NXN + e
Thank
You