University of Mumbai
Program – Bachelor of Engineering in
Computer Science and Engineering (Artificial Intelligence
and Machine Learning)
Class - T.E.
Course Code – CSDLO5011
Course Name – Statistics for Artificial
Intelligence and Data Science
By
Prof. A.V.Phanse
Correlation & Regression
Correlation is a statistical measure that describes the strength and direction of a
relationship between two variables.
It helps to understand how changes in one variable are associated with changes
in another.
Correlation can be positive, negative, or zero.
Positive Correlation: When one variable increases, the other also increases.
Negative Correlation: When one variable increases, the other decreases.
Zero Correlation: No consistent relationship exists between the variables.
The correlation is usually measured by a value called the correlation coefficient
(denoted as r), which ranges from -1 to +1:
r = +1: Perfect positive correlation.
r = -1: Perfect negative correlation.
r = 0: No correlation.
Example:
Consider two variables:
Hours studied (X)
Exam score (Y)
If we observe that as the number of hours a student studies increases, their exam
score tends to increase, this indicates a positive correlation.
For example:
2 hours studied → score of 60
4 hours studied → score of 75
6 hours studied → score of 90
In this case, more studying is associated with higher scores, indicating a positive
relationship.
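As a minimal Python sketch (assuming NumPy is installed; the variable names hours and scores are illustrative), Pearson's correlation coefficient r for the data above can be computed as:

import numpy as np

# Data from the example above: hours studied (X) vs. exam score (Y)
hours = np.array([2, 4, 6])
scores = np.array([60, 75, 90])

# np.corrcoef returns the 2x2 correlation matrix; the off-diagonal entry is r
r = np.corrcoef(hours, scores)[0, 1]
print(r)  # 1.0 here, since these toy points lie exactly on a straight line

The value r = 1.0 reflects that this three-point example is perfectly linear; real data would typically give a value between 0 and 1.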
Regression is a statistical method used to model and analyze the relationships
between a dependent variable (the outcome) and one or more independent
variables (predictors).
The primary goal of regression analysis is to predict the value of the dependent
variable based on the values of the independent variables and to understand how
the independent variables are associated with the dependent variable.
Types of Regression:
Linear Regression: The relationship between the dependent and independent
variables is modeled as a straight line (linear).
Multiple Regression: Similar to linear regression, but with more than one
independent variable.
Non-linear Regression: The relationship between the dependent and independent
variables is non-linear.
Logistic Regression: Used when the dependent variable is binary (e.g., yes/no,
0/1).
Key Differences Between Regression and Correlation:
Correlation quantifies the strength and direction of a relationship between two
variables but doesn't provide a model for predicting values.
Regression not only quantifies the relationship but also creates a predictive model,
allowing us to estimate outcomes based on input values.
Simple Linear Regression is a type of linear regression where we model the
relationship between two variables: one independent variable (predictor, X) and
one dependent variable (outcome, Y).
The goal is to fit a straight line through the data that best describes the relationship
between the two variables.
The Equation:
The simple linear regression model is represented by the equation:
Y = a + bX
Where:
Y is the predicted value of the dependent variable.
X is the independent variable.
a is the intercept (the predicted value of Y when X = 0).
b is the slope (the change in Y for each one-unit increase in X).
Example:
Let’s say we are interested in predicting a student's exam score (Y) based on the
number of hours studied (X). Here, X is the independent variable (hours studied),
and Y is the dependent variable (exam score).
Suppose after collecting data, we run a simple linear regression analysis and get the
following equation:
Y = 50 + 5X
This equation can be interpreted as follows:
Intercept (50): If a student does not study at all (X = 0), we predict that their score
will be 50.
Slope (5): For each additional hour of studying, the student’s score is expected to
increase by 5 points.
Predicting an Outcome:
If a student studies for 6 hours, their predicted exam score can be calculated by
plugging the value of X into the regression equation:
Y = 50 + 5(6) = 50 + 30 = 80
Thus, if a student studies for 6 hours, their expected exam score would be 80.
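As a sketch of how such an analysis could be run in Python (assuming SciPy is available; the data points are hypothetical, chosen to lie exactly on the line Y = 50 + 5X from the example):

from scipy.stats import linregress

# Hypothetical data consistent with the fitted line Y = 50 + 5X
hours  = [2, 4, 8, 10]
scores = [60, 70, 90, 100]

result = linregress(hours, scores)
print(result.intercept)  # a = 50.0
print(result.slope)      # b = 5.0

# Predicting the exam score for a student who studies 6 hours
print(result.intercept + result.slope * 6)  # 80.0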
Assumptions of Simple Linear Regression:
1. Linearity: The relationship between the independent and dependent variable is
linear.
2. Independence: The observations are independent of each other.
3. Homoscedasticity: The variance of the residuals (the differences between
observed and predicted values) is constant across all values of X.
4. Normality: The residuals are normally distributed.
Key Applications:
Predicting future outcomes (e.g., predicting sales based on advertising spend).
Understanding relationships (e.g., how much weight depends on caloric intake).
Building simple predictive models.
Method of least squares
The method of least squares is a fundamental technique used in regression
analysis to find the best-fitting line (regression line) through a set of data points.
It minimizes the sum of the squared differences between the observed values
and the values predicted by the regression line. These differences are called
residuals.
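A minimal Python sketch of the method (assuming NumPy; the data are the same hypothetical points used above), applying the closed-form least-squares formulas directly:

import numpy as np

# Hypothetical data points
x = np.array([2.0, 4.0, 8.0, 10.0])
y = np.array([60.0, 70.0, 90.0, 100.0])

# Least-squares estimates:
#   b = sum((x - x_bar)(y - y_bar)) / sum((x - x_bar)^2)
#   a = y_bar - b * x_bar
x_bar, y_bar = x.mean(), y.mean()
b = np.sum((x - x_bar) * (y - y_bar)) / np.sum((x - x_bar) ** 2)
a = y_bar - b * x_bar

residuals = y - (a + b * x)      # observed minus predicted values
print(a, b)                      # 50.0 5.0
print(np.sum(residuals ** 2))    # sum of squared residuals (0.0 for this exact fit)

These are the same a and b reported by linregress above, since both minimize the sum of squared residuals.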
Coefficient of Determination
The coefficient of determination (R²) measures the proportion of the variance in
the dependent variable that is explained by the regression model.
Since the R² value is 0.98, 98% of the variance in the test score can be explained
by the number of hours studied, indicating an excellent fit of the regression model.
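A short Python sketch (assuming NumPy; the observed and predicted values are hypothetical) showing how R² is computed from the residuals:

import numpy as np

# Hypothetical observed scores and the model's predictions for them
y_obs  = np.array([62.0, 69.0, 91.0, 98.0])
y_pred = np.array([60.0, 70.0, 90.0, 100.0])

# R² = 1 - SS_res / SS_tot
ss_res = np.sum((y_obs - y_pred) ** 2)        # residual sum of squares
ss_tot = np.sum((y_obs - y_obs.mean()) ** 2)  # total sum of squares
print(1 - ss_res / ss_tot)                    # about 0.99 for these numbers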
Example from University Exam for Practice
Multiple linear regression
Multiple linear regression is an extension of simple linear regression: instead of
modeling the relationship between a single independent variable x and a
dependent variable y, we model the relationship between multiple independent
variables x1, x2, …, xk and the dependent variable y.
The model is represented by the equation:
y = a + b1x1 + b2x2 + … + bkxk
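A minimal Python sketch (assuming NumPy; the two predictors and the outcome values are hypothetical) that fits a multiple linear regression by least squares:

import numpy as np

# Hypothetical data: 4 observations, 2 predictors (x1, x2)
# The leading column of ones produces the intercept a
X = np.array([[1.0, 2.0, 3.0],
              [1.0, 4.0, 1.0],
              [1.0, 6.0, 5.0],
              [1.0, 8.0, 2.0]])
y = np.array([65.0, 68.0, 92.0, 88.0])

# Least-squares solution of X @ coeffs ≈ y; coeffs = [a, b1, b2]
coeffs, _, _, _ = np.linalg.lstsq(X, y, rcond=None)
print(coeffs)

# Predicting y for a new observation with x1 = 5, x2 = 3
print(np.array([1.0, 5.0, 3.0]) @ coeffs)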
Example from University Exam for Practice
Thank You…