Artificial Intelligence and Automation
Training Linear Models
Ph.D. Gerardo Marx Chávez-Campos
Instituto Tecnológico de Morelia: Ing. Mecatrónica
Introduction
Summary
◮ The Classification/Prediction task is performed by a function that
converts some input into a desired output.
◮ Error is the main measure used to determine whether our
Classification/Prediction task performs well.
◮ A problem with the model’s adjustments is that the model is
updated to match only the last training example, discarding all
previous training examples.
◮ A good way to fix this is to moderate the updates with a
learning rate (α); thus, no single training example totally
dominates the learning (a minimal sketch follows).
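As an illustration only, here is a minimal Python sketch of how the learning rate α moderates each per-example update; the feature and target arrays below are hypothetical placeholders, not data from the laboratory.

```python
import numpy as np

# Hypothetical 1-D data: one feature value and one target per example
X = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.1, 3.9, 6.2, 8.1])

theta0, theta1 = 0.0, 0.0   # model parameters (intercept, slope)
alpha = 0.01                # learning rate: scales every update

# One pass over the training set, updating after each example
for xi, yi in zip(X, y):
    error = (theta0 + theta1 * xi) - yi
    # Without alpha, the parameters would jump to fit only this example;
    # alpha keeps each correction small so earlier examples still matter.
    theta0 -= alpha * error
    theta1 -= alpha * error * xi

print(theta0, theta1)
```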
Introduction
So far, Machine Learning models and their training have been black boxes.
In this Lecture, we will start by looking at the Linear Regression
model, one of the simplest models. We will discuss two different
ways to train it:
◮ Using a direct “closed-form” equation that directly computes
the model parameters that best fit the model to the training
set.
◮ Using an iterative optimization approach called Gradient
Descent (GD) that gradually tweaks the model parameters to
minimize the cost function over the training set.
Next, we will look at Polynomial Regression, a more complex
model that can fit non-linear datasets.
Finally, we will look at two more models that are commonly used for
classification tasks: Logistic Regression and Softmax Regression.
Linear Regression I
In the first laboratory session, we developed a simple regression model
of life satisfaction:
lifeSatis = θ0 + θ1 × GDPperCapita (1)
where θ0 and θ1 are the model’s parameters.
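As a sketch, equation (1) can be fitted with Scikit-Learn. The GDP and life-satisfaction values below are made-up placeholders, not the real dataset used in the laboratory.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical training data (placeholder values)
gdp_per_capita = np.array([[20000], [30000], [40000], [50000]])  # feature
life_satisfaction = np.array([5.5, 6.0, 6.5, 7.0])               # target

model = LinearRegression()
model.fit(gdp_per_capita, life_satisfaction)

# theta_0 (bias) and theta_1 (weight) of equation (1)
print(model.intercept_, model.coef_[0])
print(model.predict([[35000]]))   # prediction for a new country
```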
Linear Regression II
More generally, a linear model makes a prediction by simply computing
a weighted sum of the input features plus a constant called the
bias term (intercept term):
ŷ = θ0 + θ1 x1 + θ2 x2 + θ3 x3 + · · · + θn xn (2)
with ŷ as the predicted value and
◮ n is the number of features
◮ xi is the ith feature value
◮ θj is the j th model parameter (θ0 is the bias term; θ1 , . . . , θn are the feature weights)
Vectorized form
A vectorized form of the Linear Regressor is:
ŷ = hθ (x) = θ · x (3)
◮ θ is the model’s parameter vector
◮ x is the instance’s feature vector, containing x0 to xn , with
x0 = 1
◮ θ · x is the dot product θ0 x0 + θ1 x1 + θ2 x2 + θ3 x3 + · · · + θn xn
◮ hθ is the hypothesis function, using the model parameters θ (see the sketch below)
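A small NumPy sketch of equation (3), assuming made-up parameter and feature values, with x0 = 1 prepended so the bias term is handled by the dot product.

```python
import numpy as np

theta = np.array([4.0, 3.0, -2.0])       # [theta_0, theta_1, theta_2]
x_features = np.array([1.5, 0.5])        # raw features x_1, x_2
x = np.concatenate(([1.0], x_features))  # prepend x_0 = 1 for the bias term

# Hypothesis h_theta(x) = theta · x
y_hat = np.dot(theta, x)
print(y_hat)   # 4 + 3*1.5 - 2*0.5 = 7.5
```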
How do we train it?
◮ Training a model means setting its parameters so that the model
best fits the training set.
◮ We need a measure of how well (or poorly) the model
fits the data.
◮ The Root Mean Square Error (RMSE) is the most common
measure.
◮ To train the LR model, you need to find the θ that minimizes the
RMSE. In practice, it is simpler to minimize the MSE, which leads to the same θ.
The MSE Cost Function
The Mean Square Error (MSE) of a Linear Regression hypothesis
hθ on a training set X is calculated using:

MSE(X, hθ) = (1/m) Σ_{i=1}^{m} (θ · x^(i) − y^(i))^2 (4)

J(θ) = MSE(X, hθ) (5)

where m is the number of training instances, and x^(i), y^(i) are the feature vector and target of the ith instance.
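A direct NumPy implementation of equation (4), using a hypothetical design matrix X (with the x0 = 1 column already included) and a hypothetical target vector y; the RMSE is simply the square root of this quantity.

```python
import numpy as np

def mse(X, y, theta):
    """Mean Squared Error of the hypothesis h_theta over the training set."""
    errors = X @ theta - y        # theta · x^(i) - y^(i) for every instance
    return np.mean(errors ** 2)   # (1/m) * sum of squared errors

# Hypothetical data: 3 instances, bias column x_0 = 1 plus one feature
X = np.array([[1.0, 1.0],
              [1.0, 2.0],
              [1.0, 3.0]])
y = np.array([2.0, 2.5, 3.5])
theta = np.array([1.0, 0.8])

print(mse(X, y, theta))             # J(theta)
print(np.sqrt(mse(X, y, theta)))    # RMSE
```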
The Normal Equation I
To find the value of θ that minimizes the cost function J(θ), there
is a closed-form solution, that is, a mathematical equation that
gives the result directly. This is called the Normal Equation.
The minimizer satisfies

∂J(θ)/∂θ = 0

Writing the training instances as the rows of the design matrix X
(with x0 = 1 in every row) and the targets as the vector y, and
dropping the constant factor 1/m (it does not change the minimizer),
the cost becomes

J(θ) ∝ (Xθ − y)^T (Xθ − y) = [(Xθ)^T − y^T](Xθ − y)
The Normal Equation II
Taking the derivative of the cost and setting it to zero:

∂J(θ)/∂θ = 0

∂J(θ)/∂θ = ∂/∂θ (Xθ − y)^T (Xθ − y) = ∂/∂θ [(Xθ)^T − y^T](Xθ − y)
The Normal Equation III
Theorem. The following properties hold:
(A^T)^T = A
(A + B)^T = A^T + B^T
(kA)^T = k A^T
(AB)^T = B^T A^T
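A quick NumPy check of these properties on arbitrary matrices (random values, purely illustrative); note in particular that the transpose of a product reverses the order of the factors.

```python
import numpy as np

A = np.random.rand(3, 2)
B = np.random.rand(3, 2)   # same shape as A, for the sum property
C = np.random.rand(2, 4)   # compatible shape for the product property
k = 2.5

print(np.allclose((A.T).T, A))             # (A^T)^T = A
print(np.allclose((A + B).T, A.T + B.T))   # (A + B)^T = A^T + B^T
print(np.allclose((k * A).T, k * A.T))     # (kA)^T = k A^T
print(np.allclose((A @ C).T, C.T @ A.T))   # (AC)^T = C^T A^T
```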
The Normal Equation IV
Note that (Xθ)^T y = y^T (Xθ), since both are scalars. Expanding the
product and setting the derivative to zero:

0 = ∂/∂θ [(Xθ)^T (Xθ) − (Xθ)^T y − y^T (Xθ) + y^T y]
0 = ∂/∂θ [θ^T X^T X θ − 2 (Xθ)^T y + y^T y]
0 = 2 X^T X θ − 2 X^T y
2 X^T X θ = 2 X^T y
X^T X θ = X^T y
θ̂ = (X^T X)^{-1} X^T y
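A NumPy sketch of the Normal Equation on made-up data generated from y = 4 + 3x plus noise; the closed-form estimate should agree with Scikit-Learn's LinearRegression.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data: y ≈ 4 + 3x with Gaussian noise
m = 100
x = 2 * np.random.rand(m, 1)
y = 4 + 3 * x[:, 0] + np.random.randn(m)

X = np.c_[np.ones((m, 1)), x]   # add x_0 = 1 to every instance

# Normal Equation: theta_hat = (X^T X)^(-1) X^T y
theta_hat = np.linalg.inv(X.T @ X) @ X.T @ y
print(theta_hat)                           # approximately [4, 3]

# Cross-check against Scikit-Learn
lin_reg = LinearRegression().fit(x, y)
print(lin_reg.intercept_, lin_reg.coef_)   # same values
```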
References
https://www.geeksforgeeks.org/ml-normal-equation-in-linear-regression/
https://prutor.ai/normal-equation-in-linear-regression/
https://towardsdatascience.com/performing-linear-regression-using-the-normal-equation-6372ed3c57
Géron, Aurélien. Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O’Reilly Media, 2017.