Foundations of Machine Learning
Dr. Panashe Chiurunge
Machine Learning
TensorFlow
Directed Acyclic Graphs
TensorFlow Eager Execution Mode
TensorFlow Keras API
Linear Regression with TensorFlow
What is Machine Learning
The types of Machine Learning
What is Machine Learning
We are trying to learn from data, or to learn a representation of the data.
To formulate the basic learning-from-data problem, we must specify several basic elements: data spaces, probability measures, loss functions, and statistical risk.
Machine Learning – Data Space
We have to learn from some data
Learning from data begins with the specification of two spaces: an input space X and an output space Y.
The input space X is also sometimes called the feature space.
The output space Y is also called the "label space", "outcome space", "signal range", or, in statistical regression, the "response space".
Machine Learning
We then want to learn a function that maps points in the feature space to outputs, in spite of random noise within the data.
Machine Learning
The basic problem in machine learning is to determine a mapping f : X → Y that takes an input x ∈ X and predicts the output y ∈ Y.
Machine Learning – Loss Functions
Since we are trying to predict/classify labels, we need to measure the performance of our learner in some way.
Suppose we have a true label y ∈ Y and a label prediction ŷ ∈ Y.
A loss function measures how "different" these two quantities are. Formally, a loss function is a map ℓ : Y × Y → [0, ∞).
Machine Learning – Loss Functions
Cost function: in regression or estimation problems, Y = ℝ, and the squared error loss ℓ(y, ŷ) = (y − ŷ)² is often employed.
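The squared error loss above can be sketched directly; the function name below is ours, not part of any library:

```python
# Squared error loss: a minimal sketch of the loss map l(y, y_hat).
def squared_error(y_true, y_pred):
    """Return the squared difference between a true label and a prediction."""
    return (y_true - y_pred) ** 2

# A perfect prediction incurs zero loss; errors grow quadratically.
print(squared_error(3.0, 3.0))  # 0.0
print(squared_error(3.0, 1.0))  # 4.0
```

Squaring keeps the loss non-negative and penalizes large errors more heavily than small ones.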
Machine Learning – Loss Functions
Cost function: the loss function can be used to measure the "risk" of a learning rule f, i.e. the expected loss R(f) = E[ℓ(Y, f(X))].
We have to minimize this risk as we learn our data representation.
Machine Learning – Linear Regression
Linear regression is simply finding the best possible line of fit to represent a set of data points.
In machine learning terms, we are creating a learning rule that fits a line to represent our data.
Machine Learning – Linear Regression
Let’s suppose we want to model the above set of points with a line.
To do this we’ll use the standard line equation y = mx + b, where m is the line’s gradient and b is the line’s intercept.
Machine Learning – Linear Regression
To find the best line for our data, we need to find the best pair of gradient (m) and intercept (b) values.
Machine Learning – Linear Regression
A standard approach to solving this type of problem is
to define an error function (also called a cost
function/loss function) that measures how “good” a
given line is.
This function will take in a (m,b) pair and return an error
value based on how well the line fits our data.
Machine Learning – Linear Regression
To compute this error for a given line, we’ll iterate through each (x, y) point in our data set and sum the squared distances between each point’s y value and the candidate line’s y value (computed as mx + b).
It’s conventional to square this distance to ensure that it is positive and to make our cost function differentiable.
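The error computation described above can be sketched as follows (we average the squared distances over the N points; the function name is ours):

```python
# Mean squared error of a candidate line y = m*x + b over a set of points.
def compute_error(m, b, points):
    total = 0.0
    for x, y in points:
        # Vertical distance between the point and the line, squared.
        total += (y - (m * x + b)) ** 2
    return total / len(points)

pts = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # points on y = 2x
print(compute_error(2.0, 0.0, pts))  # 0.0 -- this line passes through every point
```

A worse line, e.g. m = 0 and b = 0, yields a strictly larger error value.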
Machine Learning – Linear Regression
Our loss function is the mean of these squared distances:
E(m, b) = (1/N) Σᵢ₌₁ᴺ (yᵢ − (m·xᵢ + b))²
Machine Learning – Linear Regression
Our loss function is E(m, b) = (1/N) Σᵢ (yᵢ − (m·xᵢ + b))².
Lines that fit our data better (where better is defined by
our cost function) will result in lower error values.
If we minimize this function, we will get the best line of
fit to represent our data.
Machine Learning – Linear Regression
Since our cost function depends on two parameters (m and b), we can visualize it as a two-dimensional surface.
Machine Learning – Gradient Descent
Each point in this two-dimensional space represents a line. The
height of the function at each point is the error value for that line.
You can see that some lines yield smaller error values than
others (i.e., fit our data better). When we run gradient descent
search, we will start from some location on this surface and
move downhill to find the line with the lowest error.
Machine Learning – Gradient Descent
To run gradient descent on this error function, we first need to
compute its gradient.
The negative gradient will act like a compass and always point us downhill.
To compute it, we will need to differentiate our error function.
Since our function is defined by two parameters (m and b), we
will need to compute a partial derivative for each.
These derivatives work out to be:
∂E/∂m = (2/N) Σᵢ −xᵢ·(yᵢ − (m·xᵢ + b))
∂E/∂b = (2/N) Σᵢ −(yᵢ − (m·xᵢ + b))
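These two partial derivatives translate directly into code; the sketch below is a plain-Python version (the function name is ours):

```python
# Partial derivatives of the mean squared error with respect to m and b.
def compute_gradients(m, b, points):
    n = len(points)
    grad_m = 0.0
    grad_b = 0.0
    for x, y in points:
        residual = y - (m * x + b)  # vertical distance to the candidate line
        grad_m += (-2.0 / n) * x * residual
        grad_b += (-2.0 / n) * residual
    return grad_m, grad_b
```

At a perfect fit every residual is zero, so both gradients vanish and the search stops moving.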
Machine Learning – Gradient Descent
We can initialize our search to start at any pair of m and b
values (i.e., any line) and let the gradient descent algorithm
march downhill on our error function towards the best line.
Each iteration will update m and b to a line that yields slightly
lower error than the previous iteration.
The direction to move in for each iteration is calculated using the
two partial derivatives
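One iteration of this update can be sketched as a single step function; repeated steps march m and b downhill (names and the sample data are ours, chosen so the true line is y = 2x + 1):

```python
# One gradient-descent update for m and b using the MSE partial derivatives.
def gradient_descent_step(m, b, points, learning_rate):
    n = len(points)
    grad_m = sum((-2.0 / n) * x * (y - (m * x + b)) for x, y in points)
    grad_b = sum((-2.0 / n) * (y - (m * x + b)) for x, y in points)
    # Move against the gradient, scaled by the learning rate.
    return m - learning_rate * grad_m, b - learning_rate * grad_b

# Points generated by y = 2x + 1; repeated steps should recover m ≈ 2, b ≈ 1.
points = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
m, b = 0.0, 0.0
for _ in range(5000):
    m, b = gradient_descent_step(m, b, points, learning_rate=0.05)
```

Starting from any (m, b) pair, each step yields a line with slightly lower error than the last.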
Machine Learning – Gradient Descent
The learning rate variable controls how large a step we take downhill during each iteration. If we take too large a step, we may step over the minimum.
However, if we take steps that are too small, it will require many iterations to arrive at the minimum.
Machine Learning – Gradient Descent
We can also observe how the error changes as we move toward
the minimum. A good way to ensure that gradient descent is
working correctly is to make sure that the error decreases for
each iteration.
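This check can be sketched by recording the error after every update and verifying it never increases (sample data and variable names are ours):

```python
# Track the error after each iteration to verify gradient descent is working.
points = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]  # generated by y = 2x + 1
m, b, lr = 0.0, 0.0, 0.05
n = len(points)
errors = []
for _ in range(100):
    grad_m = sum((-2.0 / n) * x * (y - (m * x + b)) for x, y in points)
    grad_b = sum((-2.0 / n) * (y - (m * x + b)) for x, y in points)
    m -= lr * grad_m
    b -= lr * grad_b
    errors.append(sum((y - (m * x + b)) ** 2 for x, y in points) / n)

# With a suitable learning rate, the error shrinks on every iteration.
assert all(later <= earlier for earlier, later in zip(errors, errors[1:]))
```

If the recorded error ever rises, the learning rate is likely too large.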
Machine Learning – Gradient Descent
We can also observe how the error changes as we move toward
the minimum. A good way to ensure that gradient descent is
working correctly is to make sure that the error decreases for
each iteration.
Machine Learning – Gradient Descent
Do the following until convergence:
m ← m − α · ∂E/∂m
b ← b − α · ∂E/∂b
where α is the learning rate.
Machine Learning –
Stochastic Gradient Descent
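Stochastic gradient descent updates m and b from one randomly chosen point per step instead of the full dataset. A minimal sketch, assuming noiseless data on y = 2x + 1 (the function name and defaults are ours, not a library API):

```python
import random

# SGD sketch: each update uses ONE randomly sampled point.
def sgd_fit(points, learning_rate=0.01, steps=20000, seed=0):
    rng = random.Random(seed)
    m, b = 0.0, 0.0
    for _ in range(steps):
        x, y = rng.choice(points)
        residual = y - (m * x + b)
        # Per-sample gradient of the squared error, negated and scaled.
        m += learning_rate * 2.0 * x * residual
        b += learning_rate * 2.0 * residual
    return m, b

points = [(1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]  # generated by y = 2x + 1
m, b = sgd_fit(points)
```

Each step is much cheaper than a full-batch step, at the cost of noisier progress toward the minimum.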
Machine Learning – GD Algorithms
Stochastic Gradient Descent (SGD)
Adaptive Moment Estimation (Adam)
Nesterov Accelerated Gradient (NAG)
Adaptive Gradient (AdaGrad)
Adaptive Learning Rate Method (AdaDelta)
Root Mean Square Propagation (RMSProp)
Machine Learning
Q&A