MODULE 5: Machine Learning 18AI61
MAXIMUM LIKELIHOOD AND LEAST-SQUARED ERROR HYPOTHESES
Consider the problem of learning a continuous-valued target function, a problem that arises in neural network learning, linear regression, and polynomial curve fitting.
A straightforward Bayesian analysis shows that, under certain assumptions, any learning algorithm that minimizes the squared error between the hypothesis predictions and the training data will output a maximum likelihood (ML) hypothesis.
• Learner L considers an instance space X and a hypothesis space H consisting of some class of real-valued functions defined over X, i.e., (∀ h ∈ H)[h : X → R], and training examples of the form <xi, di>.
• The problem faced by L is to learn an unknown target function f : X → R
• A set of m training examples is provided, where the target value of each example is
corrupted by random noise drawn according to a Normal probability distribution with
zero mean (di = f(xi) + ei)
• Each training example is a pair of the form (xi, di), where di = f(xi) + ei.
– Here f(xi) is the noise-free value of the target function and ei is a random variable
representing the noise.
– It is assumed that the values of the ei are drawn independently and that they are
distributed according to a Normal distribution with zero mean.
• The task of the learner is to output a maximum likelihood hypothesis or, equivalently, a MAP hypothesis, assuming all hypotheses are equally probable a priori.
Using the definition of hML we have
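$$ h_{ML} = \arg\max_{h \in H} \; p(D \mid h) $$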
Assuming the training examples are mutually independent given h, we can write P(D|h) as the product of the individual p(di|h):
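$$ h_{ML} = \arg\max_{h \in H} \; \prod_{i=1}^{m} p(d_i \mid h) $$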
Given that the noise ei obeys a Normal distribution with zero mean and unknown variance σ2, each di must also obey a Normal distribution with variance σ2 centered around the true target value f(xi) rather than around zero. Because we are writing the expression for P(D|h), we also assume h is the correct description of f.
Hence, µ = f(xi) = h(xi), and we can write
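$$ h_{ML} = \arg\max_{h \in H} \; \prod_{i=1}^{m} \frac{1}{\sqrt{2\pi\sigma^{2}}} \, e^{-\frac{1}{2\sigma^{2}}\left(d_i - h(x_i)\right)^{2}} $$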
Rather than maximizing this complicated expression, we maximize its (less complicated) logarithm; this is justified because ln p is a monotonic function of p:
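$$ h_{ML} = \arg\max_{h \in H} \; \sum_{i=1}^{m} \left[ \ln\frac{1}{\sqrt{2\pi\sigma^{2}}} - \frac{1}{2\sigma^{2}}\left(d_i - h(x_i)\right)^{2} \right] $$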
The first term in this expression is a constant independent of h, and can therefore be
discarded, yielding
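$$ h_{ML} = \arg\max_{h \in H} \; \sum_{i=1}^{m} -\frac{1}{2\sigma^{2}}\left(d_i - h(x_i)\right)^{2} $$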
Maximizing this negative quantity is equivalent to minimizing the corresponding positive
quantity
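$$ h_{ML} = \arg\min_{h \in H} \; \sum_{i=1}^{m} \frac{1}{2\sigma^{2}}\left(d_i - h(x_i)\right)^{2} $$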
Finally, we can again discard constants that are independent of h:
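$$ h_{ML} = \arg\min_{h \in H} \; \sum_{i=1}^{m} \left(d_i - h(x_i)\right)^{2} $$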
Thus, the above equation shows that the maximum likelihood hypothesis hML is the one that minimizes the sum of squared errors between the observed training values di and the hypothesis predictions h(xi).
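To make this result concrete, the following minimal Python sketch compares the two criteria on synthetic data; the linear hypothesis class h(x) = w·x + b, the target f(x) = 3x + 1, the noise level σ = 0.5, and the grid of candidate hypotheses are all illustrative assumptions, not part of the derivation above. Because ln P(D|h) is a constant minus the sum of squared errors scaled by 1/(2σ2), both criteria select the same hypothesis.

    import numpy as np

    rng = np.random.default_rng(0)

    # Synthetic training examples: di = f(xi) + ei, with an assumed target
    # f(x) = 3x + 1 and Gaussian noise ei with zero mean and variance sigma^2.
    m, sigma = 50, 0.5
    x = rng.uniform(-1, 1, size=m)
    d = 3.0 * x + 1.0 + rng.normal(0.0, sigma, size=m)

    def sum_squared_error(w, b):
        """Sum over the training examples of (di - h(xi))^2 for h(x) = w*x + b."""
        return np.sum((d - (w * x + b)) ** 2)

    def log_likelihood(w, b):
        """ln P(D|h) assuming di ~ Normal(h(xi), sigma^2), independently."""
        resid = d - (w * x + b)
        return np.sum(-0.5 * np.log(2 * np.pi * sigma**2) - resid**2 / (2 * sigma**2))

    # Search a small grid of candidate hypotheses: the minimizer of the squared
    # error and the maximizer of the log-likelihood coincide, as derived above.
    grid = [(w, b) for w in np.linspace(2, 4, 41) for b in np.linspace(0, 2, 41)]
    h_lse = min(grid, key=lambda p: sum_squared_error(*p))
    h_ml = max(grid, key=lambda p: log_likelihood(*p))
    print("least-squared-error hypothesis:", h_lse)
    print("maximum likelihood hypothesis: ", h_ml)  # same (w, b) as h_lse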
Note:
Why is it reasonable to choose the Normal distribution to characterize noise?
• Good approximation of many types of noise in physical systems
• The Central Limit Theorem shows that the sum of a sufficiently large number of independent, identically distributed random variables itself obeys a Normal distribution, regardless of the distributions of the individual variables.
The above analysis considers only noise in the target value of the training examples; it does not consider noise in the attributes describing the instances themselves.