ED5340 - Data Science: Theory and Practise
L15 - Optimization for multiple variables
Ramanathan Muthuganapathy (https://ed.iitm.ac.in/~raman)
Course web page: https://ed.iitm.ac.in/~raman/datascience.html
Moodle page: Available at https://courses.iitm.ac.in/
Unconstrained optimization
• Single variable (e.g. min J(w), with J(w) = w², J(w) = w³, or J(w) = w² + 54/w)
• Multivariable (e.g. min J(w1, w2) = (w1 − 2)² + (w2 − 2)²)
• n-dimensional multivariable (e.g. J(w1, w2, …, wn) = (w1 − 2)² + (w2 − 2)² + … + (wn − 2)²)
• min J(w1, w2, …, wn), as in the sketch below
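These example objectives are easy to write down in code; a minimal Python sketch (the function names J_single, J_two and J_n are my own labels, not from the course material):

```python
import numpy as np

def J_single(w):
    return w**2 + 54.0 / w            # single-variable example

def J_two(w1, w2):
    return (w1 - 2)**2 + (w2 - 2)**2  # two-variable example

def J_n(w):
    w = np.asarray(w, dtype=float)    # n-dimensional example
    return np.sum((w - 2)**2)

print(J_two(2, 2), J_n([2, 2, 2, 2]))   # both 0.0, i.e. the minimum value
```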
Surface plot
J(w1, w2) = (w1 − 2)² + (w2 − 2)²
[Surface plot of J(w1, w2)]
Contour plot / level set / height function
J(w1, w2) = (w1 − 2)² + (w2 − 2)²
• Two points on a contour have the same J value
• Imagine cutting the surface with J-planes at different J-values
[Contour plot of J(w1, w2)]
Demo using SrfPlots.py
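SrfPlots.py itself is not reproduced here; the sketch below is an assumed re-creation of such a demo, drawing the surface and contour plots of J(w1, w2) = (w1 − 2)² + (w2 − 2)² with Matplotlib:

```python
import numpy as np
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D  # registers the '3d' projection

def J(w1, w2):
    return (w1 - 2)**2 + (w2 - 2)**2

w1, w2 = np.meshgrid(np.linspace(-4, 8, 100), np.linspace(-4, 8, 100))
fig = plt.figure(figsize=(10, 4))

ax1 = fig.add_subplot(1, 2, 1, projection='3d')   # surface plot
ax1.plot_surface(w1, w2, J(w1, w2), cmap='viridis')

ax2 = fig.add_subplot(1, 2, 2)                    # contour plot / level sets
cs = ax2.contour(w1, w2, J(w1, w2), levels=15)
ax2.clabel(cs, inline=True)

plt.show()
```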
Optimality criteria - multiple variables
• min J(w)
• The value of w for which the function J(w) has the least (minimum) value
• Local minimum
Gradient - multiple variables
min J(w1, w2) = (w1 − 2)² + (w2 − 2)² - Partial derivatives
• J(w1, w2) = (w1 − 2)² + (w2 − 2)²
• ∂J/∂w1 - partial derivative of J(w1, w2) with respect to w1
• ∂J/∂w2 - partial derivative of J(w1, w2) with respect to w2
• ∇J(w1, w2) = (∂J/∂w1, ∂J/∂w2), written ∇J(w1, w2) or grad. J
• NOTE: grad. J is a vector (see the sketch below).
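A small sketch of this gradient in Python, with a central-difference check (the helper names and the step size h are my own choices):

```python
import numpy as np

def J(w):
    w1, w2 = w
    return (w1 - 2)**2 + (w2 - 2)**2

def grad_J(w):
    """Analytic gradient: the vector of the two partial derivatives."""
    w1, w2 = w
    return np.array([2.0 * (w1 - 2), 2.0 * (w2 - 2)])

def grad_numeric(f, w, h=1e-6):
    """Numerical gradient by central differences, for checking."""
    w = np.asarray(w, dtype=float)
    g = np.zeros_like(w)
    for i in range(w.size):
        e = np.zeros_like(w); e[i] = h
        g[i] = (f(w + e) - f(w - e)) / (2 * h)
    return g

print(grad_J([0.0, 0.0]))            # [-4. -4.]
print(grad_numeric(J, [0.0, 0.0]))   # close to [-4. -4.]
```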
Gradient - multiple variables
What is ∇J(w1, w2) or grad. J? Consider f(x, y, z)
• Surface f(x, y, z) = c
• Any curve (x(t), y(t), z(t)) lying on the surface, so f(x(t), y(t), z(t)) = c
• Differentiating: (∂f/∂x)(dx/dt) + (∂f/∂y)(dy/dt) + (∂f/∂z)(dz/dt) = 0
• (∂f/∂x, ∂f/∂y, ∂f/∂z) · (dx/dt, dy/dt, dz/dt) = 0
• ∇f · (x′(t), y′(t), z′(t)) = 0 (checked numerically in the sketch below)
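The same identity can be checked numerically for the two-variable J: its level sets are circles about (2, 2), and ∇J at a point of such a circle is perpendicular to the circle's tangent (a small sketch; the level value c and the parameter value t are arbitrary choices):

```python
import numpy as np

# For J(w1, w2) = (w1-2)^2 + (w2-2)^2, the level set J = c is a circle
# of radius sqrt(c) centred at (2, 2).
c = 4.0
t = 0.7                                    # arbitrary parameter value
w = np.array([2 + np.sqrt(c) * np.cos(t),  # point on the level curve J = c
              2 + np.sqrt(c) * np.sin(t)])
tangent = np.array([-np.sqrt(c) * np.sin(t),   # derivative of the parametrisation
                     np.sqrt(c) * np.cos(t)])
grad = np.array([2 * (w[0] - 2), 2 * (w[1] - 2)])

print(np.dot(grad, tangent))   # ~0: the gradient is normal to the level curve
```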
Gradient - multiple variables
What is ∇J(w1, w2) or grad. J? Consider f(x, y, z)
• ∇f is the grad. f and (x′(t), y′(t), z′(t)) is the tangent vector.
Gradient - multiple variables
What is ∇J(w1, w2) or grad. J? Consider f(x, y, z)
• Take another curve (blue)
• ∇f · (x′(t), y′(t), z′(t)) = 0
• Dot product
• ∇f is perpendicular to the set of tangents at that point.
Gradient - multiple variables
What is ∇J(w1, w2) or grad. J? Consider f(x, y, z)
• ∇f · (x′(t), y′(t), z′(t)) = 0
• Dot product
• ∇f is perpendicular to the set of tangents at that point.
• ∇f is the normal vector at a point.
Gradient at a point
What is ∇J(w1, w2) or grad. J? - Back to our notation
• ∇J(w1, w2) is a normal vector
Hessian Matrix
min J(w1, w2) = (w1 − 2)² + (w2 − 2)² - Second partial derivatives
• ∂²J/∂w1² = ∂/∂w1 (∂J/∂w1)
• ∂²J/∂w2² = ∂/∂w2 (∂J/∂w2)
• ∂²J/∂w1∂w2 = ∂/∂w1 (∂J/∂w2)
• ∂²J/∂w2∂w1 = ∂/∂w2 (∂J/∂w1), computed numerically in the sketch below
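The four second partials can also be approximated numerically; a minimal finite-difference sketch (the helper name and step size h are my own choices):

```python
import numpy as np

def J(w):
    w1, w2 = w
    return (w1 - 2)**2 + (w2 - 2)**2

def hessian_numeric(f, w, h=1e-4):
    """Matrix of second partial derivatives by central differences."""
    w = np.asarray(w, dtype=float)
    n = w.size
    H = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            ei = np.zeros(n); ei[i] = h
            ej = np.zeros(n); ej[j] = h
            H[i, j] = (f(w + ei + ej) - f(w + ei - ej)
                       - f(w - ei + ej) + f(w - ei - ej)) / (4 * h * h)
    return H

print(hessian_numeric(J, [2.0, 2.0]))   # approximately [[2, 0], [0, 2]]
```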
Hessian Matrix
Matrix of second partial derivatives
H = [ ∂²J/∂w1²     ∂²J/∂w1∂w2
      ∂²J/∂w2∂w1   ∂²J/∂w2²   ]
Optimality Criteria for Multiple Variables
min J(w1, w2) = (w1 − 2)² + (w2 − 2)²
• ∇J(w1, w2) = 0; get w* = (w1*, w2*)
• The Hessian H should be positive definite at w* for a minimum
• The Hessian H should be negative definite at w* for a maximum
• At a saddle point, H is indefinite
Optimality Criteria for Multiple Variables
How to find the type for H? (Use LA)
• H is positive definite if all the eigenvalues are > 0 (all λi > 0)
• H is negative definite if all the eigenvalues are < 0 (all λi < 0)
• H is indefinite if some eigenvalues are > 0 and some are < 0 (see the sketch below)
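A short eigenvalue check in NumPy corresponding to these rules (a sketch; the function name and tolerance are my own choices):

```python
import numpy as np

def classify_hessian(H, tol=1e-10):
    """Classify a symmetric Hessian by its eigenvalues."""
    eigvals = np.linalg.eigvalsh(H)      # H is symmetric, so eigvalsh applies
    if np.all(eigvals > tol):
        return eigvals, "positive definite (local minimum)"
    if np.all(eigvals < -tol):
        return eigvals, "negative definite (local maximum)"
    return eigvals, "indefinite (saddle point) or semi-definite"

print(classify_hessian(np.array([[2.0, 0.0], [0.0, 2.0]])))   # example Hessian
print(classify_hessian(np.array([[2.0, 0.0], [0.0, -2.0]])))  # e.g. for w1^2 - w2^2
```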
Example
min J(w1, w2) = (w1 − 2)² + (w2 − 2)²
• ∇J(w1, w2) = 0; get w* = (w1*, w2*)
• ∂J/∂w1 = 2(w1 − 2)
• ∂J/∂w2 = 2(w2 − 2)
• Critical point: w* = (w1*, w2*) = (2, 2), verified symbolically in the sketch below
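The same critical point can be obtained symbolically, for example with SymPy (a minimal sketch, assuming SymPy is available):

```python
import sympy as sp

w1, w2 = sp.symbols('w1 w2')
J = (w1 - 2)**2 + (w2 - 2)**2

grad = [sp.diff(J, w1), sp.diff(J, w2)]   # [2*w1 - 4, 2*w2 - 4]
critical = sp.solve(grad, [w1, w2])       # {w1: 2, w2: 2}
print(grad, critical)
```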
Example
Compute Hessian at (2, 2)
• ∂²J/∂w1² = 2
• ∂²J/∂w2² = 2
• ∂²J/∂w1∂w2 = 0
• ∂²J/∂w2∂w1 = 0
Hessian Matrix
Matrix of second partial derivatives
H = [ 2  0
      0  2 ]

Eigenvalues?

H is then _____________ definite and hence the point
w* = (w1*, w2*) = (2, 2) is a local _________________
CW: Do a similar exercise for
J(w1, w2) = w1² − w2²
Unidirectional search
J(w1, w2) = (w1 − 2)² + (w2 − 2)²
• Starting point w^s = (w1^s, w2^s) = (−4, −4)
• Search direction S (a vector)
• w* = w^s + αS
• Bracketing method to find α
• Fine tuning with interval halving (or golden section search etc.), as in the sketch below
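A minimal sketch of one unidirectional search along a fixed direction S (the bracketing step size, tolerance and helper names are my own choices; golden-section search is used here for the fine-tuning stage):

```python
import numpy as np

def J(w):
    return (w[0] - 2)**2 + (w[1] - 2)**2

def phi(alpha, w_s, S):
    """J restricted to the line w_s + alpha * S (a one-variable function)."""
    return J(w_s + alpha * S)

def bracket_alpha(w_s, S, step=0.1):
    """Crude bracketing: keep doubling until phi starts increasing."""
    a, b = 0.0, step
    while phi(b, w_s, S) < phi(a, w_s, S):
        a, b = b, 2 * b
    return 0.0, b                # the minimum along S lies within [0, b]

def golden_section(w_s, S, a, b, tol=1e-6):
    """Refine the bracket with golden-section search."""
    gr = (np.sqrt(5) - 1) / 2
    c, d = b - gr * (b - a), a + gr * (b - a)
    while abs(b - a) > tol:
        if phi(c, w_s, S) < phi(d, w_s, S):
            b = d
        else:
            a = c
        c, d = b - gr * (b - a), a + gr * (b - a)
    return (a + b) / 2

w_s = np.array([-4.0, -4.0])     # starting point from the slide
S = np.array([1.0, 1.0])         # an assumed search direction
a, b = bracket_alpha(w_s, S)
alpha = golden_section(w_s, S, a, b)
print(alpha, w_s + alpha * S)    # alpha ~ 6, landing at (2, 2) for this direction
```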
Unidirectional search - Issues
J(w1, w2) = (w1 − 2)² + (w2 − 2)²
• Starting point
• Search direction S (a vector)
Gradient at a point
What is ∇J(w1, w2) or grad. J? - Back to our notation
• ∇J(w1, w2) is a normal vector
Gradient at a point
Traveling along grad. J
• If you travel along the direction of the grad. J, what happens to J?
Gradient descent
Traveling along -grad. J or − ∇J(w1, w2)
• We should travel along − ∇J
Potential directions and steepest descent
Traveling along -grad. J or − ∇J(w1, w2)
• Let d be such that ∇J · d is -ve (a descent direction)
• Let d be such that d = − ∇J
• Among unit directions, ∇J · d = ||∇J|| cos θ is most negative (cos θ = − 1) when d points along − ∇J
• Hence − ∇J is the steepest descent direction!
• Steepest (Cauchy's) Gradient Descent
[Figure: a descent direction d, with ∇J and − ∇J at the point]
Algorithm - Gradient descent
Traveling along -grad. J or − ∇J(w1, w2)
• Starting point w* = (w1*, w2*)
• Compute J and − ∇J at w*_k = w*
• Update the w's
• w*_{k+1} = w*_k − αk ∇J
• Check the stopping criteria
• Else continue the iteration (see the sketch below)
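Putting the steps together, a minimal sketch of steepest descent for this J with a constant step length (the function names and the values of alpha, eps1 and max_iter are my own choices):

```python
import numpy as np

def J(w):
    return (w[0] - 2)**2 + (w[1] - 2)**2

def grad_J(w):
    return np.array([2.0 * (w[0] - 2), 2.0 * (w[1] - 2)])

def steepest_descent(w0, alpha=0.1, eps1=1e-6, max_iter=100):
    """Steepest (Cauchy's) gradient descent with a constant step length."""
    w = np.asarray(w0, dtype=float)
    for k in range(max_iter):
        g = grad_J(w)
        if np.linalg.norm(g) <= eps1:      # stopping criterion 1
            break
        w = w - alpha * g                  # update: w_{k+1} = w_k - alpha_k * grad J
    return w, k

w_star, iters = steepest_descent([-4.0, -4.0])
print(w_star, iters)    # converges towards (2, 2)
```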
Algorithm - Update step
w*_{k+1} = w*_k − αk ∇J
• Update the w's:
• w1^{k+1} = w1^k − αk ∂J/∂w1
• w2^{k+1} = w2^k − αk ∂J/∂w2
• Compute J and − ∇J at w*_{k+1}
• Finding αk:
• Unidirectional search (or)
• Make it a constant (the learning rate in ML)
Algorithm - Stopping criteria
1. if ||∇J(w*_k)|| ≤ ϵ1
2. if |∇J(w*_{k+1}) · ∇J(w*_k)| ≤ ϵ2
3. if ||w*_{k+1} − w*_k|| / ||w*_k|| ≤ ϵ1
4. if the number of iterations exceeds a predefined constant (k > 100, say)
• NOTE: Compute 1 or 4 before the update and 2 or 3 after (helpers for 2 and 3 are sketched below)
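Criteria 2 and 3, which are checked after an update, can be written as small helper functions (a sketch; the tolerances and names are my own choices):

```python
import numpy as np

def grad_alignment_small(grad_new, grad_old, eps2=1e-8):
    """Criterion 2: successive gradients have (near-)zero dot product."""
    return abs(np.dot(grad_new, grad_old)) <= eps2

def relative_change_small(w_new, w_old, eps1=1e-6):
    """Criterion 3: relative change in the iterate is small."""
    return np.linalg.norm(w_new - w_old) / np.linalg.norm(w_old) <= eps1
```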
Himmelblau function
J(w1, w2) = (w1² + w2 − 11)² + (w1 + w2² − 7)²
[Surface/contour plot of the Himmelblau function]
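The same steepest-descent sketch can be run on the Himmelblau function, which has four local minima. The analytic gradient below follows from the definition above; the start point and step length are my own choices (with these values the iteration settles near one of the minima, around (3, 2)):

```python
import numpy as np

def J(w):
    w1, w2 = w
    return (w1**2 + w2 - 11)**2 + (w1 + w2**2 - 7)**2

def grad_J(w):
    w1, w2 = w
    return np.array([4 * w1 * (w1**2 + w2 - 11) + 2 * (w1 + w2**2 - 7),
                     2 * (w1**2 + w2 - 11) + 4 * w2 * (w1 + w2**2 - 7)])

w = np.array([0.0, 0.0])   # illustrative starting point
alpha = 0.01               # small constant step length
for k in range(1000):
    g = grad_J(w)
    if np.linalg.norm(g) <= 1e-6:
        break
    w = w - alpha * g

print(w, J(w))   # lands on one of Himmelblau's four minima, e.g. near (3, 2)
```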
Recap - Single vs Multiple
w*_{k+1} = w*_k − αk ∇J
[Figure: the update on a single-variable J(w) (axis w) alongside the two-variable update in the (w1, w2) plane, moving along − ∇J]
Optimization strategies
Variations in (steepest) gradient descent
• Constant step length α
• Adaptive step length (αk) using line search
• Stochastic gradient descent (see the sketch below)
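A minimal sketch of stochastic gradient descent, assuming an objective that is a mean over samples, J(w) = (1/N) Σᵢ (xᵢ·w − yᵢ)², so each step can use the gradient of one randomly chosen sample; the data and settings below are purely illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))            # synthetic inputs
w_true = np.array([2.0, 2.0])
y = X @ w_true + 0.01 * rng.normal(size=200)

w = np.zeros(2)
alpha = 0.05                             # constant learning rate (my choice)
for k in range(2000):
    i = rng.integers(len(y))             # pick one sample at random
    g = 2 * (X[i] @ w - y[i]) * X[i]     # gradient of that sample's squared error
    w = w - alpha * g

print(w)   # close to w_true = (2, 2)
```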
Line Search
[w1* w2*] = [w1^s w2^s] + αS
[w1* w2*] = [w1^s w2^s] + α(− ∇J)
[Figure: steepest gradient descent from [w1^s w2^s] along − ∇J, with the line search sampling α between α = 0 and α = 10 (e.g. in steps of α = 0.01)]
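A short sketch of the line search in this picture, evaluating J along − ∇J over a grid of α values (the grid limits and spacing follow the figure; for this quadratic J the best α along the line is 0.5):

```python
import numpy as np

def J(w):
    return (w[0] - 2)**2 + (w[1] - 2)**2

def grad_J(w):
    return np.array([2.0 * (w[0] - 2), 2.0 * (w[1] - 2)])

# Sample phi(alpha) = J(w_s + alpha * (-grad J)) and keep the best alpha.
w_s = np.array([-4.0, -4.0])
S = -grad_J(w_s)
alphas = np.arange(0.0, 10.0, 0.01)
phi = np.array([J(w_s + a * S) for a in alphas])
best = alphas[np.argmin(phi)]
print(best, w_s + best * S)   # alpha = 0.5 jumps straight to (2, 2) for this J
```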