OPTIMIZATION TECHNIQUES
[AcSIR-01-ES-AD-009]
Presented by:
Dr. R. S. Bisht, Pr. Scientist,
rsbisht@cbri.res.in
CSIR-CBRI, Roorkee, India
Gradient Descent Algorithm
• Gradient Descent is a fundamental optimization algorithm used to minimize functions by iteratively moving toward a local minimum of the function. It is widely used in machine learning, especially for training models by minimizing loss functions.
• The main idea is to update the variables or parameters of the function in the direction of steepest descent (i.e., the negative gradient), which is the direction that reduces the function's value the fastest, as shown in the sketch below.
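To make the idea concrete, here is a minimal sketch of gradient descent in one variable; the toy objective f(x) = (x − 3)², the starting point, and the learning rate are assumed values chosen for illustration.

```python
# Gradient descent on the toy objective f(x) = (x - 3)^2 (assumed example).
# The derivative is f'(x) = 2 * (x - 3); the unique minimum is at x = 3.

def grad_f(x):
    return 2 * (x - 3)

x = 0.0         # starting point (assumed)
alpha = 0.1     # learning rate (assumed)

for _ in range(100):
    x = x - alpha * grad_f(x)   # move in the direction of the negative gradient

print(x)        # close to 3.0, the minimizer
```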
Gradient Descent Algorithm
• Gradient Calculation (Derivative): compute ∇f(x_k), the gradient (derivative) of the objective at the current point x_k; it points in the direction of steepest increase.
• Update Rule: x_{k+1} = x_k − α ∇f(x_k), i.e., step from the current point against the gradient, scaled by the learning rate α.
• Learning Rate: The learning rate α controls how fast we move towards the minimum.
• If α is too small, convergence will be slow.
• If α is too large, we may overshoot the minimum and fail to converge.
• Convergence: The algorithm stops when the updates to the parameters become very small, meaning the cost function is minimized, or after a fixed number of iterations. A sketch of the complete loop follows below.
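The pieces above fit together into a short loop. The sketch below assumes the same toy objective as before, a small tolerance on the step size as the convergence test, and a few trial learning rates to show how the choice of α affects convergence.

```python
# One complete gradient descent loop: gradient, update rule, learning rate,
# and a convergence test. Objective, tolerance, and learning rates are assumed.

def grad_f(x):
    return 2 * (x - 3)            # derivative of f(x) = (x - 3)^2

def gradient_descent(alpha, x0=0.0, tol=1e-8, max_iter=10_000):
    x = x0
    for k in range(max_iter):
        step = alpha * grad_f(x)  # update rule: x_{k+1} = x_k - alpha * grad f(x_k)
        x = x - step
        if abs(step) < tol:       # stop when the updates become very small
            return x, k + 1
    return x, max_iter            # otherwise stop after a fixed number of iterations

for alpha in (0.01, 0.1, 0.9):
    x_star, iters = gradient_descent(alpha)
    print(f"alpha={alpha}: x = {x_star:.6f} after {iters} iterations")
# alpha = 0.01 converges slowly; alpha = 0.9 oscillates around the minimum before
# settling; any alpha above 1.0 would overshoot further each step and diverge here.
```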
Gradient Descent Method
Mathematical Formulation: x_{k+1} = x_k − α ∇f(x_k), where x_k is the current iterate, α > 0 is the learning rate, and ∇f(x_k) is the gradient of the objective at x_k.
Disadvantages:
• Sensitive to the choice of the learning rate α. If the learning rate is too
large, it may overshoot; if too small, convergence can be slow.
• For functions with a complex geometry (e.g., ill-conditioned problems), the algorithm may suffer from slow convergence, especially in narrow valleys, as illustrated in the sketch below.
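The narrow-valley behaviour can be seen on an ill-conditioned quadratic. The sketch below assumes the objective f(x, y) = ½(x² + 100y²), a starting point, and a fixed step size chosen just inside the stability limit; all of these are illustrative values.

```python
# Fixed-step gradient descent on the ill-conditioned quadratic
# f(x, y) = 0.5 * (x**2 + 100 * y**2); the gradient is (x, 100 * y).
# The curvatures differ by a factor of 100, so a step size that is stable
# for the steep y-direction (alpha < 2/100) is very slow for x.

import numpy as np

def grad(p):
    x, y = p
    return np.array([x, 100.0 * y])

p = np.array([10.0, 1.0])     # starting point (assumed)
alpha = 0.015                 # must stay below 2/100 = 0.02 to avoid divergence

for k in range(1, 10_001):
    g = grad(p)
    if np.linalg.norm(g) < 1e-6:
        break
    p = p - alpha * g

print(f"stopped after {k} iterations, p = {p}")
# The y-coordinate zig-zags in sign and shrinks quickly, but the x-coordinate
# contracts only by a factor (1 - 0.015) per step, so many iterations are needed.
```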
Steepest Descent Method
• The steepest descent method is similar to gradient descent, but it
finds the optimal step size for each iteration.
• Instead of using a fixed learning rate, it computes the step size by
solving a line search problem that minimizes the function along the
direction of the negative gradient.
Mathematical Formulation: x_{k+1} = x_k − α_k ∇f(x_k), where the step size α_k = argmin_{α ≥ 0} f(x_k − α ∇f(x_k)) is chosen at each iteration by a line search along the negative gradient.
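For a quadratic objective f(x) = ½xᵀAx the line-search problem has a closed-form solution, α_k = (gᵀg)/(gᵀAg) with g = ∇f(x_k) = Ax_k. The sketch below assumes the same ill-conditioned matrix A = diag(1, 100) and starting point as in the previous example.

```python
# Steepest descent with an exact line search on the quadratic f(x) = 0.5 * x^T A x.
# For a quadratic, min_alpha f(x - alpha * g) is solved exactly by
# alpha_k = (g^T g) / (g^T A g).

import numpy as np

A = np.diag([1.0, 100.0])
x = np.array([10.0, 1.0])

for k in range(1, 1001):
    g = A @ x                              # gradient of 0.5 * x^T A x
    if np.linalg.norm(g) < 1e-6:
        break
    alpha_k = (g @ g) / (g @ A @ g)        # optimal step along -g
    x = x - alpha_k * g

print(f"stopped after {k} iterations, x = {x}")
# Compared with the fixed-step run above, far fewer iterations are needed here,
# at the price of computing alpha_k at every step.
```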
Advantages:
• Steepest descent is generally more reliable because it always finds the
best step size at each iteration.
• It works well in situations where the landscape of the function is
complicated (e.g., narrow valleys, ill-conditioned problems).
Disadvantages:
• The steepest descent method can be computationally expensive because it requires solving a line search problem at every iteration, which can significantly slow down the algorithm for large-scale problems.
• In practice, for many machine learning tasks, it can be overkill due to the overhead of computing the optimal step size.
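For a general (non-quadratic) function there is no closed-form step, so each iteration must run a numerical one-dimensional minimization; that inner solve is the overhead referred to above. The sketch below assumes the Rosenbrock function, a fixed outer budget, and scipy.optimize.minimize_scalar as one possible line-search routine; all of these are illustrative choices.

```python
# Steepest descent with a numerical line search on a general function,
# illustrating the per-iteration overhead. The objective (Rosenbrock),
# starting point, and iteration budget are assumed for the example.

import numpy as np
from scipy.optimize import minimize_scalar

def f(p):
    x, y = p
    return (1 - x) ** 2 + 100 * (y - x ** 2) ** 2          # Rosenbrock function

def grad_f(p):
    x, y = p
    return np.array([-2 * (1 - x) - 400 * x * (y - x ** 2),
                     200 * (y - x ** 2)])

p = np.array([-1.2, 1.0])                  # standard Rosenbrock starting point
for k in range(200):                       # fixed outer budget for the sketch
    g = grad_f(p)
    # Line search: minimize f along the negative gradient direction.
    # Each call is itself an iterative 1-D optimization, which is the extra cost per step.
    phi = lambda alpha: f(p - alpha * g)
    alpha_k = minimize_scalar(phi, bounds=(0.0, 1.0), method="bounded").x
    p = p - alpha_k * g

print(f"after 200 line-searched steps: p = {p}, f(p) = {f(p):.6f}")
# Each step made robust progress but paid for an inner solve; on this banana-shaped
# valley many more iterations would still be needed to reach the minimum at (1, 1).
```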
Key Differences:
Aspect | Gradient Descent | Steepest Descent
Step Size (Learning Rate) | Fixed step size α chosen before the process. | Dynamically computed α_k using line search.
Direction | Moves in the direction of the negative gradient. | Also moves in the direction of the negative gradient, but with an optimal step size for each step.
Convergence | Sensitive to the choice of the learning rate. | More robust convergence due to optimal step size choice.
Speed | Potentially faster for well-conditioned problems. | Can be slower due to the overhead of calculating α_k in each iteration.
Implementation | Simple to implement; commonly used in large-scale optimization. | Requires line search, which adds complexity and computational cost.
Suitability | Suitable for machine learning problems with large datasets. | Suitable for problems where high precision and robust convergence are important (e.g., scientific computations).