Department of Electronics and Communication Engineering: Seminar On Subject: Artificial Neural Network (21EC641)
Seminar on
“Gradient Descent, SGD, Adam, RMSProp”
Gradient Descent
Definition: Neural network optimization refers to techniques that minimize the error (loss) function during training by iteratively adjusting the model's weights. Gradient descent does this by stepping each weight in the direction of the negative gradient of the loss, scaled by a learning rate (see the sketch below).
Visual Aid: Illustration showing how Gradient Descent progresses towards minimizing
loss.
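As an illustration of the update rule, the following is a minimal NumPy sketch of full-batch gradient descent on a toy least-squares problem; the synthetic data, learning rate, and iteration count are illustrative assumptions.

```python
import numpy as np

# Toy least-squares problem: L(w) = mean squared error of X @ w against y.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
true_w = np.array([1.5, -2.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)

w = np.zeros(3)   # parameters to optimize
lr = 0.1          # learning rate (step size)
for step in range(200):
    grad = X.T @ (X @ w - y) / len(y)  # full-batch gradient of the loss
    w -= lr * grad                     # gradient descent update: w <- w - lr * grad
print(w)  # should end up close to true_w
```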
Stochastic Gradient Descent (SGD)
Overview: SGD updates the parameters using gradients computed from a single randomly chosen training example or a small subset (mini-batch) of the data, rather than the full training set.
Advantages: Faster convergence and scalability to large datasets.
Mini-batch SGD: Balances computational efficiency and gradient accuracy by processing the data in small batches (see the sketch below).
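A minimal sketch of mini-batch SGD on the same kind of toy least-squares problem; the batch size, learning rate, and epoch count are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + 0.1 * rng.normal(size=1000)

w = np.zeros(3)
lr, batch_size = 0.05, 32
for epoch in range(20):
    perm = rng.permutation(len(y))              # shuffle the data once per epoch
    for start in range(0, len(y), batch_size):
        idx = perm[start:start + batch_size]    # indices of the current mini-batch
        Xb, yb = X[idx], y[idx]
        grad = Xb.T @ (Xb @ w - yb) / len(yb)   # gradient estimated from the batch only
        w -= lr * grad                          # same update rule, noisier gradient
print(w)  # should end up close to [1.5, -2.0, 0.5]
```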
Momentum and Nesterov Accelerated Gradient (NAG)
Concept: Momentum enhances SGD by adding a fraction of the previous update vector
to the current update.
NAG: Improves upon Momentum by evaluating the gradient at the look-ahead position (the current parameters plus the momentum step) rather than at the current parameters; both update rules are sketched below.
Benefits: Faster convergence, especially in the presence of noise or high curvature.
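The following sketch contrasts classical Momentum with the Nesterov (look-ahead) update for a generic gradient function; the quadratic grad_fn and all hyperparameter values are illustrative assumptions.

```python
import numpy as np

def grad_fn(w):
    return 2.0 * (w - 3.0)               # gradient of the toy objective (w - 3)^2

def momentum(w, lr=0.1, beta=0.9, steps=100):
    v = np.zeros_like(w)
    for _ in range(steps):
        v = beta * v - lr * grad_fn(w)   # keep a fraction of the previous update
        w = w + v
    return w

def nesterov(w, lr=0.1, beta=0.9, steps=100):
    v = np.zeros_like(w)
    for _ in range(steps):
        lookahead = w + beta * v         # evaluate the gradient at the anticipated position
        v = beta * v - lr * grad_fn(lookahead)
        w = w + v
    return w

print(momentum(np.array([0.0])), nesterov(np.array([0.0])))  # both approach 3.0
```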
Adam Optimization
Overview: Adam adapts the learning rate for each parameter by maintaining exponentially decaying estimates of the first moment (mean) and second moment (uncentred variance) of the gradients.
Advantages: Works well with sparse gradients and widely varying gradient magnitudes, and often converges faster than plain SGD in practice.
Considerations: Sensitivity to hyperparameter choices and extra memory and computation for the per-parameter moment estimates (the update is sketched below).
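A minimal sketch of the Adam update (first- and second-moment estimates with bias correction); the toy quadratic objective and the hyperparameter values are illustrative defaults, not tuned settings.

```python
import numpy as np

def grad_fn(w):
    return 2.0 * (w - 3.0)                   # gradient of the toy objective (w - 3)^2

w = np.array([0.0])
m = np.zeros_like(w)                          # first-moment (mean) estimate
v = np.zeros_like(w)                          # second-moment (uncentred variance) estimate
lr, beta1, beta2, eps = 0.1, 0.9, 0.999, 1e-8

for t in range(1, 1001):
    g = grad_fn(w)
    m = beta1 * m + (1 - beta1) * g          # update biased first-moment estimate
    v = beta2 * v + (1 - beta2) * g * g      # update biased second-moment estimate
    m_hat = m / (1 - beta1 ** t)             # bias correction
    v_hat = v / (1 - beta2 ** t)
    w = w - lr * m_hat / (np.sqrt(v_hat) + eps)  # per-parameter adaptive step
print(w)  # should end up close to 3.0
```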
RMSProp
Concept: RMSProp scales the learning rate for each parameter by dividing by the square root of an exponentially decaying average of recent squared gradients, so parameters with consistently large gradients take smaller steps.
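A minimal sketch of the RMSProp update, dividing each step by a decaying average of squared gradients; the toy objective and the hyperparameter values are illustrative assumptions.

```python
import numpy as np

def grad_fn(w):
    return 2.0 * (w - 3.0)                     # gradient of the toy objective (w - 3)^2

w = np.array([0.0])
s = np.zeros_like(w)                           # running average of squared gradients
lr, rho, eps = 0.01, 0.9, 1e-8

for _ in range(2000):
    g = grad_fn(w)
    s = rho * s + (1 - rho) * g * g            # exponentially decaying average of g^2
    w = w - lr * g / (np.sqrt(s) + eps)        # per-parameter scaled step
print(w)  # should end up close to 3.0
```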