Backpropagation Example With Numbers Step by Step
When I come across a new mathematical concept or before I use a canned software package, I like to replicate the calculations in order to get a
deeper understanding of what is going on. This type of computation-based approach from first principles helped me greatly when I first came
across material on artificial neural networks.
In this post, I go through a detailed example of one iteration of the backpropagation algorithm using full formulas from basic principles and actual
values. The neural network I use has three input neurons, one hidden layer with two neurons, and an output layer with two neurons.
The following are the (very) high-level steps that I will take in this post. Details on each step will follow after.
(1) Specify the input and target values and initialize the weights
(2) Forward propagate through the network to get the output values
(3) Calculate the error using the output values and the target values
(4) Backpropagate through the network to compute the error derivatives with respect to the parameters
(5) Update the parameter estimates using the error derivative and the current value
Step 1
The input and target values for this problem are and . I will initialize the weights as shown in the diagram
below. Generally, you would assign them randomly, but for illustration purposes I've chosen these numbers.
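To make Step 1 concrete, here is a minimal sketch of how the setup could look in code. The layer sizes follow the network described above (three input neurons, two hidden neurons, two output neurons); the variable names, the random seed, and the placeholder input/target vectors are my own assumptions rather than the values from the diagram.

import numpy as np

np.random.seed(0)  # assumed seed, for reproducibility only

# Network shape from the post: 3 input neurons, 2 hidden neurons, 2 output neurons
n_input, n_hidden, n_output = 3, 2, 2

# Placeholder input and target vectors (the actual numbers come from the diagram)
x = np.array([0.1, 0.2, 0.7])  # hypothetical inputs
t = np.array([1.0, 0.0])       # hypothetical targets

# Randomly initialized weights, as one would normally do
W1 = np.random.rand(n_hidden, n_input)   # weights from input layer to hidden layer
W2 = np.random.rand(n_output, n_hidden)  # weights from hidden layer to output layer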
Step 2
Mathematically, we have the following relationships between nodes in the network. For the hidden and output layers, I will use the somewhat strange
convention of denoting the values before the activation function is applied as , , , and , and the values after the activation function is applied as
, , , and .
We can use the formulas above to forward propagate through the network. I've shown values up to four decimal places below, but maintained full
precision in the actual calculations.
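As a rough sketch of what the forward pass computes, the code below applies the weights and the sigmoid activation layer by layer. It reuses the hypothetical x, W1, and W2 from the earlier snippet; bias terms are omitted, which is an assumption on my part since the diagram is not reproduced in the text.

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hidden layer: weighted sum of the inputs, then the sigmoid activation
z_hidden = W1 @ x      # values before the activation function
h = sigmoid(z_hidden)  # values after the activation function

# Output layer: weighted sum of the hidden activations, then the sigmoid activation
z_output = W2 @ h
o = sigmoid(z_output)  # network outputs

print(np.round(o, 4))  # shown to four decimal places, as in the post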
Step 3
We now define the sum of squares error using the target values and the results from the last layer of forward propagation.
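For reference, the sum of squares error typically takes the form below; the factor of one half is a common convention that cancels when differentiating, and I am assuming it here since the original formula is not reproduced in the text.

E = \frac{1}{2}\sum_{i}\left(t_i - o_i\right)^2

where the t_i are the target values and the o_i are the outputs of the last layer.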
Step 4
We are now ready to backpropagate through the network to compute all the error derivatives with respect to the parameters. Note that although
there will be many long formulas, we are not doing anything fancy here. We are just using the basic principles of calculus such as the chain rule.
First we go over some derivatives we will need in this step. The derivative of the sigmoid function is given here. Also, given that
and , we have , , , , , and .
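The sigmoid derivative has a particularly convenient closed form, and it is the identity relied on throughout this step:

\sigma(x) = \frac{1}{1 + e^{-x}}, \qquad \frac{d\sigma(x)}{dx} = \sigma(x)\left(1 - \sigma(x)\right)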
We are now ready to calculate , , , and using the derivatives we have already discussed.
I will omit the details on the next three computations since they are very similar to the one above. Feel free to leave a comment if you are unable to
replicate the numbers below.
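Although the specific symbols are not reproduced here, each of these output-layer derivatives follows the same chain-rule pattern: the derivative of the error with respect to a weight is the product of the derivative of the error with respect to the output, the derivative of the output with respect to its pre-activation value, and the derivative of the pre-activation value with respect to the weight. In generic notation (my own, for illustration), for an output o with target t fed by a hidden activation h:

\frac{\partial E}{\partial w} = \frac{\partial E}{\partial o}\cdot\frac{\partial o}{\partial z}\cdot\frac{\partial z}{\partial w} = (o - t)\cdot o\,(1 - o)\cdot h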
The error derivative of is a little bit more involved since changes to affect the error through both and .
To summarize, we have computed numerical values for the error derivatives with respect to , , , , and . We will now backpropagate
one layer to compute the error derivatives of the parameters connecting the input layer to the hidden layer. These error derivatives are , ,
, , , , and .
I will calculate , , and first since they all flow through the node.
The calculation of the first term on the right-hand side of the equation above is a bit more involved than previous calculations since affects the
error through both and .
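The reason this term needs extra care is that a hidden-layer activation feeds into both output neurons, so its effect on the error has to be summed over both paths. In generic notation (again my own, since the post's symbols are not reproduced here), for a hidden activation h feeding outputs o_1 and o_2:

\frac{\partial E}{\partial h} = \frac{\partial E}{\partial o_1}\cdot\frac{\partial o_1}{\partial h} + \frac{\partial E}{\partial o_2}\cdot\frac{\partial o_2}{\partial h}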
Now I will proceed with the numerical values for the error derivatives above. These derivatives have either already been calculated or are similar in
style to those calculated earlier. If anything is unclear, please leave a comment.
I will now calculate , , and since they all flow through the node.
The calculation of the first term on the right-hand side of the equation above is a bit more involved since affects the error through both and .
So what do we do now? We repeat this process over and over many times until the error goes down and the parameter estimates stabilize or converge to
some values. We obviously won't be going through all these calculations manually. I've provided Python code below that codifies the calculations
above. Nowadays, we wouldn't do any of this manually but would instead use a machine learning package that is already readily available.
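Since the full listing is not reproduced here, the following is a minimal sketch of what one iteration (repeated in a loop) could look like for this 3-2-2 network. It reuses the hypothetical x, t, W1, W2, and sigmoid from the earlier snippets, omits bias terms, and uses a learning rate of 0.5 purely as an assumed value.

learning_rate = 0.5  # assumed value, not taken from the post

for iteration in range(10000):
    # Forward pass
    h = sigmoid(W1 @ x)                 # hidden activations
    o = sigmoid(W2 @ h)                 # output activations
    error = 0.5 * np.sum((t - o) ** 2)  # sum of squares error

    # Backward pass (repeated application of the chain rule, as above)
    delta_output = (o - t) * o * (1 - o)                # dE/dz at the output layer
    delta_hidden = (W2.T @ delta_output) * h * (1 - h)  # dE/dz at the hidden layer
    grad_W2 = np.outer(delta_output, h)                 # dE/dW2
    grad_W1 = np.outer(delta_hidden, x)                 # dE/dW1

    # Update the parameter estimates using the error derivatives and the current values
    W2 -= learning_rate * grad_W2
    W1 -= learning_rate * grad_W1

    if iteration % 1000 == 0:
        print(iteration, round(error, 6))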
I ran 10,000 iterations and we see below that the sum of squares error has dropped significantly after the first thousand or so iterations.
FIRST DERIVATIVE OF THE SIGMOID FUNCTION