BackPropagation
There are some functions whose names start with the word "grader", e.g. grader_sigmoid(), grader_forwardprop(), grader_backprop(); you should not change those function definitions.
Every grader function has to return True.
Loading data
In [1]:
import pickle
import numpy as np
from tqdm import tqdm
import matplotlib.pyplot as plt
with open('data.pkl', 'rb') as f:
    data = pickle.load(f)
print(data.shape)
X = data[:, :5]
y = data[:, -1]
print(X.shape, y.shape)
(506, 6)
(506, 5) (506,)
Computational graph
If you observe the graph, it has input features [f1, f2, f3, f4, f5] and 9 weights [w1, w2, w3, w4, w5, w6, w7, w8, w9].
The final output of this graph is a value L which is computed as (Y-Y')^2.
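The graph image itself is not reproduced here; written out as an equation (consistent with the forward_propagation() implementation in Task 1 below, with $\sigma$ denoting the sigmoid), the prediction is
$$Y' = \tanh\!\left(e^{(w_1 f_1 + w_2 f_2)^2 + w_6} + w_7\right) + w_9 \cdot \sigma\!\left((w_4 f_4 + w_5 f_5)\cdot\sin(w_3 f_3) + w_8\right)$$
and the loss is $L = (Y - Y')^2$.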
Task 1: Implementing backpropagation and Gradient checking
Check this video for better understanding of the computational graphs and back propagation
In [2]:
from IPython.display import YouTubeVideo
YouTubeVideo('i94OvYb6noo',width="1000",height="500")
Out[2]:
Gradient checking
Check this blog link for more details on gradient checking
We know that the derivative of any function is
$$\lim_{\epsilon\to0}\frac{f(x+\epsilon)-f(x-\epsilon)}{2\epsilon}$$
The definition above can be used as a numerical approximation of the derivative. For a small enough epsilon, the calculated approximation has an error on the order of epsilon squared.
In other words, if epsilon is 0.001, the approximation will be off by about 0.000001.
Therefore, we can use this to approximate the gradient and, in turn, make sure that backpropagation is implemented properly. This forms the basis of gradient checking!
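For example, a few lines of Python show how the two-sided formula approximates a derivative (a minimal sketch; the cubic f below is just an arbitrary test function):
def numerical_derivative(f, x, eps=1e-4):
    # central (two-sided) difference approximation of f'(x)
    return (f(x + eps) - f(x - eps)) / (2 * eps)

f = lambda x: x**3                     # example function, f'(x) = 3*x^2
print(numerical_derivative(f, 2.0))    # ~12.00000001 (error on the order of eps^2)
print(3 * 2.0**2)                      # exact value: 12.0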
Gradient checking example
Let's understand the concept with a simple example: $f(w_1,w_2,x_1,x_2)=w_{1}^{2} \cdot x_{1} + w_{2} \cdot x_{2}$
For the above function, let's assume $w_{1}=1$, $w_{2}=2$, $x_{1}=3$, $x_{2}=4$. The gradient of $f$ w.r.t. $w_{1}$ is
\begin{array}{lcl} \frac{df}{dw_{1}} = dw_{1} & = & 2 \cdot w_{1} \cdot x_{1} \\ & = & 2 \cdot 1 \cdot 3 \\ & = & 6 \end{array}
Let's calculate the approximate gradient of $w_{1}$ as mentioned in the above formula, considering $\epsilon=0.0001$:
\begin{array}{lcl} dw_1^{approx} & = & \frac{f(w_1+\epsilon,w_2,x_1,x_2)-f(w_1-\epsilon,w_2,x_1,x_2)}{2\epsilon} \\ & = & \frac{((1+0.0001)^{2} \cdot 3 + 2 \cdot 4) - ((1-0.0001)^{2} \cdot 3 + 2 \cdot 4)}{2 \cdot 0.0001} \\ & = & \frac{(1.00020001 \cdot 3 + 2 \cdot 4) - (0.99980001 \cdot 3 + 2 \cdot 4)}{0.0002} \\ & = & \frac{11.00060003 - 10.99940003}{0.0002} \\ & = & 5.99999999999 \end{array}
Then, we apply the following formula for gradient check: gradient_check $= \frac{\left\Vert dW-dW^{approx}\right\Vert_2}{\left\Vert dW\right\Vert_2+\left\Vert dW^{approx}\right\Vert_2}$
The equation above is basically the Euclidean distance normalized by the sum of the norms of the vectors. We use normalization in case one of the vectors is very small. As a value for epsilon, we usually opt for 1e-7. Therefore, if the gradient check returns a value less than 1e-7, it means that backpropagation was implemented correctly. Otherwise, there is potentially a mistake in your implementation. If the value exceeds 1e-3, then you can be sure that the code is not correct.
In our example: gradient_check $= \frac{6 - 5.999999999994898}{6 + 5.999999999994898} = 4.2514140356330737 \times 10^{-13}$
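The same numbers can be reproduced in a few lines of Python (a small check of the worked example above; the variable names are only for illustration):
w1, w2, x1, x2 = 1, 2, 3, 4
eps = 0.0001
f = lambda w: w**2 * x1 + w2 * x2                      # f as a function of w1 only
dw1 = 2 * w1 * x1                                      # analytic gradient = 6
dw1_approx = (f(w1 + eps) - f(w1 - eps)) / (2 * eps)   # ~5.999999999994898
gradient_check = abs(dw1 - dw1_approx) / (abs(dw1) + abs(dw1_approx))
print(dw1_approx, gradient_check)                      # ~6.0  ~4.25e-13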
You can mathematically derive the same result:
\begin{array}{lcl} dw_1^{approx} & = & \frac{f(w_1+\epsilon,w_2,x_1,x_2)-f(w_1-\epsilon,w_2,x_1,x_2)}{2\epsilon} \\ & = & \frac{((w_{1}+\epsilon)^{2} \cdot x_{1} + w_{2} \cdot x_{2}) - ((w_{1}-\epsilon)^{2} \cdot x_{1} + w_{2} \cdot x_{2})}{2\epsilon} \\ & = & \frac{4 \epsilon w_{1} x_{1}}{2\epsilon} \\ & = & 2 w_{1} x_{1} \end{array}
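The same cancellation can be checked symbolically (a small sketch, assuming sympy is installed):
import sympy as sp

w1, w2, x1, x2, eps = sp.symbols('w1 w2 x1 x2 epsilon')
f = w1**2 * x1 + w2 * x2

exact = sp.diff(f, w1)                                                         # 2*w1*x1
approx = sp.simplify((f.subs(w1, w1 + eps) - f.subs(w1, w1 - eps)) / (2*eps))  # 2*w1*x1
print(exact, approx)   # the epsilon terms cancel exactly for this quadratic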
Implement Gradient checking
(Write your code in def gradient_checking())
Algorithm: the pseudocode for gradient_checking() is given under "Implement gradient checking" below.
Task 2 : Optimizers
As a part of this task, you will be implementing 3 type of optimizers(methods to update weight)
Use the same computational graph that was mentioned above to do this task
Initilze the 9 weights from normal distribution with mean=0 and std=0.01
Check below video and this blog
In [3]:
from IPython.display import YouTubeVideo
YouTubeVideo('gYpoJMlgyXA',width="1000",height="500")
Out[3]:
Algorithm
for each epoch(1-100):
    for each data point in your data:
        compute the gradients of the weights using the functions
        forward_propagation() and backward_propagation()
        update the weights with the help of the gradients, ex: w1 = w1 - learning_rate*dw1
Implement the below tasks; a sketch of the three update rules follows the task list.
Task 2.1: you will be implementing the above algorithm with Vanilla update of weights
Task 2.2: you will be implementing the above algorithm with Momentum update of weights
Task 2.3: you will be implementing the above algorithm with Adam update of weights
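A minimal sketch of the three update rules, written as standalone helper functions (the function names and hyperparameter values here are illustrative, not prescribed by the assignment; the Adam sketch omits bias correction, matching the simplified form used later in this notebook):
import numpy as np

def vanilla_update(w, dw, lr=0.01):
    # plain gradient descent step: w <- w - lr * dw
    return w - lr * dw

def momentum_update(w, dw, v, lr=0.01, mu=0.9):
    # keep a velocity vector v: v <- mu*v - lr*dw, then w <- w + v
    v = mu * v - lr * dw
    return w + v, v

def adam_update(w, dw, m, s, lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
    # m and s are running estimates of the first and second moments of the gradient
    m = beta1 * m + (1 - beta1) * dw
    s = beta2 * s + (1 - beta2) * dw**2
    w = w - lr * m / (np.sqrt(s) + eps)
    return w, m, s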
Note: If you get any assertion error while running the grader functions, please print the variables in the grader functions and check which variable is returning False. Recheck your logic for that variable.
Write two functions
Forward propagation (Write your code in def forward_propagation())
For easy debugging, we will break the computational graph into 3 parts:
Part 1 (up to exp)
Part 2 (up to tanh)
Part 3 (up to sigmoid)
def forward_propagation(X, y, W):
    # X: input data point; note that in this assignment you have 5-d data points
    # y: output variable
    # W: weight array of length 9; W[0] corresponds to w1 in the graph, W[1] corresponds to w2, ..., W[8] corresponds to w9
    # you have to return the following variables
    # exp  = part1 (compute the forward propagation until exp and store the value in exp)
    # tanh = part2 (compute the forward propagation until tanh and store the value in tanh)
    # sig  = part3 (compute the forward propagation until sigmoid and store the value in sig)
    # now compute the remaining values from the computational graph and get y'
    # write code to compute the value of L = (y - y')^2
    # compute the derivative of L w.r.t. y' and store it in dl
    # create a dictionary to store all the intermediate values
    # store the L, exp, tanh, sig, dl variables
    return (dictionary, which you might need to use for back propagation)
Backward propagation (Write your code in def backward_propagation())
def backward_propagation(L, W, dictionary):
    # L: the loss we calculated for the current point
    # dictionary: the outputs of the forward_propagation() function
    # write code to compute the gradients of each weight [w1, w2, w3, ..., w9]
    # Hint: you can use a dict to store the required variables
    # return dW, a dictionary with the gradients of all the weights
    return dW
Task 1
Forward propagation
In [4]:
def sigmoid(z):
    '''In this function, we will compute the sigmoid(z)'''
    # we can use this function in forward and backward propagation
    return 1/(1 + np.exp(-z))

def forward_propagation(x, y, w):
    # part 1: exp branch
    t1 = w[0]*x[0]
    t2 = w[1]*x[1]
    t3 = t1 + t2
    t4 = t3*t3
    t5 = t4 + w[5]
    exp = np.exp(t5)
    # part 2: tanh
    t7 = exp + w[6]
    tanh = np.tanh(t7)
    # part 3: sigmoid branch
    t9 = w[2]*x[2]
    t10 = np.sin(t9)
    t11 = w[3]*x[3]
    t12 = w[4]*x[4]
    t13 = t11 + t12
    t14 = t13*t10
    t15 = t14 + w[7]
    sig = sigmoid(t15)
    # combine the two branches to get y' and the loss
    t16 = sig*w[8]
    y_hat = t16 + tanh
    L = np.square(y - y_hat)
    dl = -2*(y - y_hat)      # derivative of L w.r.t. y'
    dic = {
        'exp': exp,
        'sigmoid': sig,
        'tanh': tanh,
        'loss': L,
        'dy_pr': dl,
        'sin': t10,
        'cos': np.cos(t9)
    }
    return dic
Grader function - 1
In [5]:
def grader_sigmoid(z):
    val = sigmoid(z)
    assert(val == 0.8807970779778823)
    return True
grader_sigmoid(2)
Out[5]:
True
In [6]:
def grader_forwardprop(data):
    dl = (data['dy_pr'] == -1.9285278284819143)
    loss = (data['loss'] == 0.9298048963072919)
    part1 = (data['exp'] == 1.1272967040973583)
    part2 = (data['tanh'] == 0.8417934192562146)
    part3 = (data['sigmoid'] == 0.5279179387419721)
    assert(dl and loss and part1 and part2 and part3)
    return True
w = np.ones(9)*0.1
d1 = forward_propagation(X[0], y[0], w)
grader_forwardprop(d1)
Out[6]:
True
In [7]:
print(d1)
{'exp': 1.1272967040973583, 'sigmoid': 0.5279179387419721, 'tanh': 0.8417934192562146, 'loss': 0.9298048963072919, 'dy_pr': -1.9285278284819143, 'sin': -0.14538296400984968, 'cos': 0.9893754564247643}
Backward propagation
In [8]:
def backward_propagation(L, W, dic):
    '''In this function, we will compute the backward propagation '''
    # note: here L is the input data point (its 5 features), W is the weight vector,
    # and dic holds the intermediate values returned by forward_propagation()
    dw1 = dic['dy_pr']*(1-np.square(dic['tanh']))*dic['exp']*2*((W[0]*L[0]+W[1]*L[1])*L[0])
    dw2 = dic['dy_pr']*(1-np.square(dic['tanh']))*dic['exp']*2*((W[1]*L[1]+W[0]*L[0])*L[1])
    dw3 = dic['dy_pr']*W[8]*dic['sigmoid']*(1-dic['sigmoid'])*(L[3]*W[3]+L[4]*W[4])*L[2]*dic['cos']
    dw4 = dic['dy_pr']*W[8]*dic['sigmoid']*(1-dic['sigmoid'])*L[3]*dic['sin']
    dw5 = dic['dy_pr']*W[8]*dic['sigmoid']*(1-dic['sigmoid'])*L[4]*dic['sin']
    dw6 = dic['dy_pr']*(1-np.square(dic['tanh']))*dic['exp']
    dw7 = dic['dy_pr']*(1-np.square(dic['tanh']))
    dw8 = dic['dy_pr']*W[8]*dic['sigmoid']*(1-dic['sigmoid'])
    dw9 = dic['sigmoid']*dic['dy_pr']
    dW = {
        'dw1': dw1,
        'dw2': dw2,
        'dw3': dw3,
        'dw4': dw4,
        'dw5': dw5,
        'dw6': dw6,
        'dw7': dw7,
        'dw8': dw8,
        'dw9': dw9
    }
    return dW
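As a sanity check on these expressions, two of them can be derived directly with the chain rule (using the intermediate values stored by forward_propagation(), where $\hat{y} = \text{sig} \cdot w_9 + \tanh$ and $dl = \frac{\partial L}{\partial \hat{y}} = -2(y-\hat{y})$):
\begin{array}{lcl} \frac{\partial L}{\partial w_9} & = & dl \cdot \frac{\partial \hat{y}}{\partial w_9} \;=\; dl \cdot \text{sig} \\ \frac{\partial L}{\partial w_1} & = & dl \cdot (1-\tanh^2) \cdot \frac{\partial}{\partial w_1}\!\left(e^{(w_1 x_1 + w_2 x_2)^2 + w_6} + w_7\right) \;=\; dl \cdot (1-\tanh^2)\cdot \exp \cdot 2(w_1 x_1 + w_2 x_2)\, x_1 \end{array}
These match dw9 and dw1 in the code above; the remaining gradients follow the same pattern.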
In [9]:
def grader_backprop(data):
    dw1 = (data['dw1'] == -0.22973323498702003)
    dw2 = (data['dw2'] == -0.021407614717752925)
    dw3 = (data['dw3'] == -0.005625405580266319)
    dw4 = (data['dw4'] == -0.004657941222712423)
    dw5 = (data['dw5'] == -0.0010077228498574246)
    dw6 = (data['dw6'] == -0.6334751873437471)
    dw7 = (data['dw7'] == -0.561941842854033)
    dw8 = (data['dw8'] == -0.04806288407316516)
    dw9 = (data['dw9'] == -1.0181044360187037)
    assert(dw1 and dw2 and dw3 and dw4 and dw5 and dw6 and dw7 and dw8 and dw9)
    return True
w = np.ones(9)*0.1
d1 = forward_propagation(X[0], y[0], w)
d1 = backward_propagation(X[0], w, d1)
grader_backprop(d1)
Out[9]:
True
Implement gradient checking
W = initialize_randomly
def gradient_checking(data_point, W):
    # compute the L value using forward_propagation()
    # compute the gradients of W using backward_propagation()
    approx_gradients = []
    for each wi weight value in W:
        # add a small value to weight wi, and then find the value of L with the updated weights
        # subtract a small value from weight wi, and then find the value of L with the updated weights
        # compute the approximate gradient of weight wi
        approx_gradients.append(approximate gradient of weight wi)
    # compare the gradients of the weights from backward_propagation() with the approximate
    # gradients of the weights using the gradient_check formula
    return gradient_check
NOTE: you can do a sanity check by verifying that all the return values of gradient_checking() are close to zero (below the 1e-7 threshold described above). If not, you have a bug in your code.
In [20]:
W = np.random.rand(9)
eps = 0.0001
def gradient_checking(data_point, W):
    # compute the loss and the intermediate values using forward_propagation()
    d1 = forward_propagation(data_point, y[0], W)
    # compute the gradients of W using backward_propagation()
    grad = backward_propagation(data_point, W, d1)
    grad = list(grad.values())
    approx_gradients = []
    for i in range(len(W)):
        # add a small value to weight wi and compute L with the updated weights
        w_plus = W.copy()
        w_plus[i] += eps
        L_plus = forward_propagation(data_point, y[0], w_plus)['loss']
        # subtract a small value from weight wi and compute L with the updated weights
        w_minus = W.copy()
        w_minus[i] -= eps
        L_minus = forward_propagation(data_point, y[0], w_minus)['loss']
        # compute the approximate gradient of weight wi
        approx_gradients.append((L_plus - L_minus)/(2*eps))
    # compare the gradients from backward_propagation() with the approximate gradients
    # using the gradient_check formula
    gradient_check = []
    for i in range(len(W)):
        num = np.linalg.norm(grad[i] - approx_gradients[i])
        den = np.linalg.norm(grad[i]) + np.linalg.norm(approx_gradients[i])
        gradient_check.append(num / den)
    return gradient_check
g = gradient_checking(X[0], W)
In [21]:
g
Out[21]:
[1.446076278481714e-08,
1.2264881537525243e-10,
9.564345727570781e-11,
1.9892512093219975e-10,
7.8985457914592e-12,
1.2992058661307797e-09,
4.214611098858496e-09,
8.936748335692261e-10,
1.2188639210214776e-13]
Task 2: Optimizers
Algorithm with Vanilla update of weights
In [12]:
w=list(np.random.normal(0.0, 0.01, 9))
w
Out[12]:
[-0.001345358300390486,
0.013196112781944023,
0.0019041861049636545,
-0.011440720702484709,
0.017669818548239284,
0.006658934006215388,
0.0068233799771498585,
0.00656258256147008,
0.017630243873985256]
In [13]:
vw = list(w)        # start from a copy of the initial weights
Loss = []
for epoch in range(100):
    for i, j in zip(X, y):
        d1 = forward_propagation(i, j, vw)
        loss = d1['loss']
        dw = backward_propagation(i, vw, d1)
        dw = list(dw.values())
        dw = [k * 0.01 for k in dw]     # learning_rate = 0.01
        vw = np.subtract(vw, dw)        # vanilla update: w = w - learning_rate*dw
    Loss.append(loss)                   # record the loss once per epoch
Plot between epochs and loss
In [14]:
import matplotlib.pyplot as plt
epoch=list(range(1,101))
plt.plot(epoch,Loss)
plt.xlabel('Epoch')
plt.ylabel('Loss')
Out[14]:
Text(0, 0.5, 'Loss')
Algorithm with Momentum update of weights
In [15]:
mw = list(w)             # start from the same initial weights
v = list(np.zeros(9))    # velocity vector
Loss_momentum = []
for epoch in range(100):
    for i, j in zip(X, y):
        d1 = forward_propagation(i, j, mw)
        loss_momentum = d1['loss']
        dw = backward_propagation(i, mw, d1)
        dw = list(dw.values())
        for k in range(len(dw)):
            v[k] = 0.9*v[k] - 0.01*dw[k]    # momentum update: mu = 0.9, learning_rate = 0.01
        mw = np.add(mw, v)
    Loss_momentum.append(loss_momentum)
Plot between epochs and loss
In [16]:
epoch=list(range(1,101))
plt.plot(epoch,Loss_momentum)
plt.xlabel('Epoch')
plt.ylabel('Loss')
Out[16]:
Text(0, 0.5, 'Loss')
Algorithm with Adam update of weights
In [33]:
aw = list(w)             # copy so the in-place updates below do not modify w
av = list(np.zeros(9))   # second-moment estimates
am = list(np.zeros(9))   # first-moment estimates
Loss_adam = []
for epoch in range(100):
    for i, j in zip(X, y):
        d1 = forward_propagation(i, j, aw)
        loss_adam = d1['loss']
        dw = backward_propagation(i, aw, d1)
        dw = list(dw.values())
        for k in range(len(dw)):
            am[k] = 0.9*am[k] + (1-0.9)*dw[k]             # beta1 = 0.9
            av[k] = 0.999*av[k] + (1-0.999)*(dw[k]**2)    # beta2 = 0.999
            aw[k] += -0.1 * am[k] / (np.sqrt(av[k]) + 1e-8)   # learning_rate = 0.1
    Loss_adam.append(loss_adam)
Plot between epochs and loss
In [34]:
epoch=list(range(1,101))
plt.plot(epoch,Loss_adam)
plt.xlabel('Epoch')
plt.ylabel('Loss')
Out[34]:
Text(0, 0.5, 'Loss')
Comparison plot between epochs and loss with different optimizers
In [35]:
plt.plot(epoch, Loss, 'r--')
plt.plot(epoch,Loss_momentum, 'b--')
plt.plot(epoch,Loss_adam, 'g--')
plt.legend(['vanilla', 'momentum', 'adam'])
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.show();
In [ ]: