Deep Learning Exercises
Exercise 1: Neural Networks
DL Tutors Team
April 25, 2023

1 Basic Optimizer
In this course we will implement advanced optimization schemes; in this first exercise we start with basic Stochastic Gradient Descent (SGD).
Task:
• The constructor of the Sgd class receives the learning rate as a single argument of type float.
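The SGD update rule is w(k+1) = w(k) − η · ∇L(w(k)), where η is the learning rate. A minimal sketch of such a class, assuming the update method is named calculate_update(weight_tensor, gradient_tensor) (the rest of the interface is not spelled out in this excerpt):

    class Sgd:
        def __init__(self, learning_rate: float):
            self.learning_rate = learning_rate

        def calculate_update(self, weight_tensor, gradient_tensor):
            # w(k+1) = w(k) - eta * grad_w L(w(k)); works element-wise on
            # NumPy arrays and on plain floats alike.
            return weight_tensor - self.learning_rate * gradient_tensor

For example, Sgd(1.0).calculate_update(w, g) simply returns w − g.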
You can verify your implementation using the provided test suite by passing the command-line parameter TestOptimizers1.
2 Base Layer
We will build a small layer-oriented Deep Learning framework in this exercise. Layer-oriented frameworks offer their users a higher level of abstraction than graph-oriented frameworks. This approach limits flexibility but enables easy experimentation with conventional architectures. Every layer in these architectures has to implement two fundamental operations: forward(input_tensor) and backward(error_tensor). These operations are the basic steps executed during training and testing.
We distinguish between trainable and non-trainable layers. Trainable layers have parameters that are optimized during training (e.g. the Fully Connected layer, which must be implemented in this task), while non-trainable layers remain fixed (e.g. the ReLU activation function).
Task:
• This class will be inherited by every layer in our framework. For information on inheritance in Python, please refer to the official Python documentation.
• Write a constructor for this class receiving no arguments. In this constructor, initialize
a boolean member trainable with False. This member will be used to distinguish
trainable from non-trainable layers.
• Optionally, you can add other members like a default weights parameter, which might
come in handy.
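Putting the bullet points together, a minimal sketch of the base class could look like this; the weights default is just the optional convenience member mentioned above, not a requirement:

    class BaseLayer:
        def __init__(self):
            # Distinguishes trainable layers (parameters updated during
            # training) from non-trainable ones such as activations.
            self.trainable = False
            # Optional convenience member; trainable subclasses overwrite it.
            self.weights = None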
3 Fully Connected Layer
The Fully Connected (FC) layer is the theoretical backbone of layer-oriented architectures. It performs a linear operation on its input.
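For a batch X with one sample per row, the forward pass computes Y = X · W. A sketch under assumptions: the constructor signature, the convention of folding the bias into the weights via an appended column of ones, and the gradient_weights member are not specified in this excerpt:

    import numpy as np

    from Layers.Base import BaseLayer  # base class from Section 2 (assumed path)

    class FullyConnected(BaseLayer):
        def __init__(self, input_size, output_size):
            super().__init__()
            self.trainable = True
            # Weights and bias stored jointly: the extra row multiplies a
            # column of ones appended to the input.
            self.weights = np.random.uniform(0.0, 1.0, (input_size + 1, output_size))

        def forward(self, input_tensor):
            batch_size = input_tensor.shape[0]
            self._input = np.hstack([input_tensor, np.ones((batch_size, 1))])
            return self._input @ self.weights

        def backward(self, error_tensor):
            # Gradient w.r.t. the weights, later consumed by the optimizer.
            self.gradient_weights = self._input.T @ error_tensor
            # Error for the previous layer; the bias column is stripped.
            return (error_tensor @ self.weights.T)[:, :-1]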
Task:
4 ReLU Layer
The Rectified Linear Unit is the standard activation function in Deep Learning nowadays. It has revolutionized Neural Networks because it reduces the effect of the “vanishing gradient” problem.
Task:
Implement a class ReLU in the file “ReLU.py” in the folder “Layers”. This class also has to provide the methods forward(input_tensor) and backward(error_tensor).
• Write a constructor for this class, receiving no arguments. The ReLU does not have trainable parameters, so you don’t have to change the inherited member trainable.
• Implement a method forward(input_tensor) which returns a tensor that serves as the input tensor for the next layer.
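The ReLU computes f(x) = max(0, x) element-wise; its derivative is 1 for x > 0 and 0 otherwise. A minimal sketch, where the backward pass follows this standard derivative since its bullet point is not reproduced in this excerpt:

    import numpy as np

    from Layers.Base import BaseLayer  # assumed path, as above

    class ReLU(BaseLayer):
        def __init__(self):
            super().__init__()

        def forward(self, input_tensor):
            self._input = input_tensor
            return np.maximum(0.0, input_tensor)

        def backward(self, error_tensor):
            # Pass the error through only where the input was positive.
            return error_tensor * (self._input > 0)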
You can verify your implementation using the provided test suite by passing the command-line parameter TestReLU.
5 SoftMax Layer
The SoftMax activation function is used to transform the logits (the output of the network)
into a probability distribution. Therefore, SoftMax is typically used for classification tasks.
Task:
Implement a class SoftMax in the file “SoftMax.py” in the folder “Layers”. This class also has to provide the methods forward(input_tensor) and backward(error_tensor).
• Implement a method forward(input_tensor) which returns the estimated class probabilities for each row, where each row represents an element of the batch.
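Row-wise, SoftMax computes y_k = exp(x_k) / Σ_j exp(x_j). A minimal sketch; subtracting the row maximum before exponentiation (a standard trick for numerical stability) and the closed-form backward pass are assumptions, not requirements stated above:

    import numpy as np

    from Layers.Base import BaseLayer  # assumed path, as above

    class SoftMax(BaseLayer):
        def __init__(self):
            super().__init__()

        def forward(self, input_tensor):
            # The shift by the row-wise maximum cancels in the ratio.
            shifted = input_tensor - np.max(input_tensor, axis=1, keepdims=True)
            exps = np.exp(shifted)
            self._output = exps / np.sum(exps, axis=1, keepdims=True)
            return self._output

        def backward(self, error_tensor):
            # Closed form of the Jacobian-vector product:
            # E_prev = y * (E - sum_j E_j * y_j), evaluated per batch row.
            weighted_sum = np.sum(error_tensor * self._output, axis=1, keepdims=True)
            return self._output * (error_tensor - weighted_sum)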
You can verify your implementation using the provided test suite by passing the command-line parameter TestSoftMax.
6 Cross Entropy Loss
The Cross Entropy Loss is often used in classification tasks, typically in conjunction with SoftMax (or Sigmoid).
Task:
• Implement a method backward(label_tensor) which returns the error tensor for the previous layer. Backpropagation starts here, hence no error tensor is needed; instead, we need the label tensor.
Hint: the same hint as before applies.
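With one-hot labels, the loss sums −ln(ŷ + ε) over the true-class probabilities of the batch, where a small ε guards against log(0). A minimal sketch, assuming the forward method receives both the prediction and the label tensor:

    import numpy as np

    class CrossEntropyLoss:
        def forward(self, prediction_tensor, label_tensor):
            self._prediction = prediction_tensor
            eps = np.finfo(float).eps  # guards against log(0)
            # label_tensor is assumed one-hot; the mask selects the
            # predicted probability of each sample's true class.
            return np.sum(-np.log(prediction_tensor[label_tensor == 1] + eps))

        def backward(self, label_tensor):
            eps = np.finfo(float).eps
            # Backpropagation starts here: E = -y / y_hat, element-wise.
            return -label_tensor / (self._prediction + eps)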
You can verify your implementation using the provided test suite by passing the command-line parameter TestCrossEntropyLoss.
7 Neural Network
The Neural Network defines the whole architecture by containing all its layers from the input to the loss layer. The network manages training and testing: it calls all forward methods, passing the data from the beginning to the end, and afterwards performs the optimization by calling all backward passes.
Task:
• Implement a method forward using input from the data layer and passing it through all layers of the network. Note that the data layer provides an input_tensor and a label_tensor upon calling next() on it. The output of this function should be the output of the last layer (i.e. the loss layer) of the network.
• Implement a method backward starting from the loss layer, passing it the label_tensor for the current input and propagating the resulting error back through the network.
• Finally, implement a convenience method test(input_tensor) which propagates the input_tensor through the network and returns the prediction of the last layer. For classification tasks we typically query the probabilistic output of the SoftMax layer.
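A minimal sketch tying the three methods together; the members layers, data_layer, and loss_layer, as well as the stored label tensor, are assumptions, since only the methods themselves are specified above:

    class NeuralNetwork:
        def __init__(self):
            self.layers = []        # all layers up to (not including) the loss layer
            self.data_layer = None  # provides next() -> (input_tensor, label_tensor)
            self.loss_layer = None  # e.g. the CrossEntropyLoss from above

        def forward(self):
            input_tensor, self._label_tensor = self.data_layer.next()
            for layer in self.layers:
                input_tensor = layer.forward(input_tensor)
            # Output of the last layer of the network, i.e. the loss value.
            return self.loss_layer.forward(input_tensor, self._label_tensor)

        def backward(self):
            # Backpropagation starts at the loss layer with the stored labels.
            error_tensor = self.loss_layer.backward(self._label_tensor)
            for layer in reversed(self.layers):
                error_tensor = layer.backward(error_tensor)

        def test(self, input_tensor):
            # Propagate through all layers except the loss layer; for
            # classification this yields the SoftMax probabilities.
            for layer in self.layers:
                input_tensor = layer.forward(input_tensor)
            return input_tensor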
You can verify your implementation using the provided test suite by passing the command-line parameter TestNeuralNetwork1.
8 Test, Debug and Finish
Task:
Debug your implementation until every test in the suite passes. You can run all tests by providing no command-line parameter. To run the unit tests, you can either execute them with Python in the terminal or with the dedicated unittest environment of PyCharm [1]. We recommend the latter, as it provides a better overview of all tests. For the automated computation of the bonus points achieved in one exercise, run the unit tests with the bonus flag in a terminal; see the manual for the exact command. For dispatching your folder, use the corresponding command from the manual.
[1] https://www.jetbrains.com/help/pycharm/creating-and-editing-run-debug-configurations.html