
Department of Electrical Engineering

Faculty Member:_________________ Date: _________________

Course/Section:____________________ Semester: _______________

CS-477 Computer Vision


Lab 12: Implementation of a Simple
Convolutional Neural Network
Name: _________________ Reg. No: _________________

PLO4-CLO4: Investigation (5 marks)
PLO5-CLO5: Modern Tool Usage (5 marks)
PLO8-CLO6: Ethics (5 marks)
PLO9-CLO7: Individual and Team Work (5 marks)

Lab 12: Implementation of a Simple Convolutional
Neural Network

Objectives
- Understand PyTorch's Tensor library and neural networks at a high level
- Get an introduction to CNNs
- Train a CNN classifier

Lab Report Instructions


All questions should be answered precisely to get maximum credit. The lab report must include the
following items:
✔ Lab objectives

✔ Python codes

✔ Results (graphs/tables) duly commented and discussed

✔ Conclusion
Introduction to CNN
Convolutional Neural Network (CNN): A CNN is a type of deep neural network
designed to recognize and process visual data with a grid-like structure, such as
images. CNNs are particularly effective in image recognition, object detection, and
other computer vision tasks.
Grid-like Topology: Images can be thought of as a grid of pixels, where each pixel
represents the smallest unit of information. The arrangement of pixels creates a
grid-like structure, and CNNs leverage this spatial organization for more effective
feature extraction.
Digital Image: In the context of CNNs, digital images are represented as a grid of
pixels. Each pixel's position in the grid corresponds to a specific location in the
image, and the pixel value represents the color and intensity at that point. The
combination of pixel values across the grid forms the complete visual representation
of the image.
Binary Representation: Although images are ultimately stored in binary form, digital
images typically use a range of values to represent colors. In the RGB (Red, Green,
Blue) color space, for example, each pixel is represented by three values
corresponding to the intensity of each color channel. The values are usually integers
ranging from 0 to 255, or they may be normalized to the range [0, 1].
Pixel Values: Pixel values indicate the brightness and color information of the image.
In grayscale images, each pixel has a single value representing intensity. In color
images, each pixel has multiple values corresponding to the intensities of different
color channels.
CNNs use convolutional layers to automatically and adaptively learn spatial
hierarchies of features from input images. These layers contain filters that are
convolved with the input image to extract features such as edges, textures, and more
complex patterns. Pooling layers are often used to reduce the spatial dimensions of the
data, and fully connected layers integrate the learned features for final classification
or regression tasks.
In summary, CNNs are a powerful class of neural networks designed for processing
grid-like data, making them especially effective for tasks involving images and spatial
relationships within those images.
Figure 1: Source: https://pippin.gimp.org/image_processing/images/sample_grid_a_square.png

A Convolutional Neural Network (CNN) typically consists of three fundamental layers:
a convolutional layer, a pooling layer, and a fully connected layer.

Figure 2: Source: https://www.mathworks.com/videos/introduction-to-deep-learning-what-are-convolutional-neural-networks--1489512765771.html

Convolution Layer
At the core of the CNN architecture lies the convolutional layer, which bears the
primary computational load of the network. This layer executes a dot product
operation between two matrices: one matrix comprises learnable parameters known as a
kernel, and the other represents a confined section of the receptive field. While the
kernel is spatially smaller than the image, its depth extends through the full depth
of the input. For instance, in an image with three (RGB) channels, the kernel's
height and width are spatially small, yet its depth spans all three channels.
Illustration of Convolution Operation
In the forward pass, the kernel slides across the height and width of the image,
computing the dot product with each receptive region it covers. This process yields a
two-dimensional representation known as an activation map, showing the kernel's
response at each spatial position within the image. The step size of the kernel's
movement is termed the "stride."

For an input of size W x W x D, with Dout kernels of spatial size F, a stride of S,
and a specified amount of padding P, the size of the output volume is determined by
the following formula:

Wout = (W - F + 2P)/S + 1

This yields an output volume of size Wout x Wout x Dout.
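As a quick check of this formula, here is a minimal PyTorch sketch (the shapes are illustrative choices, not values from the lab):

import torch
import torch.nn as nn

# Input: W = 32, D = 3 (e.g. an RGB image), batch size 1
x = torch.randn(1, 3, 32, 32)

# Dout = 8 kernels, spatial size F = 5, stride S = 1, padding P = 2
conv = nn.Conv2d(in_channels=3, out_channels=8, kernel_size=5, stride=1, padding=2)

out = conv(x)
print(out.shape)  # torch.Size([1, 8, 32, 32]), since (32 - 5 + 2*2)/1 + 1 = 32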

Pooling Layer
Following the convolutional layer, the pooling layer plays a crucial role in the
Convolutional Neural Network (CNN) architecture. Its function involves replacing
specific locations in the network's output by computing a summary statistic of the
neighboring outputs. This strategic substitution aids in diminishing the spatial size of
the representation, leading to a reduction in computational demands and the number
of weights. Notably, the pooling operation is applied independently to each slice of
the representation.
Various pooling functions exist, each offering distinct ways of summarizing
information within a neighborhood. Options include the average of the rectangular
neighborhood, the L2 norm of the rectangular neighborhood, and a weighted average
based on the distance from the central pixel. However, among these, max pooling
stands out as the most widely employed method. Max pooling entails selecting the
maximum output value from the neighborhood, providing a robust approach to
retaining essential features while reducing the dimensionality of the representation.
If we have an activation map of size W x W x D, a pooling kernel of spatial size F,
and stride S, then the size of the output volume can be determined by the following
formula:

Wout = (W - F)/S + 1

This will yield an output volume of size Wout x Wout x D. In all cases, pooling
provides some translation invariance, which means that an object remains
recognizable regardless of where it appears in the frame.
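A minimal max-pooling sketch in PyTorch (sizes chosen for illustration) shows the spatial reduction while the depth D is left unchanged:

import torch
import torch.nn as nn

# Activation map: W = 32, D = 8
x = torch.randn(1, 8, 32, 32)

# Pooling kernel F = 2, stride S = 2
pool = nn.MaxPool2d(kernel_size=2, stride=2)

out = pool(x)
print(out.shape)  # torch.Size([1, 8, 16, 16]), since (32 - 2)/2 + 1 = 16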

Fully Connected Layer

Neurons in this layer have full connectivity with all neurons in the preceding and
succeeding layers, as in a regular fully connected neural network (FCNN). Its output
can therefore be computed as usual: a matrix multiplication followed by a bias
offset. The FC layer maps the learned feature representation to the final output.
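In PyTorch this is typically a flatten followed by nn.Linear; a small illustrative sketch (the sizes are hypothetical, continuing the pooling example above):

import torch
import torch.nn as nn

# A pooled feature map of size 16 x 16 with 8 channels, flattened to a vector
x = torch.randn(1, 8, 16, 16)
fc = nn.Linear(8 * 16 * 16, 10)  # map the 2048-dim representation to 10 class scores

scores = fc(x.view(x.size(0), -1))  # matrix multiplication plus bias
print(scores.shape)  # torch.Size([1, 10])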

Non-Linearity Layers
Given that convolution is inherently a linear operation and images exhibit significant
non-linearity, non-linearity layers are frequently inserted immediately after the
convolutional layer to impart non-linearity to the activation map.
Several types of non-linear operations are commonly employed, with notable
examples including:

Sigmoid:
The sigmoid non-linearity is expressed mathematically as σ(κ) = 1/(1 + e^(-κ)). It
takes a real-valued number and compresses it into the range between 0 and 1. However,
a drawback of the sigmoid function is its tendency to produce gradients close to
zero, particularly at its tails. This can hinder backpropagation, as the gradient may
become too small for effective weight updates. Additionally, because sigmoid outputs
are never zero-centered, neurons in later layers receive inputs that are always
positive; the gradients on their weights then become either all positive or all
negative, leading to undesirable zig-zag dynamics in the weight updates.

Tanh:
Tanh transforms a real-valued number to the range [-1, 1]. Similar to sigmoid, tanh
activations can saturate, but unlike sigmoid, its output is zero-centered.
ReLU (Rectified Linear Unit):
ReLU has gained widespread popularity in recent years. It computes the function f(κ)
= max(0, κ), effectively thresholding the activation at zero. Compared to sigmoid and
tanh, ReLU converges faster, reportedly accelerating learning by as much as a factor
of six. Despite its advantages, ReLU has a potential drawback during training: if a
large gradient flows through a neuron, its weights may be updated in such a way that
the neuron never activates again (the "dying ReLU" problem). This issue can be
mitigated by carefully selecting an appropriate learning rate.
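A quick way to compare these three activations numerically (a small illustrative sketch):

import torch

k = torch.tensor([-2.0, -0.5, 0.0, 0.5, 2.0])

print(torch.sigmoid(k))  # squashes into (0, 1); saturates at the tails, where gradients vanish
print(torch.tanh(k))     # squashes into (-1, 1); zero-centered, but still saturates
print(torch.relu(k))     # thresholds at zero: max(0, k)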

Watch the following video to understand how a CNN works:

https://www.youtube.com/watch?v=HGwBXDKFk9I

Task 1: Convolution on Images ______________________________


Download a color image for this task. Write a function in Python that takes as input
arguments an image, a square filter (3x3, 5x5, etc.), a padding size, and a stride.
The function must output the result of applying the convolution operation to the
input image. Implement convolution and showcase the results by trying different
filters, padding values, and strides. Provide the code and at least 4 screenshots of
the final outputs.
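One possible skeleton for such a function, as a non-authoritative starting sketch (it assumes a NumPy image array, zero padding, and the same filter applied to each channel independently; verify and extend it for your own report):

import numpy as np

def convolve(image, kernel, padding=0, stride=1):
    """Apply a square kernel to each channel of an image independently."""
    if image.ndim == 2:                      # treat grayscale as a single channel
        image = image[:, :, None]
    h, w, c = image.shape
    f = kernel.shape[0]
    padded = np.pad(image, ((padding, padding), (padding, padding), (0, 0)))
    out_h = (h - f + 2 * padding) // stride + 1
    out_w = (w - f + 2 * padding) // stride + 1
    out = np.zeros((out_h, out_w, c))
    for ch in range(c):
        for i in range(out_h):
            for j in range(out_w):
                patch = padded[i * stride:i * stride + f, j * stride:j * stride + f, ch]
                out[i, j, ch] = np.sum(patch * kernel)
    return out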
### TASK 1 EXPLANATION STARTS HERE ###

### TASK 1 EXPLANATION ENDS HERE ###

### TASK 1 CODES START HERE ###

### TASK 1 CODES END HERE ###

### TASK 1 SCREENSHOTS START HERE ###

### TASK 1 SCREENSHOTS END HERE ###

Task 2: Simple CNN ____________________________________________


Build a simple convolutional neural network in PyTorch and train it to recognize
handwritten digits using the MNIST dataset (training a classifier on the MNIST
dataset can be regarded as the "hello world" of image recognition).
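A minimal starting point might look like the sketch below; the architecture and hyperparameters are illustrative choices, not a prescribed solution:

import torch
import torch.nn as nn
import torch.optim as optim
import torchvision
import torchvision.transforms as transforms

transform = transforms.ToTensor()
train_set = torchvision.datasets.MNIST(root='./data', train=True, download=True, transform=transform)
train_loader = torch.utils.data.DataLoader(train_set, batch_size=64, shuffle=True)

class SimpleCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 16, kernel_size=3, padding=1)   # 28x28 -> 28x28
        self.conv2 = nn.Conv2d(16, 32, kernel_size=3, padding=1)  # 14x14 -> 14x14
        self.pool = nn.MaxPool2d(2, 2)
        self.fc = nn.Linear(32 * 7 * 7, 10)                       # 10 digit classes

    def forward(self, x):
        x = self.pool(torch.relu(self.conv1(x)))   # -> 16 x 14 x 14
        x = self.pool(torch.relu(self.conv2(x)))   # -> 32 x 7 x 7
        return self.fc(x.view(x.size(0), -1))

model = SimpleCNN()
criterion = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

for epoch in range(2):                  # a couple of epochs is enough to see learning
    for images, labels in train_loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
    print(f'epoch {epoch + 1}: loss {loss.item():.4f}')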
### TASK 2 EXPLANATION STARTS HERE ###

### TASK 2 EXPLANATION ENDS HERE ###

### TASK 2 CODES START HERE ###

### TASK 2 CODES END HERE ###

### TASK 2 SCREENSHOTS START HERE ###

### TASK 2 SCREENSHOTS END HERE ###

Task 3: CNN _____________________________________________________


Build a simple convolutional neural network in PyTorch and train it to recognize the
following fashion objects using the Fashion-MNIST dataset, which contains 10 classes
(T-shirt/top, Trouser, Pullover, Dress, Coat, Sandal, Shirt, Sneaker, Bag, Ankle boot).
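Since Fashion-MNIST shares MNIST's 28x28 grayscale format and 10-class layout, a network like the one sketched in Task 2 can be reused; only the dataset loading changes. A sketch of the substitution:

import torchvision
import torchvision.transforms as transforms

# Same image format as MNIST, so the Task 2 model works unchanged
train_set = torchvision.datasets.FashionMNIST(
    root='./data', train=True, download=True, transform=transforms.ToTensor())

classes = ['T-shirt/top', 'Trouser', 'Pullover', 'Dress', 'Coat',
           'Sandal', 'Shirt', 'Sneaker', 'Bag', 'Ankle boot']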
### TASK 3 EXPLANATION STARTS HERE ###

### TASK 3 EXPLANATION ENDS HERE ###

### TASK 3 CODES START HERE ###

### TASK 3 CODES END HERE ###

### TASK 3 SCREENSHOTS START HERE ###

### TASK 3 SCREENSHOTS END HERE ###

Helpful links
https://towardsdatascience.com/convolutional-neural-networks-explained-9cc5188c4939
https://www.youtube.com/watch?v=HGwBXDKFk9I
https://pytorch.org/tutorials/beginner/blitz/cifar10_tutorial.html
https://colab.research.google.com/github/pytorch/tutorials/blob/gh-pages/_downloads/4e865243430a47a00d551ca0579a6f6c/cifar10_tutorial.ipynb#scrollTo=PP9km88QkiZp
