Convolutional Neural Networks (Part I)

This document provides an overview of convolutional neural networks (CNNs). It discusses how CNNs use convolution operations instead of general matrix multiplication, which lets them exploit sparse interactions, parameter sharing, and equivariance to translation, making them well suited to grid-like data such as images; pooling additionally gives approximate invariance to small translations. The document outlines CNN architecture and operations such as convolution, cross-correlation, and pooling.

Convolutional Neural Networks (Part I)
• SHAHRULLOHON LUTFILLOHONOV
• shahrullo@pusan.ac.kr
• PNU DataLab
• 2020/10/09
Outline
• Overview
• Motivation
• The Convolution Operation
• Pooling
• Convolution and Pooling as an Infinitely Strong Prior

2
Outline
• Overview
• Motivation
• The Convolution Operation
• Pooling
• Convolution and Pooling as an Infinitely Strong Prior

3
Overview of Convolutional Networks
• Convolutional Networks, a.k.a. Convolutional Neural Networks (CNNs), are a specialized kind of neural network
• For processing data that has a known grid-like topology
• Ex: time-series data, which is a 1-D grid, taking samples at regular intervals
• Image data, which is a 2-D grid of pixels

• They utilize convolution, which is a specialized kind of linear operation
4
Key Idea
• Replace matrix multiplication in neural nets with convolution

• Everything else stays the same

• Maximum likelihood

• Back-propagation

• Etc.

5
Overview of Convolutional Networks
A computer sees an image as an array of
numbers

6
Outline
• Overview
• Motivation
• The Convolution Operation
• Pooling
• Convolution and Pooling as an Infinitely Strong Prior

7
Why Convolution Instead of Matrix
Multiplication?

We saw before:

• A series of matrix multiplications:

8
A Problem

[Figure: a network with an input layer and an output layer]
• Will a NN that recognizes the left image as a flower
also recognize the one on the right as a flower?
• Need a network that will “fire” regardless of the precise location of the target object
9
Motivation For Using Convolution Networks

• 1. Convolution leverages three important ideas to improve ML systems:


• Sparse interactions
• Parameter sharing
• Equivariant representations
• 2. Convolution also allows for working with inputs of variable size

10
Motivation: Sparse Connectivity

• Fully connected network: each output unit is computed by full matrix multiplication, with no sparse connectivity
11
Motivation: Sparse Connectivity

• Kernel of size 3: each output unit depends on only three neighboring inputs
12
Motivation: Sparse Connectivity

• It is possible to obtain good performance while keeping the kernel several orders of magnitude smaller than the input

• Connections in CNNs are sparse, but units in deeper layers are indirectly connected to all of the input (larger receptive field sizes)
13
Practical Example

Fully connected layer: ~7.5 billion parameters (= 224² × 224² × 3)
Sparse layer with a 3×3 kernel: ~1,354 thousand parameters (= 224² × 3 × 3 × 3)

Convolution uses thousands of times fewer parameters, and a convolutional layer also incorporates parameter sharing, so this number will decrease further
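As a quick sanity check on these numbers, here is a minimal sketch (Python; the 224×224×3 input size and 3×3 kernel are the ones assumed on this slide):

```python
# Parameter counts for a 224x224x3 input with one output unit per spatial position.
height, width, channels = 224, 224, 3
num_inputs = height * width * channels         # 150,528 input values
num_outputs = height * width                   # 50,176 output units

fully_connected = num_inputs * num_outputs     # every output connected to every input
sparse_3x3 = num_outputs * (3 * 3 * channels)  # every output sees only a 3x3x3 window

print(fully_connected)                 # 7,552,892,928  (~7.5 billion)
print(sparse_3x3)                      # 1,354,752      (~1,354 thousand)
print(fully_connected // sparse_3x3)   # 5575 -> thousands of times fewer parameters
```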
14
Motivation: Parameter Sharing

• Parameter sharing refers to using the same parameter for more than one function in a model

• Plain vanilla NN: Each element of the weight matrix is used exactly once when computing the

output of a layer

• It is multiplied by one element of the input and never revisited

• Parameter sharing is synonymous with tied weights

• Weight applied to one input is tied to the value of a weight applied elsewhere

• Each member of the kernel is used in every position of the input (except at the boundary)

15
How Parameter Sharing Works

• 1. Convolutional model: black arrows indicate uses of the central element of a 3-element kernel

• 2. Fully connected model: single black arrow indicates use of the central element of the weight
matrix
• Model has no parameter sharing,
so the parameter is used only once

16
Motivation: Equivariance To Translation

• The particular form of parameter sharing leads to equivariance to translation
• Equivariant means that if the input changes, the output changes in the same way
• A function f is equivariant to a function g if f(g(x)) = g(f(x))
• If g is a function that translates the input, i.e., shifts it, then the convolution function is equivariant to g (illustrated in the sketch below)
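To make this concrete, here is a minimal numerical sketch (NumPy; the signal, the kernel, and the zero-padded shift helper are all illustrative choices, not part of the slides): convolving a shifted input gives the shifted convolution output.

```python
import numpy as np

x = np.array([0., 1., 3., 2., 0., 0.])   # toy 1-D signal (trailing zeros avoid edge effects)
w = np.array([1., -1.])                   # toy kernel

def shift(a, k=1):
    """Translate a signal to the right by k samples, padding with zeros."""
    return np.concatenate([np.zeros(k), a[:-k]])

shift_then_conv = np.convolve(shift(x), w, mode="full")   # translate first, then convolve
conv_then_shift = shift(np.convolve(x, w, mode="full"))   # convolve first, then translate

# Equivariance to translation: both orders give the same result
print(np.allclose(shift_then_conv, conv_then_shift))      # True
```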

17
Various instances of cats detected due to the property of translational equivariance.

18
Absence of Equivariance
• In some cases, we may not wish to share parameters

across entire image

• If image is cropped to be centered on a face, we

may want different features from different parts of

the face

• Part of the network processing the top of the face

looks for eyebrows

• Part of the network processing the bottom of the

face looks for the chin

• Convolution is not equivariant to certain other image operations, such as changes in scale or rotation

• Other mechanisms are needed for such

transformations
19
A Problem

• Scan for the desired object


• “Look” for the target object at each position
• At each location, the entire region is sent through the NN
20
Outline
• Overview
• Motivation
• The Convolution Operation
• Pooling
• Convolution and Pooling as an Infinitely Strong Prior

21
Dropped Ball

• Drop a ball and let it travel a distance a with probability f(a); drop it again from where it lands and let it travel a further distance b with probability g(b)
• To find how likely the ball ends up a total distance c away, we consider all the possible ways of getting there

22
What is Convolution?

• The probability for each case (a, b) is f(a)·g(b)
• Total likelihood of ending up at c: the sum over all ways with a + b = c, i.e. Σ_{a+b=c} f(a)·g(b)
• The convolution of f and g, evaluated at c, is: (f ∗ g)(c) = Σ_a f(a)·g(c − a)
• If we substitute b = c − a, this is exactly the total likelihood above
23
What is Convolution?
• Convolution – an operation on two functions of a real-valued argument

• Input x: a multidimensional array of data
• Kernel w: an array of parameters adapted by the learning algorithm
• Output: the feature map

Arrays of input and kernel are referred to as tensors


24
Convolution Operation

• Continuous: s(t) = (x ∗ w)(t) = ∫ x(a) w(t − a) da
• Discrete: s(t) = (x ∗ w)(t) = Σ_{a = −∞}^{+∞} x(a) w(t − a)
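A minimal sketch of the discrete formula above (Python, assuming finite-length arrays so the infinite sum becomes a finite one; the example values are arbitrary):

```python
import numpy as np

def conv1d(x, w):
    """s(t) = sum_a x(a) * w(t - a), for finite 1-D arrays x and w."""
    s = np.zeros(len(x) + len(w) - 1)
    for t in range(len(s)):
        for a in range(len(x)):
            if 0 <= t - a < len(w):       # keep only terms where w(t - a) exists
                s[t] += x[a] * w[t - a]
    return s

x = np.array([1., 2., 3.])
w = np.array([0., 1., 0.5])
print(conv1d(x, w))         # [0.  1.  2.5 4.  1.5]
print(np.convolve(x, w))    # NumPy's implementation gives the same result
```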

25
Two-Dimensional Convolution
• Convolutions over more than one axis
• The operation is commutative; commutativity arises because we have flipped the kernel relative to the input

• The commutative (flipped) form is easier to implement, since there is less variation in the range of valid values of m and n
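For reference, the two formulas behind this slide can be written out as follows (a standard statement of 2-D discrete convolution; the symbols I for the input image and K for the kernel are the usual ones, not taken verbatim from the slide):

```latex
% 2-D discrete convolution of an input image I with a kernel K,
% followed by the commutative (flipped) form, in which m and n
% range only over the small kernel.
\begin{align}
S(i,j) = (I * K)(i,j) &= \sum_m \sum_n I(m,n)\, K(i-m,\, j-n) \\
                      &= \sum_m \sum_n I(i-m,\, j-n)\, K(m,n)
\end{align}
```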

26
Cross-Correlation
• Same as convolution, but without flipping the kernel

• Both operations are often referred to as convolution, whether or not the kernel is flipped

• In ML, the learning algorithm will learn the appropriate values of the kernel in the appropriate place

27
Convolution vs Cross-Correlation
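Since the comparison figure is not reproduced here, the following minimal NumPy sketch makes the same point in 1-D (the arrays are illustrative): cross-correlating with a flipped kernel gives exactly the convolution, and the same relationship holds for 2-D images.

```python
import numpy as np

x = np.array([1., 2., 3., 4.])    # toy input
w = np.array([1., 0., -1.])       # toy kernel

conv  = np.convolve(x, w, mode="full")           # convolution: the kernel is flipped
xcorr = np.correlate(x, w[::-1], mode="full")    # cross-correlation with the flipped kernel

print(conv)                      # [ 1.  2.  2.  2. -3. -4.]
print(np.allclose(conv, xcorr))  # True: flipping the kernel converts one operation into the other
```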

28
Overall Architecture

Input Layer → Convolutional Layer → Detector layer (non-linearity) → Pooling Layer → Next layers…

• Convolutional layer: perform several convolutions in parallel to produce a set of linear activations
• Detector layer: each linear activation is run through a nonlinear activation function such as ReLU
• Pooling layer: use a pooling function to modify the output of the layer further
29
Convolution

Input Layer → Convolutional Layer → Detector layer (non-linearity) → Pooling Layer → Next layers…

30
Convolution

Convolve image with kernel having weights w (learned by backpropagation)
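A minimal sketch of what this looks like in code (Python; a "valid" sliding-window cross-correlation, which is what most deep learning libraries compute under the name convolution; the image and the edge-detecting weights are illustrative, since in practice w is learned):

```python
import numpy as np

def conv2d_valid(image, w):
    """Slide the kernel w over the image; each output is a dot product with one window."""
    H, W = image.shape
    kH, kW = w.shape
    out = np.zeros((H - kH + 1, W - kW + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kH, j:j + kW] * w)
    return out

image = np.random.rand(6, 6)            # toy grayscale image
w = np.array([[1., 0., -1.],
              [1., 0., -1.],
              [1., 0., -1.]])           # simple vertical-edge kernel (weights would be learned)
print(conv2d_valid(image, w).shape)     # (4, 4) feature map
```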


31
32
The Convolution Operation

Generally, an image is a 3-D array of pixel values with RGB color channels (height × width × depth)
33
Non-Linearity

Input Layer → Convolutional Layer → Detector layer (non-linearity) → Pooling Layer → Next layers…

34
The detector (activation) layer

After obtaining the feature map, apply an elementwise non-linearity to obtain a transformed feature map (same size)
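A minimal sketch of this detector stage (Python; the feature-map values are illustrative):

```python
import numpy as np

def relu(z):
    """Elementwise non-linearity: negative activations become zero, shape is unchanged."""
    return np.maximum(z, 0.0)

feature_map = np.array([[ 1.5, -0.3],
                        [-2.0,  0.7]])
print(relu(feature_map))         # [[1.5 0. ]
                                 #  [0.  0.7]]
```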
35
Outline
• Overview
• Motivation
• The Convolution Operation
• Pooling
• Convolution and Pooling as an Infinitely Strong Prior

36
What is Pooling?
• Pooling in a CNN is a subsampling step
• It replaces the output at a location with a summary statistic of nearby outputs

Input Layer → Convolutional Layer → Detector layer (non-linearity) → Pooling Layer → Next layers…

37
38
39
40
41
42
43
Types of Pooling Functions

• 4 popular types of pooling functions (a minimal sketch of the first two follows this list):

• 1. Max Pooling

• 2. Average Pooling of a rectangular neighborhood

• 3. L2 norm of a rectangular neighborhood

• 4. Weighted average

• based on the distance from the central pixel
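A minimal sketch of the first two of these (Python; 2×2 non-overlapping windows, assuming the feature-map dimensions are divisible by 2; the values are illustrative):

```python
import numpy as np

def pool2x2(fmap, mode="max"):
    """Replace each non-overlapping 2x2 neighborhood with a single summary statistic."""
    H, W = fmap.shape
    blocks = fmap.reshape(H // 2, 2, W // 2, 2)   # group entries into 2x2 windows
    return blocks.max(axis=(1, 3)) if mode == "max" else blocks.mean(axis=(1, 3))

fmap = np.array([[1., 2., 5., 6.],
                 [3., 4., 7., 8.],
                 [0., 1., 2., 3.],
                 [1., 0., 3., 2.]])
print(pool2x2(fmap, "max"))    # [[4. 8.]
                               #  [1. 3.]]
print(pool2x2(fmap, "mean"))   # [[2.5 6.5]
                               #  [0.5 2.5]]
```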

44
Pooling causes Translation Invariance

• Pooling makes the representation become approximately invariant to


small translations of the input
• If we translate the input by a small amount, the values of most of the outputs do not change

45
Max Pooling Produces Invariance To Translation
• View of middle of output of a convolutional layer
Outputs of maxpooling

Outputs of nonlinearity

• Same network after the input has been shifted by one pixel

• Every input value has changed, but only half of the output values have changed, because max pooling units are sensitive only to the maximum value in the neighborhood, not its exact location
46
Why Is Translation Invariance IMPORTANT?
• Invariance to translation is important if we care about whether a feature is present

rather than exactly where it is


• For detecting a face we just need to know that an eye is present in a region, not its exact location

47
Pooling with Downsampling
• Downsampling allows the features to be flexibly positioned

Downsampling

• Downsampling the pixels does not change the object (the bird is still a bird)

• We can downsample the pixels to make the image smaller:
• fewer parameters are needed to characterize the image
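A minimal sketch of this kind of downsampling (Python; stride-2 subsampling of an illustrative 8×8 image):

```python
import numpy as np

image = np.arange(64, dtype=float).reshape(8, 8)   # toy 8x8 image
downsampled = image[::2, ::2]                      # keep every 2nd row and every 2nd column
print(image.shape, "->", downsampled.shape)        # (8, 8) -> (4, 4): 4x fewer values to describe
```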
48
Outline
• Overview
• Motivation
• The Convolution Operation
• Pooling
• Convolution and Pooling as an Infinitely Strong Prior

49
Prior Parameter Distribution

• Role of a prior probability distribution over the parameters of a model is:


• Encode our beliefs about which models are reasonable, before seeing the data

50
Weak and Strong Priors
• A weak prior
• A distribution with high entropy
• e.g., Gaussian with high variance
• Data can move parameters freely

• A strong prior
• It has very low entropy
• e.g., a Gaussian with low variance
• Such a prior plays a more active role in determining
where the parameters end up

51
Infinitely Strong Prior

• An infinitely strong prior places zero probability on some parameters

• It says that some parameter values are forbidden regardless of support from data

• With an infinitely strong prior, the forbidden parameter values cannot be changed, irrespective of the data

52
Convolution As Infinitely Strong Prior
• Convolutional net is similar to a fully connected net but with an infinitely
strong prior over its weights
• weights for one hidden unit must be identical to the weights of its neighbor, but shifted
in space
• weights must be zero, except for in the small spatially contiguous receptive field
assigned to that hidden unit
• The function the layer should learn contains only local interactions and is equivariant to
translation

53
Pooling As Infinitely Strong Prior
• The use of pooling is an infinitely strong prior that each unit should be invariant to
small translations
• Maxpooling example:

54
Key Insight: Underfitting
• Convolution and pooling can cause underfitting
• Underfitting happens when the model has high bias

• Convolution and pooling are only useful when the


assumptions made by the prior are reasonably
accurate
• Pooling may be inappropriate in some cases
• If the task relies on preserving spatial information
• Using pooling on all features can increase training error

55
Architecture and the training process
• Each layer produces values that are obtained from the previous layer by performing a matrix multiplication

A CNN is composed of a stack of several building blocks:


• Convolutional layer
• Pooling layer
• Fully connected layer
56
Architecture and the training process

A model’s performance under particular kernels and weights is calculated with a loss
function through forward propagation on a training dataset

57
Architecture and the training process

Learnable parameters (kernels and weights) are updated according to the loss value through backpropagation with a gradient descent optimization algorithm
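The update performed in this step is the usual gradient descent rule (written generically here; η denotes the learning rate and L the loss):

```latex
% Gradient descent update of each learnable parameter w with learning rate \eta:
w \leftarrow w - \eta \,\frac{\partial L}{\partial w}
```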

58
Classification with Convolutional Networks

59
Conclusion
• Scale up neural networks to process very large images/video sequences

• Sparse Interactions

• Parameter Sharing

• Automatically generalize across spatial translations of inputs

• Computationally efficient compared to general matrix multiplication

• Invariant to small changes

• Infinitely Strong Prior

• Applicable to any input that is laid out on a grid (1-D, 2-D, 3-D, …)
60
THANK YOU VERY MUCH

QUESTIONS?
61
