Deep Learning Basics
Lecture 6: Convolutional Neural Networks
Princeton University COS 495
Instructor: Yingyu Liang
Review: convolutional layers
Convolution: two dimensional case

Input (3x4):
a b c d
e f g h
i j k l

Kernel/filter (2x2):
w x
y z

Feature map (first two entries shown):
aw + bx + ey + fz    bw + cx + fy + gz
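A minimal NumPy sketch of this computation (my own illustration, not from the slides): concrete numbers stand in for a through l and w, x, y, z, and the sliding window matches the figure (the kernel is not flipped, i.e., cross-correlation as commonly used in CNNs).

```python
import numpy as np

def conv2d_valid(inp, kernel):
    """2D convolution (no padding, stride 1), as in the figure:
    each output cell is the sum of elementwise products of the
    kernel with the input patch under it."""
    H, W = inp.shape
    kH, kW = kernel.shape
    out = np.zeros((H - kH + 1, W - kW + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(inp[i:i + kH, j:j + kW] * kernel)
    return out

# 3x4 input (stand-ins for a..l) and 2x2 kernel (stand-ins for w, x, y, z)
inp = np.arange(12, dtype=float).reshape(3, 4)
kernel = np.array([[1., 2.], [3., 4.]])
print(conv2d_valid(inp, kernel))   # 2x3 feature map
```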
Convolutional layers
The same kernel weights are shared across all output nodes: with $n$ input nodes, $m$ output nodes, and kernel size $k$, a 1D convolutional layer has only $k$ distinct weights, compared to $m \times n$ for a fully connected layer.
Figure from Deep Learning, by Goodfellow, Bengio, and Courville
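To make the parameter-sharing point concrete, a small back-of-the-envelope sketch (an illustration, not from the slides) counting the weights in a 1D convolutional layer versus a fully connected layer with $n$ input nodes and $m$ output nodes:

```python
# Parameter counts for a 1D layer with n input nodes and m output nodes.
n, k = 1000, 3                 # n input nodes, kernel size k
m = n - k + 1                  # number of output nodes for stride 1, no padding

fully_connected_weights = m * n   # every output connects to every input
convolutional_weights = k         # the same k weights are shared by all m outputs

print(fully_connected_weights)    # 998000
print(convolutional_weights)      # 3
```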
Terminology (figure from Deep Learning, by Goodfellow, Bengio, and Courville)
Case study: LeNet-5
LeNet-5
• Proposed in "Gradient-based learning applied to document recognition", by Yann LeCun, Leon Bottou, Yoshua Bengio and Patrick Haffner, in Proceedings of the IEEE, 1998
• Applies convolution to 2D images (MNIST) and is trained with backpropagation
• Structure: 2 convolutional layers (each followed by pooling) + 3 fully connected layers (shapes worked out in the sketch below)
• Input size: 32x32x1
• Convolution kernel size: 5x5
• Pooling: 2x2
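A rough shape walkthrough under the sizes listed above (a sketch that only tracks tensor shapes; it assumes valid convolutions with stride 1 and 2x2 pooling with stride 2, matching the slides):

```python
def conv_out(size, kernel, stride=1):
    """Output spatial size of a valid convolution."""
    return (size - kernel) // stride + 1

def pool_out(size, window=2, stride=2):
    """Output spatial size of pooling."""
    return (size - window) // stride + 1

s = 32                      # input: 32x32x1
s = conv_out(s, 5); print(s, "x", s, "x 6")    # 28 x 28 x 6
s = pool_out(s);    print(s, "x", s, "x 6")    # 14 x 14 x 6
s = conv_out(s, 5); print(s, "x", s, "x 16")   # 10 x 10 x 16
s = pool_out(s);    print(s, "x", s, "x 16")   # 5 x 5 x 16
flat = s * s * 16;  print(flat)                # 400
# Fully connected part: 400 -> 120 -> 84 -> 10
# (weight matrices 400x120, 120x84, 84x10)
```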
LeNet-5 overall architecture (figure from Gradient-based learning applied to document recognition, by Y. LeCun, L. Bottou, Y. Bengio and P. Haffner)
LeNet-5 layer by layer (figure from Gradient-based learning applied to document recognition, by Y. LeCun, L. Bottou, Y. Bengio and P. Haffner):
• First convolutional layer: filter 5x5, stride 1x1, #filters: 6
• First pooling layer: 2x2, stride 2
• Second convolutional layer: filter 5x5x6, stride 1x1, #filters: 16
• Second pooling layer: 2x2, stride 2
• First fully connected layer: weight matrix 400x120
• Second fully connected layer: weight matrix 120x84
• Output layer: weight matrix 84x10
Software platforms for CNN
List compiled in April 2016; check online for more recent platforms
Platform: Marvin (marvin.is)
LeNet in Marvin: convolutional layer
LeNet in Marvin: pooling layer
LeNet in Marvin: fully connected layer
Platform: Caffe (caffe.berkeleyvision.org)
LeNet in Caffe
Platform: TensorFlow (tensorflow.org)
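For illustration only, here is a LeNet-5-style network written with the modern tf.keras API (not the 2016-era TensorFlow code shown on the slides); the layer sizes follow the slides, while the activation and pooling choices are common modern defaults rather than the original paper's exact design:

```python
import tensorflow as tf

# LeNet-5-style model; sizes follow the slides (32x32x1 input,
# 5x5 kernels, 2x2 pooling, 400 -> 120 -> 84 -> 10 fully connected part).
model = tf.keras.Sequential([
    tf.keras.Input(shape=(32, 32, 1)),
    tf.keras.layers.Conv2D(6, kernel_size=5, activation="tanh"),   # -> 28x28x6
    tf.keras.layers.AveragePooling2D(pool_size=2, strides=2),      # -> 14x14x6
    tf.keras.layers.Conv2D(16, kernel_size=5, activation="tanh"),  # -> 10x10x16
    tf.keras.layers.AveragePooling2D(pool_size=2, strides=2),      # -> 5x5x16
    tf.keras.layers.Flatten(),                                     # -> 400
    tf.keras.layers.Dense(120, activation="tanh"),
    tf.keras.layers.Dense(84, activation="tanh"),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="sgd", loss="sparse_categorical_crossentropy")
model.summary()
```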
Others
• Theano – CPU/GPU symbolic expression compiler in Python (from the MILA lab at the University of Montreal)
• Torch – provides a Matlab-like environment for state-of-the-art machine learning algorithms, in Lua
• Lasagne – a lightweight library for building and training neural networks in Theano
• See: http://deeplearning.net/software_links/
Optimization: momentum
Basic algorithms
• Minimize the (regularized) empirical loss
  $L_R(\theta) = \frac{1}{n} \sum_{t=1}^{n} l(\theta, x_t, y_t) + R(\theta)$
  where the hypothesis is parametrized by $\theta$
• Gradient descent
  $\theta_{t+1} = \theta_t - \eta_t \nabla L_R(\theta_t)$
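A minimal sketch of this update (an illustration; grad_LR is an assumed user-supplied function returning $\nabla L_R(\theta)$):

```python
import numpy as np

def gradient_descent(grad_LR, theta0, lr=0.1, num_steps=100):
    """theta_{t+1} = theta_t - eta * grad L_R(theta_t), with a constant step size eta."""
    theta = np.asarray(theta0, dtype=float)
    for t in range(num_steps):
        theta = theta - lr * grad_LR(theta)
    return theta

# Example: minimize L_R(theta) = ||theta||^2 / 2, whose gradient is theta itself.
print(gradient_descent(lambda th: th, theta0=[1.0, -2.0]))
```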
Mini-batch stochastic gradient descent
• Instead of one data point, work with a small batch of $b$ points
  $(x_{tb+1}, y_{tb+1}), \ldots, (x_{tb+b}, y_{tb+b})$
• Update rule
  $\theta_{t+1} = \theta_t - \eta_t \nabla\left[\frac{1}{b}\sum_{1 \le i \le b} l(\theta_t, x_{tb+i}, y_{tb+i}) + R(\theta_t)\right]$
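A sketch of one epoch of mini-batch SGD implementing the update above (grad_loss and grad_R are assumed helper functions returning the gradients of $l$ and $R$):

```python
import numpy as np

def minibatch_sgd(grad_loss, grad_R, theta0, X, Y, batch_size=32, lr=0.01):
    """One epoch of mini-batch SGD:
    theta <- theta - eta * ( (1/b) * sum_i grad l(theta, x_i, y_i) + grad R(theta) )"""
    theta = np.asarray(theta0, dtype=float)
    n = len(X)
    for start in range(0, n, batch_size):
        xb, yb = X[start:start + batch_size], Y[start:start + batch_size]
        g = np.mean([grad_loss(theta, x, y) for x, y in zip(xb, yb)], axis=0)
        theta = theta - lr * (g + grad_R(theta))
    return theta
```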
Momentum
• Drawback of SGD: progress can be slow when the gradient is small
• Observation: when the gradient is consistent across consecutive steps, we can take larger steps
• Metaphor: a marble rolling down a gentle slope gathers speed
Momentum
(Figure from Deep Learning, by Goodfellow, Bengio, and Courville: contours show the loss function, the path shows SGD with momentum, and the arrows show the stochastic gradients.)
Momentum
• Work with a small batch of $b$ points $(x_{tb+1}, y_{tb+1}), \ldots, (x_{tb+b}, y_{tb+b})$
• Keep a momentum variable $v_t$ and set a decay rate $\alpha$
• Update rule
  $v_t = \alpha v_{t-1} - \eta_t \nabla\left[\frac{1}{b}\sum_{1 \le i \le b} l(\theta_t, x_{tb+i}, y_{tb+i}) + R(\theta_t)\right]$
  $\theta_{t+1} = \theta_t + v_t$
• Practical guide: $\alpha$ is set to 0.5 until the initial learning stabilizes, and is then increased to 0.9 or higher.
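A sketch of SGD with momentum, including the practical $\alpha$ schedule just mentioned (grad_batch is an assumed function returning the mini-batch gradient of the regularized objective at step t):

```python
import numpy as np

def sgd_momentum(grad_batch, theta0, lr=0.01, alpha0=0.5, alpha_final=0.9,
                 warmup_steps=100, num_steps=1000):
    """v_t = alpha * v_{t-1} - eta * (mini-batch gradient at theta_t)
       theta_{t+1} = theta_t + v_t
    alpha starts at 0.5 and is raised to 0.9 after a warm-up period."""
    theta = np.asarray(theta0, dtype=float)
    v = np.zeros_like(theta)
    for t in range(num_steps):
        alpha = alpha0 if t < warmup_steps else alpha_final
        v = alpha * v - lr * grad_batch(theta, t)   # grad_batch: assumed helper
        theta = theta + v
    return theta
```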