Assignment 3

Deep Learning Assignment based on Image classification on CIFAR-10 dataset using CNN and also sentiment classification using RNN

Uploaded by

Saurabh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

96 views5 pages

Assignment 3

Deep Learning Assignment based on Image classification on CIFAR-10 dataset using CNN and also sentiment classification using RNN

Uploaded by

Saurabh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Deep Learning: Assignment 3

Implementation of CNN, RNN, LSTM, and GRU

This assignment involves the following tasks:

• Image classification using CNN

• Sentiment Analysis using RNN, LSTM, and GRU

Submit the executed code in Jupyter notebook. You can write your observations and results using the
heading and markdown cells in Jupyter. If you have memory or GPU constraints, you can use Google
colab or Kaggle. The links to the libraries and resources required are added in Moodle. If the training
takes lot of time, then train in intervals by saving and restoring the models from checkpoints. You can
learn about saving and restoring the models from the documentation of Tensorflow or Keras.

1. Image Classification using CNN:

The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per
class. There are 50000 training images and 10000 test images. The task is to construct a CNN for
CIFAR10 classification.

(a) Fetch the training and test datasets for CIFAR10 using the built in functions in Tensorflow or
Keras. Follow the links in Moodle, to know how to download the datasets.
(b) Create a validation set of 10000 images from the training set.
(c) Train the following CNN architectures as follows:
(d) Model1:
• The network comprises of 2 convolutional layers and 2 max pooling layers and 1 fully
connected hidden layer.
• Number of kernels in the first convlayer =32 (5x5 filters)
• Number of kernels in the 2nd conv layer =64 (5x5 filters)
• Max pooling of size 2x2
• Non-Linearity used in all hidden layers is ReLU
• The output layer is softmax
• Number of units in the fully connected hidden layer1 = 64
• Number of units in the output layer =10
The network architecture would be as given below:
Input → conv1 (32 filters (5x5)) → Maxpool (2x2) → conv2 (64 filters (5x5)) → Maxpool
(2x2) → Flatten → FCL1(64) → Softmax output layer(10)
(e) Model2:
• The network comprises of 2 convolutional layers and 2 max pooling layers and 1 fully
connected hidden layer.

1
• Number of kernels in the first convlayer =32 (5x5 filters)
• Number of kernels in the 2nd conv layer =64 (5x5 filters)
• Max pooling of size 3x3
• Non-Linearity used in all hidden layers is ReLU
• The output layer is softmax
• Number of units in the fully connected hidden layer1 = 64
• Number of units in the output layer =10
The network architecture would be as given below:
Input → conv1 (32 filters (5x5)) → Maxpool (3x3) → conv2 (64 filters (5x5)) → Maxpool
(3x3) → Flatten → FCL1(64) → Softmax output layer(10)
(f) Model3:
• The network comprises of 2 convolutional layers and 2 max pooling layers and 1 fully
connected hidden layer.
• Number of filter kernels in the first convlayer =32 (3x3 filters)
• Number of filter kernel in the 2nd conv layer =64 (3x3 filters)
• Max pooling of size 2x2
• Non-Linearity used in all hidden layers is ReLU
• The output layer is softmax
• Number of units in the fully connected hidden layer1 = 64
• Number of units in the output layer =10
The network architecture would be as given below:
Input → conv1 (32 filters (3x3)) → Maxpool (2x2) → conv2 (64 filters (3x3)) → Maxpool
(2x2) → Flatten → FCL1(64) → Softmax output layer(10)
(g) Model4:
• The network comprises of 3 convolutional layers and 2 max pooling layers and 1 fully
connected hidden layer.
• Number of kernels in the first convlayer =32 (3x3 filters)
• Number of kernels in the 2nd conv layer =64 (3x3 filters)
• Number of kernels in the 3rd conv layer =128 (3x3 filters)
• Max pooling of size 2x2
• Non-Linearity used in all hidden layers is ReLU
• The output layer is softmax
• Number of units in the fully connected hidden layer1 = 64
• Number of units in the output layer =10
The network architecture would be as given below:
Input → conv1 (32 filters (3x3)) → Maxpool (2x2) → conv2 (64 filters (3x3)) → Maxpool
(2x2) → conv3 (128 filters (3x3)) → Maxpool (2x2) → Flatten → FCL1(64) → Softmax
output layer(10)
(h) Which the best model among model1 to model4? Why?
(i) Model5:
• Choose the architecture of best model among model1, model2, model3, and model4. Add
strides of 2 to all the convolution layers. If it is not possible to add strides due to negative
dimension, choose the next best model.

2
(j) Model6:
• Choose the architecture of best model of model1, model2,model3, and model4. Add “SAME”
padding to all the convolution layers.
(k) Model7:
• Choose the architecture of best model of model1, model2,model3, and model4. Add “SAME”
padding to all the convolution layers with stride2. If it is not possible to add strides due to
negative dimension, choose the next best model.
(l) Which the best model among model1 to model7? Why?
(m) Model8:
• Choose the architecture of the best model among model1 to model7. Use tanh activation
function in the hidden layers instead of ReLU
(n) Model 9:
• Choose the architecture of the best model among model1 to model7. Use sigmoid activation
function in the hidden layers instead of ReLU
(o) For each model trained, plot the loss vs iteration curve and accuracy vs iteration curve for
training data.
(p) Tabulate the following for model 1 to model 9. You can create the table by hand and then
upload the screenshot of the table in the jupyter notebook.
• Total number of trainable parameters.
• Training time
• Training accuracy
• Validation accuracy
• Test accuracy

2. Sentiment analysis on IMDB movie review dataset:

The dataset comprises of 50,000 movie reviews taken from IMDb. The training and test data sets
are uploaded in moodle. The datasets are in the form of text files. Every line in the files is a movie
review. The training set and test set comprises of 25000 movie reviews each. For both training and
test data sets, the first 12500 reviews are positive and the rest of the reviews are negative.

(a) Preprocess the dataset:

Remove punctuation and < br > tag. Convert all the characters to lower case.
(b) Create the target vector for training and test data. The first 12500 values of target vector is 1
and the rest of the values are zero. The target vector is same for training and test data.
(c) Create a validation set which is 20% of the training set.
(d) Train the following models.
(e) Model 1:
• Integer encode the reviews
• To make the length of all the reviews equal, pad the reviews to maximum length of 200.
You can use pad sequences API in Tensorflow or Keras.
• It is possible to learn the vector embedding of words using the embedding layer in Tensorflow
or Keras. Follow the links in Moodle to learn about vector embeddings.
• The network consists of an embedding layer, 1 RNN layer and output layer.

3
• The embedding size of embedding layer = 128
• Number of nodes in RNN layers = 200
• Non-linearity used in all RNN layers is tanh.
• The output layer is sigmoid.
• The network architecture is as follows:
Input → Embeddding Layer (embedding size=128) → RNN1(200 nodes) → Sigmoid output
(f) Model 2:
• Integer encode the reviews
• The network consists of an embedding layer, 1 LSTM layer and output layer.
• The embedding size of embedding layer = 128
• Number of nodes in LSTM layers = 200
• Non-linearity used in all LSTM layers is tanh.
• The output layer is sigmoid.
• The network architecture is as follows:
Input → Embeddding Layer (embedding size=128) → LSTM1(200 nodes) → Sigmoid out-
put
(g) Model 3:
• Integer encode the reviews
• The network consists of an embedding layer, 1 GRU layer and output layer.
• The embedding size of embedding layer = 128
• Number of nodes in GRU layers = 200
• Non-linearity used in all GRU layers is ReLU.
• The output layer is sigmoid.
• The network architecture is as follows:
Input → Embeddding Layer (embedding size=128) → GRU1(200 nodes) → Sigmoid output
(h) Which is the best model among model1 to model3 - RNN, LSTM or GRU? Why?
(i) Model 4:
• Integer encode the reviews
• The network consists of an embedding layer, 2 (GRU/LSTM/RNN based of the best per-
forming model in model1 to model3) layers and output layer.
• All other configurations are same as model1.
(j) Model 5:
• Integer encode the reviews
• The network consists of an embedding layer, 3 (GRU/LSTM/RNN based of the best per-
forming model in model1 to model3) layers and output layer.
• All other configurations are same as model1.
(k) Model 6:
• Use word2vec to create word embeddings of length 128
• Create vector representation of reviews from word embedding
• Choose the best performing model among model1 to model5. Remove the embedding layer
in the best model and give the vector embeddings obtained by word2vec as input.

4
• The network architecture is as follows:
Input(Vector embeddings obtained using word2vec) → RNN/LSTM/GRU layers (the num-
ber of layers and nodes same as the best model among model 1 to model5) → Sigmoid
output layer
(l) Plot the loss vs iteration and accuracy vs iteration curve for training data for all the models.
(m) Tabulate the training, testing and validation accuracy for all the models.

Deep Learning - Question Bank
No ratings yet
Deep Learning - Question Bank
6 pages
CCS355 SET1 Anna University Lab Manual Question Set
100% (1)
CCS355 SET1 Anna University Lab Manual Question Set
3 pages
DL Record
No ratings yet
DL Record
11 pages
CCS355 SET2 Anna University Lab Question Set Neural Network
No ratings yet
CCS355 SET2 Anna University Lab Question Set Neural Network
2 pages
Question Bank AML
No ratings yet
Question Bank AML
2 pages
Exp 6,7,8
No ratings yet
Exp 6,7,8
17 pages
Deepques
No ratings yet
Deepques
12 pages
DL Lab Manual
No ratings yet
DL Lab Manual
18 pages
Computer Science Exam Paper
No ratings yet
Computer Science Exam Paper
14 pages
Deep Learning Cat 2
No ratings yet
Deep Learning Cat 2
14 pages
DL Cie2
No ratings yet
DL Cie2
5 pages
Exercise 8
No ratings yet
Exercise 8
6 pages
2nd Assignment 7AI - DL
No ratings yet
2nd Assignment 7AI - DL
2 pages
Sample C6 Deep Learning and ANN Paper
No ratings yet
Sample C6 Deep Learning and ANN Paper
3 pages
DL Important
No ratings yet
DL Important
13 pages
Assignment 3 2
No ratings yet
Assignment 3 2
2 pages
Unit 3
No ratings yet
Unit 3
1 page
Ad3301 Set1
No ratings yet
Ad3301 Set1
2 pages
Deep Learning Assignment 01
No ratings yet
Deep Learning Assignment 01
5 pages
Ad3511 Set4
No ratings yet
Ad3511 Set4
3 pages
DL Lab Answers Batch 2
No ratings yet
DL Lab Answers Batch 2
27 pages
DL MCQ
No ratings yet
DL MCQ
13 pages
Yarn Own BD'
No ratings yet
Yarn Own BD'
9 pages
QB DL
No ratings yet
QB DL
2 pages
Question Bank
No ratings yet
Question Bank
14 pages
Deep Learning-Question Bank-Module-Wise
75% (4)
Deep Learning-Question Bank-Module-Wise
5 pages
Code Explanation #3
No ratings yet
Code Explanation #3
5 pages
Unit4 Deep Learning
No ratings yet
Unit4 Deep Learning
6 pages
CD-601 Assignmentquestions
No ratings yet
CD-601 Assignmentquestions
2 pages
Sentiment Analysis With An Recurrent Neural Networks
No ratings yet
Sentiment Analysis With An Recurrent Neural Networks
12 pages
CCS355-Neural Networks and Deep Learning - Assignment 1
No ratings yet
CCS355-Neural Networks and Deep Learning - Assignment 1
15 pages
Keras NLP Encoding and Sentiment Analysis
No ratings yet
Keras NLP Encoding and Sentiment Analysis
8 pages
Assignment 5 - NN
No ratings yet
Assignment 5 - NN
4 pages
CCS355-Neural Networks and Deep Learning - Assignment 1
No ratings yet
CCS355-Neural Networks and Deep Learning - Assignment 1
15 pages
NLP Lab Assignment - 05
No ratings yet
NLP Lab Assignment - 05
6 pages
Text Classification - Movie Review - News Wires
No ratings yet
Text Classification - Movie Review - News Wires
5 pages
Question Bank Advanced CO1, CO2
No ratings yet
Question Bank Advanced CO1, CO2
4 pages
BTCS604
No ratings yet
BTCS604
2 pages
DL CO1 and CO2 Answers
No ratings yet
DL CO1 and CO2 Answers
36 pages
Deep Learning Quetion
No ratings yet
Deep Learning Quetion
7 pages
DL - LSTM - 3.ipynb - Colab
No ratings yet
DL - LSTM - 3.ipynb - Colab
3 pages
Notes of Deep Learning Top Architectures
No ratings yet
Notes of Deep Learning Top Architectures
13 pages
Assignment DL 12 03 2025
No ratings yet
Assignment DL 12 03 2025
2 pages
DL 22Q71A4206
No ratings yet
DL 22Q71A4206
65 pages
Assignment 2
No ratings yet
Assignment 2
8 pages
DL2024
No ratings yet
DL2024
4 pages
CISC 867 Deep Learning: 14. Text Classification With Recurrent Neural Networks and Word Embeddings
No ratings yet
CISC 867 Deep Learning: 14. Text Classification With Recurrent Neural Networks and Word Embeddings
28 pages
Neural Networks and Deep Learning Lab
No ratings yet
Neural Networks and Deep Learning Lab
6 pages
AML (Advanced Machine Learning)
No ratings yet
AML (Advanced Machine Learning)
11 pages
Sequence Models - Merged
No ratings yet
Sequence Models - Merged
67 pages
Deep Learning Viva Questions
No ratings yet
Deep Learning Viva Questions
10 pages
RNN LSTM
No ratings yet
RNN LSTM
37 pages
Malware - Detection - Using - Neural - Networks (Main Paper)
No ratings yet
Malware - Detection - Using - Neural - Networks (Main Paper)
51 pages
CST384 Jun 2023
No ratings yet
CST384 Jun 2023
2 pages
Deep Learning Key Questions Guide
No ratings yet
Deep Learning Key Questions Guide
2 pages
Genai See
No ratings yet
Genai See
51 pages
Lecture Notes 6
No ratings yet
Lecture Notes 6
5 pages
QuestionBank C# and
No ratings yet
QuestionBank C# and
3 pages
CS663-2024-Executive NLP - Assignment Sentiment Analysis
No ratings yet
CS663-2024-Executive NLP - Assignment Sentiment Analysis
4 pages
Philips Ultrasound, Inc.: Document Number
No ratings yet
Philips Ultrasound, Inc.: Document Number
35 pages
C# Unit 1
No ratings yet
C# Unit 1
54 pages
Student Guide to Blackboard Ultra
No ratings yet
Student Guide to Blackboard Ultra
15 pages
SAP HANA High Availability on Azure
No ratings yet
SAP HANA High Availability on Azure
40 pages
B.Sc Software Engineering Guide
No ratings yet
B.Sc Software Engineering Guide
21 pages
Lista-Precios Tecnomega 06-01-2020
No ratings yet
Lista-Precios Tecnomega 06-01-2020
6 pages
Fabrication of Safety Railing and SS Steps Stand
No ratings yet
Fabrication of Safety Railing and SS Steps Stand
4 pages
Akhya CV
No ratings yet
Akhya CV
2 pages
Extend The MDG Business Partner-Node Extension (Reuse Option)
No ratings yet
Extend The MDG Business Partner-Node Extension (Reuse Option)
66 pages
Acrt
No ratings yet
Acrt
7 pages
E-Governance Initiatives AP
No ratings yet
E-Governance Initiatives AP
1 page
ASP.NET Login & Registration Guide
No ratings yet
ASP.NET Login & Registration Guide
5 pages
Sample-VM Decommission PDF
No ratings yet
Sample-VM Decommission PDF
20 pages
Student Record System Report
No ratings yet
Student Record System Report
47 pages
MiCloud Flex As Deployed in ILand Data Centers-Security Whitepaper V1
No ratings yet
MiCloud Flex As Deployed in ILand Data Centers-Security Whitepaper V1
18 pages
Dental Price Book
No ratings yet
Dental Price Book
17 pages
Among Us Il2cpp Dump
No ratings yet
Among Us Il2cpp Dump
4,319 pages
Information Security Controls Guide
No ratings yet
Information Security Controls Guide
32 pages
PeopleLink Eagle 4K Webcam 2024 2
No ratings yet
PeopleLink Eagle 4K Webcam 2024 2
3 pages
Cesil Jesudas
100% (1)
Cesil Jesudas
6 pages
Xilinx MPSoC for Modulation Classification
No ratings yet
Xilinx MPSoC for Modulation Classification
1 page
ugBASIC - User Manual
No ratings yet
ugBASIC - User Manual
952 pages
How To Approach Fit-to-Standard Analysis / Design: On Premise
0% (1)
How To Approach Fit-to-Standard Analysis / Design: On Premise
57 pages
Beginner's Guide to Compiler Building
No ratings yet
Beginner's Guide to Compiler Building
11 pages
Lesson 12. Survey Monkey and Spreadsheet
No ratings yet
Lesson 12. Survey Monkey and Spreadsheet
5 pages
Cohesion and Coupling
No ratings yet
Cohesion and Coupling
40 pages
Pangya Debug - Wiki (GAME FAQ+INFO)
No ratings yet
Pangya Debug - Wiki (GAME FAQ+INFO)
116 pages
Teapot Graphics Report: CSE 425
No ratings yet
Teapot Graphics Report: CSE 425
3 pages
Activity Diagram Use Case 8 Reset Password: Teacher System
No ratings yet
Activity Diagram Use Case 8 Reset Password: Teacher System
13 pages
Linux Pocket Guide Essential Commands 3rd Edition Daniel J. Barrett Instant Download
No ratings yet
Linux Pocket Guide Essential Commands 3rd Edition Daniel J. Barrett Instant Download
124 pages