DL-basics-of-neural-networks-MNIST-dataset.ipynb
1. Problem Definition
The objective is to classify grayscale images of handwritten digits (0-9) from the MNIST dataset, a collection of 70,000 such images split into 60,000 training samples and 10,000 test samples. Each image is a 28x28 pixel matrix in which every pixel stores a grayscale intensity from 0 to 255. MNIST is widely used as a benchmark in machine learning for image classification.
The goal is to train a neural network to accurately predict the correct digit for each image based on patterns learned during training. This is a supervised learning problem: the input is the image, and the output is the digit label.
Here is an example of how the computer interprets each image from the dataset:
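For instance, printing a block of raw pixel values from one training image shows exactly what the network receives. This is a quick sketch; it loads the dataset the same way the notebook does later.
# Show how the computer "sees" an image: a 28x28 grid of integer intensities (0 = black, 255 = white)
from tensorflow.keras.datasets import mnist

(sample_images, sample_labels), _ = mnist.load_data()
sample_image, sample_label = sample_images[0], sample_labels[0]
print(f"Label: {sample_label}, shape: {sample_image.shape}, dtype: {sample_image.dtype}")
print(sample_image[8:20, 8:20])  # central block of pixel values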
2. Setup
2.1. Libraries
# Libraries
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Flatten
from tensorflow.keras.datasets import mnist
import matplotlib.pyplot as plt
import numpy as np
# Set seed for reproducibility
np.random.seed(42)
tf.random.set_seed(42)
TensorFlow is a popular deep learning framework for building and training neural networks; it is used here for its flexibility, scalability, and ease of integration. For beginners, TensorFlow offers high-level APIs like Keras, which simplify creating and training models. These features make TensorFlow an excellent choice for learning and experimentation.
2.2. Exploratory analysis
Before loading the dataset, let's understand its structure visually. This will help us comprehend
the nature of the data we are working with.
# Temporary loading of MNIST for visualization
(temp_X_train, temp_y_train), _ = mnist.load_data()
# Plot a grid of sample images with labels
plt.figure(figsize=(12, 6))
for i in range(15):
    plt.subplot(3, 5, i + 1)
    plt.imshow(temp_X_train[i], cmap='gray')
    plt.title(f"Label: {temp_y_train[i]}")
    plt.axis('off')
plt.tight_layout()
plt.show()
Next, let's check whether the dataset is balanced (i.e., whether all digits are represented roughly equally).
# Analyze class distribution
unique, counts = np.unique(temp_y_train, return_counts=True)
plt.bar(unique, counts, color='blue', alpha=0.7)
for i, count in enumerate(counts):
    plt.text(unique[i], count, str(count), ha='center', va='bottom')
plt.title('Class Distribution in MNIST Training Set')
plt.xticks(unique, [str(digit) for digit in unique])
plt.xlabel('Digit Labels')
plt.ylabel('Frequency')
plt.show()
3. Load and preprocess the MNIST dataset
# Load the MNIST dataset
(X_train, y_train), (X_test, y_test) = mnist.load_data()
# Normalize the pixel values to the range [0, 1]
X_train = X_train / 255.0
X_test = X_test / 255.0
# Visualize the effects of normalization
# Compare the pixel intensity distribution before and after normalization
plt.figure(figsize=(12, 5))
# Before normalization
plt.subplot(1, 2, 1)
plt.hist(temp_X_train.flatten(), bins=50, color='green', alpha=0.7)
plt.title('Pixel Intensity Before Normalization')
plt.xlabel('Pixel Intensity')
plt.ylabel('Frequency')
# After normalization
plt.subplot(1, 2, 2)
plt.hist(X_train.flatten(), bins=50, color='blue', alpha=0.7)
plt.title('Pixel Intensity After Normalization')
plt.xlabel('Pixel Intensity (Normalized)')
plt.ylabel('Frequency')
plt.tight_layout()
plt.show()
Normalization ensures that the input data has a uniform range, which helps the model converge
faster during training. Neural networks perform better when the data is scaled, as it prevents
larger input values from dominating the learning process.
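A quick check of the value ranges (using the arrays defined above) confirms what the histograms show: dividing by 255 maps the data from [0, 255] to [0, 1] without changing its shape.
# Verify value ranges before and after normalization
print("Before normalization:", temp_X_train.min(), "to", temp_X_train.max())  # 0 to 255
print("After normalization: ", X_train.min(), "to", X_train.max())            # 0.0 to 1.0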
4. Modelling
The model is built using TensorFlow's Sequential API, which allows us to stack layers
sequentially. Each layer processes data in a specific way:
1. Flatten Layer: Converts the 2D input images (28x28 pixels) into a 1D array of 784 features.
This is necessary to feed the data into the Dense layers.
2. Dense Layer (Hidden): A fully connected layer with ReLU activation. ReLU introduces non-linearity, allowing the model to learn complex patterns.
3. Dense Layer (Output): The final layer has 10 neurons, each representing a digit (0-9). It
uses the softmax activation function to produce probabilities for each class, enabling
classification.
# Build neural network model
model = Sequential([
    Flatten(input_shape=(28, 28)),   # Flatten the 2D image into a 1D array of 784 values
    Dense(8, activation='relu'),     # First hidden layer with 8 neurons and ReLU activation
    Dense(10, activation='softmax')  # Output layer with 10 neurons (one for each digit)
])
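To confirm the shapes and parameter counts this architecture implies, print a summary. The counts in the comments are simple hand arithmetic for the layer sizes defined above.
# Inspect layer shapes and parameter counts as a sanity check
model.summary()
# Expected trainable parameters, computed by hand:
#   Hidden layer: 784 inputs x 8 neurons  + 8 biases  = 6,280
#   Output layer:   8 inputs x 10 neurons + 10 biases =    90
#   Total                                              = 6,370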
# Compile the model
model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])
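The 'sparse' variant of categorical cross-entropy is used because mnist.load_data() returns integer labels (0-9). With one-hot encoded labels you would use the non-sparse loss instead; both are equivalent for this problem. Here is a sketch of that alternative, using a separate, purely illustrative model name (onehot_model) so it does not interfere with the training run below.
# Hypothetical alternative: one-hot encode the labels and use the non-sparse loss
onehot_model = Sequential([
    Flatten(input_shape=(28, 28)),
    Dense(8, activation='relu'),
    Dense(10, activation='softmax')
])
y_train_onehot = tf.keras.utils.to_categorical(y_train, num_classes=10)
onehot_model.compile(optimizer='adam',
                     loss='categorical_crossentropy',
                     metrics=['accuracy'])
# onehot_model.fit(X_train, y_train_onehot, epochs=5, validation_split=0.1)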
# Train the model
history = model.fit(X_train, y_train, epochs=5, validation_split=0.1)
# Save the trained model immediately after training
model.save('mnist_model.h5')
print("Model saved successfully after training.")
# Evaluate the model on the test data
# Load the saved model
# This will load the model from the HDF5 file for evaluation or inference.
loaded_model = tf.keras.models.load_model('mnist_model.h5')
print("Model loaded successfully from HDF5 file.")
# Evaluate the loaded model on the test data
test_loss, test_acc = loaded_model.evaluate(X_test, y_test)
print(f"Test accuracy (from loaded model): {test_acc:.2f}")
# Visualize training performance
plt.figure(figsize=(10, 5))
plt.plot(range(1, len(history.history['accuracy']) + 1), history.history['accuracy'], marker='o', label='Training Accuracy')
plt.plot(range(1, len(history.history['val_accuracy']) + 1), history.history['val_accuracy'], marker='o', label='Validation Accuracy')
plt.xticks(range(1, len(history.history['accuracy']) + 1))
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.legend()
plt.title('Training and Validation Accuracy')
plt.show()
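The same history object also records the loss curves ('loss' and 'val_loss'), which complement the accuracy plot; a small sketch:
# Plot training and validation loss from the same history object
plt.figure(figsize=(10, 5))
plt.plot(range(1, len(history.history['loss']) + 1), history.history['loss'], marker='o', label='Training Loss')
plt.plot(range(1, len(history.history['val_loss']) + 1), history.history['val_loss'], marker='o', label='Validation Loss')
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.legend()
plt.title('Training and Validation Loss')
plt.show()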
# Make predictions
predictions = model.predict(X_test)
# Visualize some predictions
plt.figure(figsize=(10, 5))
for i in range(10):
    plt.subplot(2, 5, i + 1)
    plt.imshow(X_test[i], cmap="gray")
    predicted_label = tf.argmax(predictions[i]).numpy()
    true_label = y_test[i]
    plt.title(f"Pred: {predicted_label}, True: {true_label}")
    plt.axis("off")
plt.show()
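A handful of examples only shows a few cases; a confusion matrix over the full test set gives a fuller picture of which digits get confused with which. A minimal sketch using tf.math.confusion_matrix (not part of the original notebook):
# Summarize all test-set predictions in a 10x10 confusion matrix
# (rows = true digit, columns = predicted digit)
predicted_labels = np.argmax(predictions, axis=1)
confusion = tf.math.confusion_matrix(y_test, predicted_labels, num_classes=10)
print(confusion.numpy())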
5. Experimentation
Run experiments with the model architecture and training process. Here are some suggestions:
- Add more hidden layers: Try increasing the depth of the network by stacking additional Dense layers.
- Change the number of neurons: Modify the number of neurons in each Dense layer to see how it affects performance.
- Use different activation functions: Experiment with alternatives like 'sigmoid', 'tanh', or 'LeakyReLU'.
- Try other optimization algorithms: Replace 'adam' with optimizers like 'sgd', 'rmsprop', or 'nadam'.
- Alter the number of epochs: Train the model for more or fewer epochs and observe overfitting or underfitting.
- Use dropout: Introduce dropout layers to prevent overfitting and enhance generalization.
- Modify the learning rate: Adjust the optimizer's learning rate to see how it influences convergence.
Document your changes and note whether and how each modification affects the training, validation, and test accuracy. One possible starting point is sketched below.
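Here is one possible variant combining several of these suggestions: a second hidden layer, a Dropout layer, and an explicit learning rate for Adam. The layer sizes, dropout rate, and learning rate are arbitrary starting values, not tuned results.
# One possible experiment: a deeper network with dropout and an explicit learning rate
from tensorflow.keras.layers import Dropout
from tensorflow.keras.optimizers import Adam

experimental_model = Sequential([
    Flatten(input_shape=(28, 28)),
    Dense(128, activation='relu'),
    Dropout(0.2),                    # randomly drops 20% of activations during training
    Dense(64, activation='relu'),
    Dense(10, activation='softmax')
])

experimental_model.compile(optimizer=Adam(learning_rate=1e-3),
                           loss='sparse_categorical_crossentropy',
                           metrics=['accuracy'])

experimental_history = experimental_model.fit(X_train, y_train, epochs=10, validation_split=0.1)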
6. Conclusion
Write your conclusions here