0% found this document useful (0 votes)

29 views36 pages

Deep Learning - Image Synthesis

Applying artificial intelligence in image synthesis

Uploaded by

pvgopika333

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views36 pages

Deep Learning - Image Synthesis

Applying artificial intelligence in image synthesis

Uploaded by

pvgopika333

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

JURY ASSIGNMENT

SUBMITTED BY :
GOPIKA P V
(BFT/22/1080)
MEERA K L
(BFT/22/202)
FASHION IMAGE SYNTHESIS
INTRODUCTION
o Fashion is a major global business with designers playing a crucial
role in creating new styles.
o AI and machine learning have enhanced various industries but
fashion has seen less exploration in data analytics.
o AI has potential in fashion for tasks like classification, forecasting,
and recommendation systems.
o Generative Modelling and GANs, Comparing GAN Models
o The innovate fashion design by generating new images and
compares two advanced GAN models for this purpose.
INDUSTRY
IMAGE SYNTHESIS
UNDERSTANDING METHODOLOGY
DATA UNDERSTANDING

EVALUATION DATA

DATA PREPARATION

DATA MODELING
1. UNDERSTANDING THE INDUSTRY

• About studying about the fashion industry generating various images

related to design coping up with the standards of the current market
trend is the first and foremost step
RESEARCH OBJECTIVES
• Creating new images of fashion items can assist fashion designers, acting
as virtual assistants in the industry.
2. Data Understanding
• This stage begins with data acquisition and then understanding the data and
then finding similarities
• The dataset contains 70,000 greyscale images of 28x28 pixels consisting of 10
classes Ankle Boot, Bag, Coat, T-shirt/Top, Dress, Shirt, Trousers, Sandal, Pullover,
Sneaker

• Each row consists of an image in the dataset and the size of images are 28x28
pixels

• Each pixels represents the light and dark pixels value of an image where in high
numbers represents darker shades.

• This pixel values ranges from 0-255 where 0 means white and 255 means black.
• The dataset has total 785 columns where 784 columns are 8 from 28x28 pixels
consisting each cell one pixel value and there is one column for class label at
start of each row.
3. Data Preparation

• Data acquisition is the most essential stage

• The dataset should be finalized after performing relevant study in

domain as if dataset is not suitable for the project

• The pre-processing of the data

• Reading the Dataset

S • Checking on dataset shape

T
E • Analysis of train and test data
P
• Normalization of data and Reshaping
S
• Data acquisition is the most important
step

• Data should be finalized after studying 3.1 Reading the Dataset

• The pre processing of data is done in

python using Tensor flow ( it is an open
source platform for machine learning)
• The dataset shape can be defined by
shape of an image as each cell
consists of a pixel value. 3.2 CHECKING ON
DATASET SHAPE
• The total number of rows represents
the number of image

• Total number of pixel values

represents number of columns
• After analyzing the dataset there are
6000 images per category in the training 3.3 ANALYSIS OF
set and 10000 images per category in the TRAIN AND TEST
testing set. DATA

TRAIN DATA TEST DATA

• Pixel Value Range: pixel values in the images range
from 0 to 1

• Normalization: To optimize model performance and 3.5 Normalization

reduce training time, the pixel values are normalized
to a range of 0 to 1. This is achieved by dividing each of data and
pixel value by 255. Reshaping

• Reshaping: After normalization, the data is reshaped

to fit the input requirements of the model. The input
shape for both DCGAN and Caps GAN is set to
28x28x1, where 1 represents the number of channels.
• Data Shape: After pre-processing, the shape of both training and
testing data is adjusted accordingly.

• Readiness for Model Input: Following exploration and pre-processing,

the dataset is now prepared to be fed into the model as the expected
input for training.

Before normalization

Training data Testing data

4. DATA MODELING

• GAN Models Implemented Two advanced GAN models.

• Capsule Network based Generative Adversarial Network (Caps GAN)

• Deep Convolutional Generative Adversarial Network (DCGAN), are

applied to the prepared dataset.
• Introduced by Ian Goodfellow
• Consist of two neural networks a
generator and a discriminator.
GANS
• The generator creates synthetic
data
• The discriminator evaluates the GENERATIVE
authenticity
• The generated data against real
ADVERSARIAL
data. NETWORKS
• The generator improves its ability
to create realistic data over time.
The basic GAN consists of two neural networks

• Generator and Discriminator

• The Generator network starts with random noise and creates data, like
images.
• Its goal is to make this generated data look as real as possible
• The Discriminator network takes both real data and generated data and
tries to tell them apart.
• It outputs a probability indicating whether the data is real or fake.
• The Generator uses noise to create new sample images.
• The Discriminator's job is to distinguish between real and fake images with
a binary output.
• Both the Generator and Discriminator use Convolutional Neural Networks
(CNNs).
• Noise is just a random data sample used to start the generation process.
CONVOLUTIONAL
• It is a class of deep learning
• CNN is heavily used in computer vision NEURAL NETWORKS
• It is similar to the basic neural network
• CNN also have learnable parameter like neural
network
• Convolutional neural network (ConvNet’s or CNNs) is
one of the main categories to do
IMAGE RECOGNITION IMAGE CLASSIFICATION OBJECT DETECTION
3 BASIC COMPONENTS TO DEFINE CNN

• The Convolution Layer

• The Pooling Layer

• The Output Layer or Fully

connected layer
Working of CNN

• The CNN model has 2 convolutional layer and pooling layers followed
by 2 fully connected layers.

• Batch normalization is applied in the 2nd and 3rd layers with Leaky
ReLU activation for all layers and a sigmoid function in the final layer.

HYPERPARAMETER K
Number of epochs applied to the model
• Batch normalization = Batch Norm is a technique that normalizes
data between neural network layers, using mini-batches instead of
the full dataset. It speeds up training.

mz is the neuron’s output

Sz is the standard deviation of the neuron’s output

Leak ReLu

• Leaky ReLu Activation Function (Rectified Linear Unit)- Instead of

defining the ReLU activation function as 0 for negative values of
inputs(x), it defines as an extremely small linear component of x.

• This function returns x if it receives any positive input but for any
negative value of x, it returns a really small value which is 0.01
times x. Thus it gives an output for negative values.
Sigmoid Function
• Sigmoid Function – It is used as a neural network activation function.
• When the activation function for a neuron is a sigmoid function it is a
guarantee that the output of this unit will always be between 0 and 1.

EQUATION
Deep Convolutional Generative
Adversarial Network DCGAN
• It’s a type of Generative Adversarial Network that
use a deep convolutional neural networks to
generate high quality images.

BENEFITS
APPLICATIONS
• High quality image generation
• Image synthesis
• Improved training stability
• Super resolution
Working Method
• Use batch normalization.
• Apply Leaky ReLU activation function.
• Use convolutional layers instead of pooling layers.
• The discriminator is a CNN with 2 convolutional layers, 2 fully
connected layers, batch normalization, Leaky ReLU and sigmoid
activation.
• The generator also has 2 convolutional layers.
• Batch normalization is used in each generator layer except the last
one.
• ReLU activation is used in the first 3 generator layers and sigmoid
activation in the last layer.
Capsule Network based Generative
Adversarial Network
Caps GAN

• Caps GAN integrates two concepts to enhance generative models'

ability to understand and reproduce complex structures and
hierarchies in data.

BENEFITS APPLICATIONS
• Improved Data Generation • Image Synthesis
• Dynamic Routing • Medical Imaging
CAPSULE NETWORK BASED GENERATIVE ADVERSARIAL
NETWORK
• The generator networks are the same.
• The discriminator network is a Capsule network instead of CNN.
• The first convolutional layer has a kernel size 3x3, 1 stride, and 256
filters.
• It consists of two Capsule net layers Primary-Caps and Digit-Caps
along with Leaky ReLU activation, batch normalization, flattening and
a Dense layer in Keras.
• Then it ends with a sigmoid function.
• Digit Capsule Layer outputs 16D vectors containing object
instantiation parameters.
Conv2d
Primary caps layer (Conv2d
Input layers Leaky ReLU
Reshape Squash)
Batch Normalization

Digit caps layer (Dense

Sigmoid function multiply Leaky ReLU Flatten function
Activation)
• Primary Caps layer – lowest capsule layer
• Flatten function – Acts as a bridge between layers.
• Generator: The generator of Caps GAN and the DCGAN are same
using deconvolutional neural network.
• Keras Dense Layer - The dense layer is a simple Layer of neurons in
which each neuron receives input from all the neurons of the
previous layer called as dense.
• The dense layer is used to classify images based on output from
convolutional layers.
5.EVALUATION
Qualitative Evaluation
Qualitative evaluation has been conducted on basis of visual analysis of
generated images of GAN
FUTURE
SCOPE

• New and advanced GANs are emerging regularly.

• Further studies in the fashion industry can improve
results.
• Future research could use complex datasets, such
as models in shops, fashion shows, and events.
• GANs can also be applied to smaller datasets for
research.
Choosing proper data

Not enough datasets in specific fields

CHALLENGES
Neural networks take much more time
for the execution

Requires high computation power to run

References
• https://norma.ncirl.ie/4399/1/karanjain.pdf/
• https://www.researchgate.net/
publication/373205261_AI_Assisted_Fashion_Design_A_Review/
• https://www.baeldung.com/cs/batch-normalization-cnn/
• https://builtin.com/machine-learning/sigmoid-activation-function/
• https://www.mygreatlearning.com/blog/relu-activation-function//
• https://blog.paperspace.com/capsule-networks/
• https://www.geeksforgeeks.org/generative-adversarial-network-gan/
/
THANK YOU

5-Convolutional Neural Network
No ratings yet
5-Convolutional Neural Network
43 pages
Slides 1
No ratings yet
Slides 1
50 pages
Classify Webcam Images Using Deep Learning
No ratings yet
Classify Webcam Images Using Deep Learning
17 pages
Introduction To GANs
No ratings yet
Introduction To GANs
10 pages
Batch 16
No ratings yet
Batch 16
24 pages
CC511 Week 7 - Deep - Learning
No ratings yet
CC511 Week 7 - Deep - Learning
33 pages
Group B Deep Learning Assignment No: 3B: Categories
No ratings yet
Group B Deep Learning Assignment No: 3B: Categories
13 pages
Assignment-6 STC-DL
No ratings yet
Assignment-6 STC-DL
17 pages
AI Slide 2
No ratings yet
AI Slide 2
82 pages
Harsha Thesis
No ratings yet
Harsha Thesis
62 pages
Unit 3
No ratings yet
Unit 3
14 pages
Module 5
No ratings yet
Module 5
20 pages
Project Report
No ratings yet
Project Report
30 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
Convolutional Neural Networks: Computer Vision
No ratings yet
Convolutional Neural Networks: Computer Vision
14 pages
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
No ratings yet
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
17 pages
Artificial Intelligence Convolution Neural Networks
No ratings yet
Artificial Intelligence Convolution Neural Networks
77 pages
Unit 2
No ratings yet
Unit 2
28 pages
Ch10 Deep Learning
No ratings yet
Ch10 Deep Learning
104 pages
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
No ratings yet
Convolutional Neural Networks: CS 535 Deep Learning, Winter 2020 Fuxin Li
44 pages
An Introduction To Convolutional Neural Networks
No ratings yet
An Introduction To Convolutional Neural Networks
11 pages
PDL 05-Merged
No ratings yet
PDL 05-Merged
8 pages
Lecture 3
No ratings yet
Lecture 3
48 pages
Mini Project Final Report
No ratings yet
Mini Project Final Report
30 pages
Generative AI Fundamentals GANs QB 14 Aug v1.0
No ratings yet
Generative AI Fundamentals GANs QB 14 Aug v1.0
24 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
Week 6
No ratings yet
Week 6
8 pages
Deep Learning (22CS63) : Module-3
No ratings yet
Deep Learning (22CS63) : Module-3
58 pages
DNN Architectures
No ratings yet
DNN Architectures
12 pages
Aai 2
No ratings yet
Aai 2
83 pages
Anime Face Generation Using DC-GANs
No ratings yet
Anime Face Generation Using DC-GANs
6 pages
Module 05 CNN Arctitecture
No ratings yet
Module 05 CNN Arctitecture
7 pages
Al3502 - DLV Unit 3
No ratings yet
Al3502 - DLV Unit 3
11 pages
Unit IV Deep Leraning
No ratings yet
Unit IV Deep Leraning
35 pages
Chapter 5 Deep Learning
No ratings yet
Chapter 5 Deep Learning
35 pages
Cours 8 B
No ratings yet
Cours 8 B
39 pages
Convolutional Nets
No ratings yet
Convolutional Nets
41 pages
Gen AI 10-1
No ratings yet
Gen AI 10-1
60 pages
AlexNet and Other Pretrained Models - Presentation
No ratings yet
AlexNet and Other Pretrained Models - Presentation
182 pages
CNN, RNN
No ratings yet
CNN, RNN
60 pages
Introduction To Deep Learning
No ratings yet
Introduction To Deep Learning
47 pages
Advanced Design For AI Algorithms: Lec.: 1 GAN
No ratings yet
Advanced Design For AI Algorithms: Lec.: 1 GAN
223 pages
Images and Convolutional Neural Networks: Practical Deep Learning
No ratings yet
Images and Convolutional Neural Networks: Practical Deep Learning
34 pages
Unit 4 (CNN and SOM)
No ratings yet
Unit 4 (CNN and SOM)
15 pages
Deep Learning and Neural Networks Guide
No ratings yet
Deep Learning and Neural Networks Guide
84 pages
NN Jaguar Lava 122
No ratings yet
NN Jaguar Lava 122
10 pages
Co2 CNN 3
No ratings yet
Co2 CNN 3
31 pages
Module 6.2 GAN
No ratings yet
Module 6.2 GAN
29 pages
Max78000 Article Series Part 1
No ratings yet
Max78000 Article Series Part 1
4 pages
Anime Faces via DC-GANs
No ratings yet
Anime Faces via DC-GANs
6 pages
Module 5
No ratings yet
Module 5
72 pages
AE556 2024 Topic4 CNN
No ratings yet
AE556 2024 Topic4 CNN
26 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
15 pages
Presentation FYP
No ratings yet
Presentation FYP
18 pages
Genai Week5
No ratings yet
Genai Week5
33 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
Unit 4
No ratings yet
Unit 4
51 pages
CNN Image Classification Guide
No ratings yet
CNN Image Classification Guide
20 pages
Fundamentals of Artificial Neural Networks
No ratings yet
Fundamentals of Artificial Neural Networks
7 pages
Perceptron Letter Case Classification
No ratings yet
Perceptron Letter Case Classification
6 pages
12 AI Unit 6 Understanding Neural Networks
No ratings yet
12 AI Unit 6 Understanding Neural Networks
21 pages
EC360 Soft Computing S5-EC-Syllabus
No ratings yet
EC360 Soft Computing S5-EC-Syllabus
2 pages
Artificial Neural Networks An Artificial Neuron: X W X W S X W W y
No ratings yet
Artificial Neural Networks An Artificial Neuron: X W X W S X W W y
7 pages
Cabs Availability Prediction Using Deep Learning: Project Member
No ratings yet
Cabs Availability Prediction Using Deep Learning: Project Member
58 pages
Backpropagation With Example
No ratings yet
Backpropagation With Example
42 pages
Lecture 1-Unit 3.3
No ratings yet
Lecture 1-Unit 3.3
3 pages
Transformers and Attention Mechanisms - Post Quiz - Attempt Review
No ratings yet
Transformers and Attention Mechanisms - Post Quiz - Attempt Review
5 pages
ActivationFun Survey Arxiv
No ratings yet
ActivationFun Survey Arxiv
49 pages
AI Course Experiments Certificate
No ratings yet
AI Course Experiments Certificate
69 pages
Biological Neuron Artificial Neuron
No ratings yet
Biological Neuron Artificial Neuron
18 pages
Neural Networks & Fuzzy Logic Course
No ratings yet
Neural Networks & Fuzzy Logic Course
1 page
CS60010 - Deep - NN - PPTX (1) 2
No ratings yet
CS60010 - Deep - NN - PPTX (1) 2
50 pages
Deep Learning Lab: Translated MLP - CNN
No ratings yet
Deep Learning Lab: Translated MLP - CNN
19 pages
Generative Adversarial Network Architecture and Applications
No ratings yet
Generative Adversarial Network Architecture and Applications
41 pages
Unit Iv (CNN)
No ratings yet
Unit Iv (CNN)
8 pages
DL4CV Seq Att
No ratings yet
DL4CV Seq Att
63 pages
Notes For Electrical 2nd Year
No ratings yet
Notes For Electrical 2nd Year
4 pages
AI Future Book
No ratings yet
AI Future Book
2 pages
Lecture 11 - Supervised Learning - Hopfield Networks - (Part 4)
No ratings yet
Lecture 11 - Supervised Learning - Hopfield Networks - (Part 4)
5 pages
Lesson 1 - Course - Introduction
No ratings yet
Lesson 1 - Course - Introduction
9 pages
Deep Learning Course File
No ratings yet
Deep Learning Course File
56 pages
Computer Vision Exam
No ratings yet
Computer Vision Exam
7 pages
Generative AI Unit 3 Notes
No ratings yet
Generative AI Unit 3 Notes
8 pages
GPT4 Architecture
No ratings yet
GPT4 Architecture
2 pages
Module 1 DL
No ratings yet
Module 1 DL
84 pages
Steps To Choose Data Science Career Path
No ratings yet
Steps To Choose Data Science Career Path
1 page
Notes-2-Supervised-Hebb Learning
No ratings yet
Notes-2-Supervised-Hebb Learning
2 pages
Tensorflow
No ratings yet
Tensorflow
9 pages