Google AIML
Consider the following problem: You're building a system that performs activity recognition for
fitness tracking. You might have access to the speed at which a person is walking and attempt to
infer their activity based on that speed using a conditional.
if(speed<4){
  status=WALKING;
}
You could extend that with an else to distinguish running:
if(speed<4){
  status=WALKING;
} else {
  status=RUNNING;
}
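In the full example, you'd keep extending the conditional with more speed thresholds, such as one for biking (the cutoff values here are illustrative):
if(speed<4){
  status=WALKING;
} else if(speed<12){
  status=RUNNING;
} else {
  status=BIKING;
}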
Now, consider what happens when you want to include an activity, like golf. It's less obvious how
to create a rule to determine the activity.
// Now what?
It's extremely difficult to write a program that will recognize the golfing activity, so what do you
do? You can use ML to solve the problem!
Prerequisites
Before attempting this codelab, you'll want a working Python environment; the easiest option is to run the code in Colab. If you're using a different IDE, make sure you have Python installed. You'll also need TensorFlow and the NumPy library. You can learn more about and install TensorFlow here. Install NumPy here.
2. What is ML?
Consider the traditional manner of building apps, as represented in the following diagram:
You express rules in a programming language. They act on data, and your program provides answers.
In the case of the activity detection, the rules (the code you wrote to define activity types) acted upon the
data (the person's movement speed) to produce an answer: the return value from the function for
determining the activity status of the user (whether they were walking, running, biking, or doing something
else).
The process for detecting that activity status via ML is very similar, only the axes are different.
Instead of trying to define the rules and express them in a programming language, you provide the answers
(typically called labels) along with the data, and the machine infers the rules that determine the
relationship between the answers and data. For example, your activity detection scenario might look like
this in an ML context:
You gather lots of data and label it to effectively say, "This is what walking looks like," or "This is what
running looks like." Then, the computer can infer the rules that determine, from the data, what the distinct
patterns that denote a particular activity are.
Beyond being an alternative method to programming that scenario, that approach also gives you the ability
to open new scenarios, such as the golfing one that may not have been possible under the rules-based
traditional programming approach.
In traditional programming, your code compiles into a binary that is typically called a program. In ML, the item that you create from the data and labels is called a model. At runtime, you use the model like this:
You pass the model some data and the model uses the rules that it inferred from the training to make a
prediction, such as, "That data looks like walking," or "That data looks like biking."
Consider the following sets of numbers. Can you see the relationship between them?
X: -1 0 1 2 3 4
Y: -2 1 4 7 10 13
As you look at them, you might notice that the value of X is increasing by 1 as you read left to right and
the corresponding value of Y is increasing by 3. You probably think that Y equals 3X plus or minus
something. Then, you'd probably look at the 0 on X and see that Y is 1, and you'd come up with the
relationship Y=3X+1.
That's almost exactly how you would use code to train a model to spot the patterns in the data!
How would you train a neural network to do the equivalent task? Using data! By feeding it with a set of X's
and a set of Y's, it should be able to figure out the relationship between them.
Imports
> Note: If you're not using Colab and have your own Python environment set up with TensorFlow installed
and ready to use, then create a new Python file before continuing.
Start with your imports. Here, you're importing TensorFlow and calling it tf for ease of use.
Next, import a library called numpy, which represents your data as lists easily and quickly.
The framework for defining a neural network as a set of sequential layers is called keras, so import that,
too.
import tensorflow as tf
import numpy as np
from tensorflow import keras
Next, create the simplest possible neural network. It has one layer, that layer has one neuron, and the input
shape to it is only one value.
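In Keras, that model can be written like this:
model = tf.keras.Sequential([keras.layers.Dense(units=1, input_shape=[1])])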
Next, write the code to compile your neural network. When you do so, you need to specify two functions—
a loss and an optimizer.
In this example, you know that the relationship between the numbers is Y=3X+1.
When the computer is trying to learn that, it makes a guess, maybe Y=10X+10. The loss function measures
the guessed answers against the known correct answers and measures how well or badly it did.
Next, the model uses the optimizer function to make another guess. Based on the loss function's result, it
tries to minimize the loss. At this point, maybe it will come up with something like Y=5X+5. While that's
still pretty bad, it's closer to the correct result (the loss is lower).
The model repeats that for the number of epochs, which you'll see shortly.
First, here's how to tell it to use mean_squared_error for the loss and stochastic gradient descent (sgd) for
the optimizer. You don't need to understand the math for those yet, but you can see that they work!
Over time, you'll learn the different and appropriate loss and optimizer functions for different scenarios.
model.compile(optimizer='sgd', loss='mean_squared_error')
Next, feed in some data. In this case, you take the six X and six Y values from earlier. You can see that the relationship between those is Y=3X+1, so where X is -1, Y is -2.
A Python library called NumPy provides lots of array-type data structures that make this easy. Specify the values as arrays with np.array():
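Using the values from the table earlier:
xs = np.array([-1.0, 0.0, 1.0, 2.0, 3.0, 4.0], dtype=float)
ys = np.array([-2.0, 1.0, 4.0, 7.0, 10.0, 13.0], dtype=float)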
Now you have all the code you need to define the neural network. The next step is to train it to see if it can
infer the patterns between those numbers and use them to create a model.
The process of training the neural network, where it learns the relationship between the X's and Y's, is in the model.fit call. That's where it goes through a loop of making a guess, measuring how good or bad it is (the loss), and then using the optimizer to make another guess. It does that for the number of epochs that you specify. When you run that code, you'll see the loss printed out for each epoch.
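Here's that training call, using the 500 epochs discussed below:
model.fit(xs, ys, epochs=500)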
For example, you can see that for the first few epochs, the loss value is quite large, but it's getting smaller
with each step.
You probably don't need all 500 epochs and can experiment with different amounts. As you can see from
the example, the loss is really small after only 50 epochs, so that might be enough!
You have a model that has been trained to learn the relationship between X and Y. You can use
the model.predict method to have it figure out the Y for a previously unknown X. For example, if X is 10,
what do you think Y will be? Take a guess before you run the following code:
print(model.predict([10.0]))
You might have thought 31, but it ended up being a little over. Why do you think that is?
Neural networks deal with probabilities, so it calculated that there is a very high probability that the
relationship between X and Y is Y=3X+1, but it can't know for sure with only six data points. The result is
very close to 31, but not necessarily 31.
As you work with neural networks, you'll see that pattern recurring. You will almost always deal with
probabilities, not certainties, and will do a little bit of coding to figure out what the result is based on the
probabilities, particularly when it comes to classification.
Module 2
2. Start coding
First, walk through the executable Colab notebook.
import tensorflow as tf
print(tf.__version__)
You'll train a neural network to recognize items of clothing from a common dataset called Fashion MNIST. It contains 70,000 items of clothing in 10 different categories. Each item of clothing is a 28x28 grayscale image. You can see some examples here:
0 T-shirt/top
1 Trouser
2 Pullover
3 Dress
4 Coat
5 Sandal
6 Shirt
7 Sneaker
8 Bag
9 Ankle boot
The Fashion MNIST data is available in the tf.keras.datasets API. Load it like this:
mnist = tf.keras.datasets.fashion_mnist
Calling load_data on that object gives you two sets of two lists: training values and testing values,
which represent graphics that show clothing items and their labels.
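For example:
(training_images, training_labels), (test_images, test_labels) = mnist.load_data()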
What do those values look like? Print a training image and a training label to see. You can experiment with different indices in the array. You may also want to look at index 42, which holds a different boot than the one at index 0.
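For example (matplotlib is available by default in Colab; index 0 is an ankle boot):
import matplotlib.pyplot as plt
plt.imshow(training_images[0])
plt.show()
print(training_labels[0])
print(training_images[0])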
Now, you might be wondering why there are two datasets—training and testing.
The idea is to have one set of data for training and another set of data that the model hasn't yet
encountered to see how well it can classify values. After all, when you're done, you'll want to use
the model with data that it hadn't previously seen! Also, without separate testing data, you'll run
the risk of the network only memorizing its training data without generalizing its knowledge.
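One more preparation step before designing the model: the pixel values are in the range 0 through 255, and neural networks train more easily on small input values, so scale the images to the range 0 through 1 (Exercise 6 later revisits why this matters):
training_images = training_images / 255.0
test_images = test_images / 255.0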
3. Design the model
Now design the model. You'll have three layers. Go through them one-by-one and explore the
different types of layers and the parameters used for each.
model = tf.keras.models.Sequential([
  tf.keras.layers.Flatten(),
  tf.keras.layers.Dense(128, activation=tf.nn.relu),
  tf.keras.layers.Dense(10, activation=tf.nn.softmax)
])
Notice the use of metrics= as a parameter, which allows TensorFlow to report on the accuracy of
the training by checking the predicted results against the known answers (the labels).
model.compile(optimizer=tf.keras.optimizers.Adam(),
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
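Then train the model by calling model.fit, which produces output like the following:
model.fit(training_images, training_labels, epochs=5)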
Epoch 1/5
60000/60000 [=======] - 6s 101us/sample - loss: 0.4964 - acc: 0.8247
Epoch 2/5
60000/60000 [=======] - 5s 86us/sample - loss: 0.3720 - acc: 0.8656
Epoch 3/5
60000/60000 [=======] - 5s 85us/sample - loss: 0.3335 - acc: 0.8780
Epoch 4/5
60000/60000 [=======] - 6s 103us/sample - loss: 0.3134 - acc: 0.8844
Epoch 5/5
60000/60000 [=======] - 6s 94us/sample - loss: 0.2931 - acc: 0.8926
When the model is done training, you will see an accuracy value at the end of the final epoch. It
might look something like 0.8926 as above. This tells you that your neural network is about 89%
accurate in classifying the training data. In other words, it figured out a pattern match between
the image and the labels that worked 89% of the time. Not great, but not bad considering it was
only trained for five epochs and done quickly.
Next, see how the model performs on data it hasn't seen, by evaluating it on the test set:
model.evaluate(test_images, test_labels)
That example returned an accuracy of .8789, meaning it was about 88% accurate. (You might
have slightly different values.)
As expected, the model is not as accurate with the unknown data as it was with the data it was
trained on! As you learn more about TensorFlow, you'll find ways to improve that.
Exercise 1
For this first exercise, run the following code:
classifications = model.predict(test_images)
print(classifications[0])
It creates a set of classifications for each of the test images, then prints the first entry in the
classifications. The output after you run it is a list of numbers. Why do you think that is and what
do those numbers represent?
Try running print(test_labels[0]) and you'll get a 9. Does that help you understand why the list looks
the way it does?
The output of the model is a list of 10 numbers. Those numbers are the probabilities that the item being classified matches each of the corresponding labels. For example, the first value in the list is the probability that the clothing is of class 0, the next is the probability that it's class 1, and so on. Notice that they are all very low probabilities except one. Also, because of softmax, all the probabilities in the list sum to 1.0.
The list and the labels are 0 based, so the ankle boot having label 9 means that it is the 10th of
the 10 classes. The list having the 10th element being the highest value means that the neural
network has predicted that the item it is classifying is most likely an ankle boot.
Exercise 2
Look at the layers in your model. Experiment with different values for the dense layer with 512
neurons.
What different results do you get for loss and training time? Why do you think that's the case?
For example, if you increase to 1,024 neurons, you have to do more calculations, slowing down the process. But in this case the extra neurons have a good impact because the model becomes more accurate. That doesn't mean more is always better. You can hit the law of diminishing returns very quickly.
Exercise 3
What would happen if you removed the Flatten() layer? Why do you think that's the case?
You get an error about the shape of the data. The details of the error may seem vague right now,
but it reinforces the rule of thumb that the first layer in your network should be the same shape
as your data. Right now your data is 28x28 images, and 28 layers of 28 neurons would be
infeasible, so it makes more sense to flatten that 28,28 into a 784x1.
Instead of writing all the code, add the Flatten() layer at the beginning. When the arrays are loaded
into the model later, they'll automatically be flattened for you.
Exercise 4
Consider the final (output) layers. Why are there 10 of them? What would happen if you had a
different amount than 10?
Try training the network with 5. You get an error as soon as it finds an unexpected value. Another rule of thumb: the number of neurons in the last layer should match the number of classes you are classifying. In this case, there are 10 clothing classes labeled 0 through 9, so you should have 10 neurons in your final layer.
Exercise 5
Consider the effects of additional layers in the network. What will happen if you add another layer
between the one with 512 and the final layer with 10?
There isn't a significant impact because this is relatively simple data. For far more complex data,
extra layers are often necessary.
Exercise 6
Before you trained, you normalized the data, going from values that were 0 through 255 to values
that were 0 through 1. What would be the impact of removing that? Here's the complete code to
give it a try (note that the two lines that normalize the data are commented out).
Why do you think you get different results? There's a great answer here on Stack Overflow.
import tensorflow as tf
print(tf.__version__)
mnist = tf.keras.datasets.fashion_mnist
(training_images, training_labels), (test_images, test_labels) = mnist.load_data()
#training_images=training_images/255.0
#test_images=test_images/255.0
model = tf.keras.models.Sequential([
  tf.keras.layers.Flatten(),
  tf.keras.layers.Dense(512, activation=tf.nn.relu),
  tf.keras.layers.Dense(10, activation=tf.nn.softmax)
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy')
model.fit(training_images, training_labels, epochs=5)
model.evaluate(test_images, test_labels)
classifications = model.predict(test_images)
print(classifications[0])
print(test_labels[0])
7. Explore callbacks
Earlier, when you trained for extra epochs, you had an issue where your loss might change. It
might have taken a bit of time for you to wait for the training to do that and you might have
thought that it'd be nice if you could stop the training when you reach a desired value, such as
95% accuracy. If you reach that after 3 epochs, why sit around waiting for it to finish a lot more
epochs?
Like any other program, you have callbacks! See them in action:
import tensorflow as tf
class myCallback(tf.keras.callbacks.Callback):
  def on_epoch_end(self, epoch, logs={}):
    # Stop training once the accuracy metric passes 95%
    if logs.get('accuracy') > 0.95:
      print("\nReached 95% accuracy so cancelling training!")
      self.model.stop_training = True
callbacks = myCallback()
mnist = tf.keras.datasets.fashion_mnist
(training_images, training_labels), (test_images, test_labels) = mnist.load_data()
training_images=training_images/255.0
test_images=test_images/255.0
model = tf.keras.models.Sequential([
  tf.keras.layers.Flatten(),
  tf.keras.layers.Dense(512, activation=tf.nn.relu),
  tf.keras.layers.Dense(10, activation=tf.nn.softmax)
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
model.fit(training_images, training_labels, epochs=5, callbacks=[callbacks])
Module 3: Build convolutions and perform pooling
2. What are convolutions?
A convolution is a filter that passes over an image, processes it, and extracts the important
features.
Let's say you have an image of a person wearing a sneaker. How would you detect that a sneaker
is present in the image? In order for your program to "see" the image as a sneaker, you'll have to
extract the important features, and blur the inessential features. This is called feature mapping.
The feature mapping process is theoretically simple. You'll scan every pixel in the image and
then look at its neighboring pixels. You multiply the values of those pixels by the equivalent
weights in a filter.
For example:
The current pixel value is 192. You can calculate the value of the new pixel by looking at the
neighbor values, multiplying them by the values specified in the filter, and making the new pixel
value the final amount.
Now it's time to explore how convolutions work by creating a basic convolution on a 2D
grayscale image.
You'll demonstrate that with the ascent image from SciPy. It's a nice built-in picture with lots of
angles and lines.
3. Start coding
Start by importing some Python libraries and the ascent picture:
import cv2
import numpy as np
from scipy import misc
i = misc.ascent()
Next, use the Pyplot library matplotlib to draw the image so that you know what it looks like:
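For example (matplotlib comes preinstalled in Colab):
import matplotlib.pyplot as plt
plt.grid(False)
plt.gray()
plt.axis('off')
plt.imshow(i)
plt.show()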
You can see that it's an image of a stairwell. There are lots of features you can try and isolate.
For example, there are strong vertical lines.
The image is stored as a NumPy array, so you can create the transformed image by just copying that array. The size_x and size_y variables will hold the dimensions of the image so you can loop over it later.
i_transformed = np.copy(i)
size_x = i_transformed.shape[0]
size_y = i_transformed.shape[1]
4. Create the convolution matrix
First, make a convolution matrix (or kernel) as a 3x3 array:
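Here's one kernel that works well on this image; it emphasizes edges. The weight leaves the result unscaled; by convention, if your filter values don't sum to 0 or 1, the weight should compensate:
filter = [[0, 1, 0], [1, -4, 1], [0, 1, 0]]
weight = 1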
Now, calculate the output pixels. Iterate over the image, leaving a 1-pixel margin, and multiply
each of the neighbors of the current pixel by the value defined in the filter.
That means that the current pixel's neighbor above it and to the left of it will be multiplied by the
top-left item in the filter. Then, multiply the result by the weight and ensure that the result is in the
range 0 through 255.
for x in range(1, size_x-1):
  for y in range(1, size_y-1):
    output_pixel = 0.0
    output_pixel = output_pixel + (i[x-1, y-1] * filter[0][0])
    output_pixel = output_pixel + (i[x, y-1] * filter[0][1])
    output_pixel = output_pixel + (i[x+1, y-1] * filter[0][2])
    output_pixel = output_pixel + (i[x-1, y] * filter[1][0])
    output_pixel = output_pixel + (i[x, y] * filter[1][1])
    output_pixel = output_pixel + (i[x+1, y] * filter[1][2])
    output_pixel = output_pixel + (i[x-1, y+1] * filter[2][0])
    output_pixel = output_pixel + (i[x, y+1] * filter[2][1])
    output_pixel = output_pixel + (i[x+1, y+1] * filter[2][2])
    output_pixel = output_pixel * weight
    # Clamp the result to the valid 0-255 range
    if output_pixel < 0:
      output_pixel = 0
    if output_pixel > 255:
      output_pixel = 255
    i_transformed[x, y] = output_pixel
5. Examine the results
Now, plot the image to see the effect of passing the filter over it:
# Plot the image. Note the size of the axes -- they are 512 by 512
plt.gray()
plt.grid(False)
plt.imshow(i_transformed)
#plt.axis('off')
plt.show()
Consider the following filter values and their impact on the image.
Explore different values! Also, try differently sized filters, such as 5x5 or 7x7.
6. Understanding Pooling
Now that you've identified the essential features of the image, what do you do? How do you use
the resulting feature map to classify images?
Similar to convolutions, pooling greatly helps with detecting features. Pooling layers reduce the
overall amount of information in an image while maintaining the features that are detected as
present.
There are a number of different types of pooling, but you'll use one called Maximum (Max)
Pooling.
Iterate over the image and, at each point, consider the pixel and its immediate neighbors to the
right, beneath, and right-beneath. Take the largest of those (hence max pooling) and load it into
the new image. Thus, the new image will be one-fourth the size of the
old.
7. Write code for pooling
The following code will show a (2, 2) pooling. Run it to see the output.
You'll see that while the image is one-fourth the size of the original, it kept all the features.
new_x = int(size_x/2)
new_y = int(size_y/2)
newImage = np.zeros((new_x, new_y))
for x in range(0, size_x, 2):
  for y in range(0, size_y, 2):
    # Gather the 2x2 block of pixels and keep only the largest value
    pixels = []
    pixels.append(i_transformed[x, y])
    pixels.append(i_transformed[x+1, y])
    pixels.append(i_transformed[x, y+1])
    pixels.append(i_transformed[x+1, y+1])
    newImage[int(x/2), int(y/2)] = max(pixels)
# Plot the image. Note the size of the axes -- now 256 pixels instead of 512
plt.gray()
plt.grid(False)
plt.imshow(newImage)
#plt.axis('off')
plt.show()
Note the axes of that plot. The image is now 256x256, one-fourth of its original size, and the
detected features have been enhanced despite less data now being in the image.
Module 4: Convolutional neural networks (CNNs)
For convenience, here's the entire code again. Run it and take note of the test accuracy printed at the end.
import tensorflow as tf
mnist = tf.keras.datasets.fashion_mnist
(training_images, training_labels), (test_images, test_labels) = mnist.load_data()
training_images=training_images/255.0
test_images=test_images/255.0
model = tf.keras.models.Sequential([
  tf.keras.layers.Flatten(),
  tf.keras.layers.Dense(128, activation='relu'),
  tf.keras.layers.Dense(10, activation='softmax')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
model.fit(training_images, training_labels, epochs=5)
test_loss, test_accuracy = model.evaluate(test_images, test_labels)
print('Test loss: {}, Test accuracy: {}'.format(test_loss, test_accuracy*100))
Your accuracy is probably about 89% on training and 87% on validation. You can make that even
better using convolutions, which narrows down the content of the image to focus on specific,
distinct details.
If you've ever done image processing using a filter, then convolutions will look very familiar.
In short, you take an array (usually 3x3 or 5x5) and pass it over the image. By changing the
underlying pixels based on the formula within that matrix, you can perform operations like edge
detection. For example, typically a 3x3 is defined for edge detection where the middle cell is 8,
and all of its neighbors are -1. In this case, for each pixel, you would multiply its value by 8, then
subtract the value of each neighbor. Do this for every pixel, and you'll end up with a new image
that has its edges enhanced.
This is perfect for computer vision, because enhancing features like edges helps the computer
distinguish one item from another. Better still, the amount of information needed is much less,
because you'll train only on the highlighted features.
That's the concept of Convolutional Neural Networks. Add some layers to do convolution before
you have the dense layers, and then the information going to the dense layers becomes more
focused and possibly more accurate.
3. Try the code
Run the following code. It's the same neural network as earlier, but this time with convolutional
layers added first. It will take longer, but look at the impact on the accuracy:
import tensorflow as tf
print(tf.__version__)
mnist = tf.keras.datasets.fashion_mnist
(training_images, training_labels), (test_images, test_labels) = mnist.load_data()
training_images=training_images.reshape(60000, 28, 28, 1)
training_images=training_images / 255.0
test_images = test_images.reshape(10000, 28, 28, 1)
test_images=test_images / 255.0
model = tf.keras.models.Sequential([
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu', input_shape=(28, 28, 1)),
  tf.keras.layers.MaxPooling2D(2, 2),
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  tf.keras.layers.Flatten(),
  tf.keras.layers.Dense(128, activation='relu'),
  tf.keras.layers.Dense(10, activation='softmax')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
model.summary()
model.fit(training_images, training_labels, epochs=5)
test_loss, test_accuracy = model.evaluate(test_images, test_labels)
print('Test loss: {}, Test accuracy: {}'.format(test_loss, test_accuracy*100))
It's likely gone up to about 93% on the training data and 91% on the validation data.
Now try running it for more epochs—say about 20—and explore the results. While the training
results might seem really good, the validation results may actually go down due to a
phenomenon called overfitting.
Overfitting occurs when the network learns the data from the training set too well, so it's specialized to recognize only that data and, as a result, is less effective with data it hasn't seen before. For example, if you trained only on heels, then the network might be very good at identifying heels, but sneakers might confuse it.
Look at the code again, and see step-by-step how the convolutions were built.
4. Gather the data
The first step is to gather the data.
You'll notice that there's a change here and the training data needed to be reshaped. That's
because the first convolution expects a single tensor containing everything, so instead of 60,000
28x28x1 items in a list, you have a single 4D list that is 60,000x28x28x1, and the same for the
test images. If you don't do that, then you'll get an error when training because the convolutions
do not recognize the shape.
import tensorflow as tf
mnist = tf.keras.datasets.fashion_mnist
(training_images, training_labels), (test_images, test_labels) = mnist.load_data()
training_images=training_images.reshape(60000, 28, 28, 1)
training_images = training_images/255.0
test_images = test_images.reshape(10000, 28, 28, 1)
test_images = test_images/255.0
Next, define the model. Instead of starting with a dense layer, you'll begin with a convolutional layer, whose parameters are the following (the full model code appears after the summary below):
• The number of convolutions you want to generate. A value like 32 is a good starting point.
• The size of the convolutional matrix, in this case a 3x3 grid.
• The activation function to use, in this case relu.
• In the first layer, the shape of the input data.
You'll follow the convolution with a max pooling layer, which is designed to compress the image
while maintaining the content of the features that were highlighted by the convolution. By
specifying (2,2) for the max pooling, the effect is to reduce the size of the image by a factor of 4.
It creates a 2x2 array of pixels and picks the largest pixel value, turning 4 pixels into 1. It repeats
this computation across the image, and in so doing halves the number of horizontal pixels and
halves the number of vertical pixels.
You can call model.summary() to see the size and shape of the network. Notice that after every
max pooling layer, the image size is reduced in the following way:
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
conv2d_2 (Conv2D) (None, 26, 26, 64) 640
_________________________________________________________________
max_pooling2d_2 (MaxPooling2 (None, 13, 13, 64) 0
_________________________________________________________________
conv2d_3 (Conv2D) (None, 11, 11, 64) 36928
_________________________________________________________________
max_pooling2d_3 (MaxPooling2 (None, 5, 5, 64) 0
_________________________________________________________________
flatten_2 (Flatten) (None, 1600) 0
_________________________________________________________________
dense_4 (Dense) (None, 128) 204928
_________________________________________________________________
dense_5 (Dense) (None, 10) 1290
=================================================================
model = tf.keras.models.Sequential([
  # 64 filters of size 3x3, matching the summary above
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu', input_shape=(28, 28, 1)),
  tf.keras.layers.MaxPooling2D(2, 2),
  # Add another convolution
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  # Now flatten the output. After this, you'll have the same DNN structure as the non-convolutional version
  tf.keras.layers.Flatten(),
  # The same 128-neuron dense layer and 10-neuron output layer as in the pre-convolution example:
  tf.keras.layers.Dense(128, activation='relu'),
  tf.keras.layers.Dense(10, activation='softmax')
])
Code:
print(test_labels[:100])
[9 2 1 1 6 1 4 6 5 7 4 5 7 3 4 1 2 4 8 0 2 5 7 9 1 4 6 0 9 3 8 8 3 3 8 0 7
 5 7 9 6 1 3 7 6 7 2 1 2 2 4 4 5 8 2 2 8 4 8 0 7 7 8 5 1 1 2 3 9 8 7 0 2 6
 2 3 1 2 8 4 1 8 5 9 5 0 3 2 0 6 5 3 6 7 1 8 0 1 4 2]
Now you can select some of the corresponding images for those labels and render what they look like going through the convolutions. In the following code, FIRST_IMAGE, SECOND_IMAGE, and THIRD_IMAGE are all indexes of images whose label is 9, an ankle boot.
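Here's a sketch of that rendering code. The index values 0, 23, and 28 are read off the label printout above; CONVOLUTION_NUMBER is an arbitrary choice of which of the 64 filters to display:
import matplotlib.pyplot as plt
FIRST_IMAGE = 0
SECOND_IMAGE = 23
THIRD_IMAGE = 28
CONVOLUTION_NUMBER = 1
layer_outputs = [layer.output for layer in model.layers]
activation_model = tf.keras.models.Model(inputs=model.input, outputs=layer_outputs)
f, axarr = plt.subplots(3, 4)
for x in range(0, 4):  # the first four layers: conv, pool, conv, pool
  for row, image_index in enumerate([FIRST_IMAGE, SECOND_IMAGE, THIRD_IMAGE]):
    f_map = activation_model.predict(test_images[image_index].reshape(1, 28, 28, 1))[x]
    axarr[row, x].imshow(f_map[0, :, :, CONVOLUTION_NUMBER], cmap='inferno')
    axarr[row, x].grid(False)
plt.show()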
And you should see something like the following, where the convolution is taking the essence of
the sole of the shoe, effectively spotting that as a common feature across all shoes.
8. Exercises
Exercise 1
Try editing the convolutions. Change the number of convolutions from 32 to either 16 or 64.
What impact does that have on accuracy and training time?
Exercise 2
Remove the final convolution. What impact does that have on accuracy or training time?
Exercise 3
Add more convolutions. What impact does that have?
Exercise 4
Remove all convolutions but the first. What impact does that have? Experiment with it.
Module 5: Complex images
Start by downloading the horses-or-humans dataset:
!wget \
https://storage.googleapis.com/learning-datasets/horse-or-human.zip \
-O /tmp/horse-or-human.zip
The following Python code uses the os library to access the file system and the zipfile library to unzip the data.
import os
import zipfile
local_zip = '/tmp/horse-or-human.zip'
zip_ref = zipfile.ZipFile(local_zip, 'r')
zip_ref.extractall('/tmp/horse-or-human')
zip_ref.close()
The contents of the zip file are extracted to the base directory /tmp/horse-or-human, which contains horses and humans subdirectories.
In short, the training set is the data that is used to tell the neural network model that "this is what
a horse looks like" and "this is what a human looks like."
3. Use the ImageGenerator to label and prepare the data
You do not explicitly label the images as horses or humans.
Later you'll see something called an ImageDataGenerator being used. It reads images from
subdirectories and automatically labels them from the name of that subdirectory. For example,
you have a training directory containing a horses directory and a humans
directory. ImageDataGenerator will label the images appropriately for you, reducing a coding step.
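Before exploring the files, point a couple of variables at the training subdirectories (the paths follow from the extraction step above):
train_horse_dir = os.path.join('/tmp/horse-or-human/horses')
train_human_dir = os.path.join('/tmp/horse-or-human/humans')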
Now, see what the filenames look like in the horses and humans training directories:
train_horse_names = os.listdir(train_horse_dir)
print(train_horse_names[:10])
train_human_names = os.listdir(train_human_dir)
print(train_human_names[:10])
Find the total number of horse and human images in the directories:
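For example:
print('total training horse images:', len(os.listdir(train_horse_dir)))
print('total training human images:', len(os.listdir(train_human_dir)))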
%matplotlib inline
Now, display a batch of eight horse pictures and eight human pictures. You can rerun the cell to
see a fresh batch each time.
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
nrows, ncols = 4, 4  # display the pictures in a 4x4 grid
pic_index = 0        # index for iterating over images

pic_index += 8
next_horse_pix = [os.path.join(train_horse_dir, fname)
                  for fname in train_horse_names[pic_index-8:pic_index]]
next_human_pix = [os.path.join(train_human_dir, fname)
                  for fname in train_human_names[pic_index-8:pic_index]]
for i, img_path in enumerate(next_horse_pix + next_human_pix):
  sp = plt.subplot(nrows, ncols, i + 1)
  sp.axis('Off')
  img = mpimg.imread(img_path)
  plt.imshow(img)
plt.show()
Here are some example images showing horses and humans in different poses and orientations:
5. Define the model
Start defining the model.
import tensorflow as tf
Then, add convolutional layers and flatten the final result to feed into the densely connected
layers. Finally, add the densely connected layers.
model = tf.keras.models.Sequential([
  # Note the input shape is the desired size of the image: 300x300 with 3 bytes of color
  # This is the first convolution
  tf.keras.layers.Conv2D(16, (3, 3), activation='relu', input_shape=(300, 300, 3)),
  tf.keras.layers.MaxPooling2D(2, 2),
  # The second convolution
  tf.keras.layers.Conv2D(32, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  # The third convolution
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  # The fourth convolution
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  # The fifth convolution
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  # Flatten the results to feed into a DNN
  tf.keras.layers.Flatten(),
  # 512-neuron hidden layer
  tf.keras.layers.Dense(512, activation='relu'),
  # Only 1 output neuron. It will contain a value from 0 to 1, where 0 is for one class ('horses') and 1 is for the other ('humans')
  tf.keras.layers.Dense(1, activation='sigmoid')
])
model.summary()
You can see the results here:
The output shape column shows how the size of your feature map evolves in each successive layer. Because the convolutions use no padding, each convolution layer trims a 1-pixel border from the feature map (reducing the width and height by 2), and each pooling layer halves the dimensions.
Note: In this case, using the RMSprop optimization algorithm is preferable to stochastic
gradient descent (SGD) because RMSprop automates learning-rate tuning for you. (Other
optimizers, such as Adam and Adagrad, also automatically adapt the learning rate during
training and would work equally well here.)
Code:
from tensorflow.keras.optimizers import RMSprop
model.compile(loss='binary_crossentropy',
              optimizer=RMSprop(learning_rate=0.001),
              metrics=['acc'])
You'll have one generator for the training images and one for the validation images. Your
generators will yield batches of images of size 300x300 and their labels (binary).
As you may already know, data that goes into neural networks should usually be normalized in
some way to make it more amenable to processing by the network. (It's uncommon to feed raw
pixels into a CNN.) In your case, you'll preprocess your images by normalizing the pixel values to
be in the [0, 1] range (originally all values are in the [0, 255] range).
In Keras, that can be done via the keras.preprocessing.image.ImageDataGenerator class using the
rescale parameter. That ImageDataGenerator class allows you to instantiate generators of
augmented image batches (and their labels) via .flow(data, labels) or
.flow_from_directory(directory). Those generators can then be used with the Keras model methods that accept data generators as inputs. (Older code called fit_generator, evaluate_generator, and predict_generator; in recent TensorFlow versions, fit, evaluate, and predict accept generators directly, as shown below.)
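Here's a sketch of the training generator (the batch size of 128 is an assumption to tune; a validation generator would be built the same way from its own directory):
from tensorflow.keras.preprocessing.image import ImageDataGenerator
# All images will be rescaled by 1/255
train_datagen = ImageDataGenerator(rescale=1/255)
train_generator = train_datagen.flow_from_directory(
    '/tmp/horse-or-human/',  # directory containing the horses and humans subdirectories
    target_size=(300, 300),  # resize all images to 300x300 as they're loaded
    batch_size=128,
    class_mode='binary')     # binary labels, matching the binary_crossentropy loss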
history = model.fit(
    train_generator,
    steps_per_epoch=8,
    epochs=15,
    verbose=1)
The loss and accuracy are a great indication of training progress. The model makes a guess at the classification of each training image, measures the guess against the known label, and calculates the loss. Accuracy is the fraction of correct guesses.
Epoch 1/15
9/9 [==============================] - 9s 1s/step - loss: 0.8662 - acc: 0.5151
Epoch 2/15
9/9 [==============================] - 8s 927ms/step - loss: 0.7212 - acc: 0.5969
Epoch 3/15
9/9 [==============================] - 8s 921ms/step - loss: 0.6612 - acc: 0.6592
Epoch 4/15
9/9 [==============================] - 8s 925ms/step - loss: 0.3135 - acc: 0.8481
Epoch 5/15
9/9 [==============================] - 8s 919ms/step - loss: 0.4640 - acc: 0.8530
Epoch 6/15
9/9 [==============================] - 8s 896ms/step - loss: 0.2306 - acc: 0.9231
Epoch 7/15
9/9 [==============================] - 8s 915ms/step - loss: 0.1464 - acc: 0.9396
Epoch 8/15
9/9 [==============================] - 8s 935ms/step - loss: 0.2663 - acc: 0.8919
Epoch 9/15
9/9 [==============================] - 8s 883ms/step - loss: 0.0772 - acc: 0.9698
Epoch 10/15
9/9 [==============================] - 9s 951ms/step - loss: 0.0403 - acc: 0.9805
Epoch 11/15
9/9 [==============================] - 8s 891ms/step - loss: 0.2618 - acc: 0.9075
Epoch 12/15
9/9 [==============================] - 8s 902ms/step - loss: 0.0434 - acc: 0.9873
Epoch 13/15
9/9 [==============================] - 8s 904ms/step - loss: 0.0187 - acc: 0.9932
Epoch 14/15
9/9 [==============================] - 9s 951ms/step - loss: 0.0974 - acc: 0.9649
Epoch 15/15
9/9 [==============================] - 8s 877ms/step - loss: 0.2859 - acc: 0.9338
9. Test the model
Now actually run a prediction using the model. The code will allow you to choose one or more
files from your file system. It will then upload them and run them through the model, giving an
indication of whether the object is a horse or a human.
You can download images from the internet to your file system to try them out! Note that you
might see that the network makes a lot of mistakes despite the fact that the training accuracy is
above 99%.
That's due to something called overfitting, which means that the neural network is trained with
very limited data (there are only roughly 500 images of each class). So it's very good at
recognizing images that look like those in the training set, but it can fail a lot at images that are
not in the training set.
That's a datapoint proving that the more data that you train on, the better your final network will
be!
There are many techniques that can be used to make your training better, despite limited data,
including something called image augmentation, but that's beyond the scope of this codelab.
import numpy as np
from google.colab import files
from keras.preprocessing import image
uploaded = files.upload()
for fn in uploaded.keys():
  # predicting images
  path = '/content/' + fn
  img = image.load_img(path, target_size=(300, 300))
  x = image.img_to_array(img)
  x = np.expand_dims(x, axis=0)
  images = np.vstack([x])
  classes = model.predict(images, batch_size=10)
  print(classes[0])
  if classes[0] > 0.5:
    print(fn + " is a human")
  else:
    print(fn + " is a horse")
For example, say that you want to test with this image:
Here's what the colab produces:
Pick a random image from the training set, then generate a figure where each row is the output
of a layer and each image in the row is a specific filter in that output feature map. Rerun that cell
to generate intermediate representations for a variety of training images.
import numpy as np
import random
from tensorflow.keras.preprocessing.image import img_to_array, load_img
# Let's define a new Model that will take an image as input, and will output
# intermediate representations for all layers in the previous model after
# the first.
successive_outputs = [layer.output for layer in model.layers[1:]]
#visualization_model = Model(img_input, successive_outputs)
visualization_model = tf.keras.models.Model(inputs = model.input, outputs = successive_outputs)
# Let's prepare a random input image from the training set.
horse_img_files = [os.path.join(train_horse_dir, f) for f in train_horse_names]
human_img_files = [os.path.join(train_human_dir, f) for f in train_human_names]
img_path = random.choice(horse_img_files + human_img_files)
# Load the image and turn it into a batch of one: shape (1, 300, 300, 3)
img = load_img(img_path, target_size=(300, 300))
x = img_to_array(img)
x = x.reshape((1,) + x.shape)
# Rescale by 1/255
x /= 255
# Let's run our image through our network, thus obtaining all
# intermediate representations for this image.
successive_feature_maps = visualization_model.predict(x)
# These are the names of the layers, so you can have them as part of the plot.
# Skip the first layer's name to stay aligned with successive_outputs above.
layer_names = [layer.name for layer in model.layers[1:]]
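A sketch of the display loop follows: it tiles every filter's output for a layer into one horizontal strip and normalizes the values so they're visible:
import matplotlib.pyplot as plt
for layer_name, feature_map in zip(layer_names, successive_feature_maps):
  if len(feature_map.shape) != 4:
    continue  # only plot the conv/pool outputs, not the dense layers
  n_features = feature_map.shape[-1]  # number of filters in the layer
  size = feature_map.shape[1]         # feature map shape is (1, size, size, n_features)
  display_grid = np.zeros((size, size * n_features))
  for i in range(n_features):
    feat = feature_map[0, :, :, i]
    feat = (feat - feat.mean()) / (feat.std() + 1e-5)  # normalize for display
    feat = np.clip(feat * 64 + 128, 0, 255).astype('uint8')
    display_grid[:, i * size:(i + 1) * size] = feat   # tile into the strip
  plt.figure(figsize=(20, 2))
  plt.title(layer_name)
  plt.grid(False)
  plt.imshow(display_grid, aspect='auto', cmap='viridis')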
As you can see, you go from the raw pixels of the images to increasingly abstract and compact
representations. The representations downstream start highlighting what the network pays
attention to, and they show fewer and fewer features being "activated." Most are set to zero.
That's called sparsity. Representation sparsity is a key feature of deep learning.
Those representations carry increasingly less information about the original pixels of the image,
but increasingly refined information about the class of the image. You can think of a CNN (or a
deep network in general) as an information distillation pipeline.
Module 6: Use convolutional neural networks (CNNs) with large datasets to avoid overfitting
First, set up your development environment with the requisite libraries that you'll need.
import os
import zipfile
import random
import tensorflow as tf
from tensorflow.keras.optimizers import RMSprop
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from shutil import copyfile
try:
  os.mkdir('/tmp/cats-v-dogs')
  os.mkdir('/tmp/cats-v-dogs/training')
  os.mkdir('/tmp/cats-v-dogs/testing')
  os.mkdir('/tmp/cats-v-dogs/training/cats')
  os.mkdir('/tmp/cats-v-dogs/training/dogs')
  os.mkdir('/tmp/cats-v-dogs/testing/cats')
  os.mkdir('/tmp/cats-v-dogs/testing/dogs')
except OSError:
  pass
CAT_SOURCE_DIR = "/tmp/PetImages/Cat/"
TRAINING_CATS_DIR = "/tmp/cats-v-dogs/training/cats/"
TESTING_CATS_DIR = "/tmp/cats-v-dogs/testing/cats/"
DOG_SOURCE_DIR = "/tmp/PetImages/Dog/"
TRAINING_DOGS_DIR = "/tmp/cats-v-dogs/training/dogs/"
TESTING_DOGS_DIR = "/tmp/cats-v-dogs/testing/dogs/"
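The split_data helper isn't shown in this excerpt; here's a sketch of what it does. It skips zero-length files (hence the expected output below) and copies a random split_size fraction of the images into the training directory and the rest into testing:
def split_data(SOURCE, TRAINING, TESTING, SPLIT_SIZE):
  files = []
  for filename in os.listdir(SOURCE):
    if os.path.getsize(os.path.join(SOURCE, filename)) > 0:
      files.append(filename)
    else:
      print(filename + " is zero length, so ignoring")
  shuffled = random.sample(files, len(files))
  split_point = int(len(files) * SPLIT_SIZE)
  for filename in shuffled[:split_point]:
    copyfile(os.path.join(SOURCE, filename), os.path.join(TRAINING, filename))
  for filename in shuffled[split_point:]:
    copyfile(os.path.join(SOURCE, filename), os.path.join(TESTING, filename))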
split_size = .9
split_data(CAT_SOURCE_DIR, TRAINING_CATS_DIR, TESTING_CATS_DIR, split_size)
split_data(DOG_SOURCE_DIR, TRAINING_DOGS_DIR, TESTING_DOGS_DIR, split_size)
# Expected output
# 666.jpg is zero length, so ignoring
# 11702.jpg is zero length, so ignoring
You can check to see if your data is properly unpacked using the following code:
print(len(os.listdir('/tmp/cats-v-dogs/training/cats/')))
print(len(os.listdir('/tmp/cats-v-dogs/training/dogs/')))
print(len(os.listdir('/tmp/cats-v-dogs/testing/cats/')))
print(len(os.listdir('/tmp/cats-v-dogs/testing/dogs/')))
# Expected output:
# 11250
# 11250
# 1250
# 1250
model = tf.keras.models.Sequential([
  tf.keras.layers.Conv2D(16, (3, 3), activation='relu', input_shape=(150, 150, 3)),
  tf.keras.layers.MaxPooling2D(2, 2),
  tf.keras.layers.Conv2D(32, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
  tf.keras.layers.MaxPooling2D(2, 2),
  tf.keras.layers.Flatten(),
  tf.keras.layers.Dense(512, activation='relu'),
  tf.keras.layers.Dense(1, activation='sigmoid')
])
model.compile(optimizer=RMSprop(learning_rate=0.001), loss='binary_crossentropy', metrics=['accuracy'])
6. Train the model
Now that the model is defined, you can train the model using an ImageDataGenerator.
TRAINING_DIR = "/tmp/cats-v-dogs/training/"
train_datagen = ImageDataGenerator(rescale=1.0/255.)
train_generator = train_datagen.flow_from_directory(TRAINING_DIR,
                                                    batch_size=100,
                                                    class_mode='binary',
                                                    target_size=(150, 150))
VALIDATION_DIR = "/tmp/cats-v-dogs/testing/"
validation_datagen = ImageDataGenerator(rescale=1.0/255.)
validation_generator = validation_datagen.flow_from_directory(VALIDATION_DIR,
                                                              batch_size=100,
                                                              class_mode='binary',
                                                              target_size=(150, 150))
# Expected Output:
# Found 22498 images belonging to 2 classes.
# Found 2500 images belonging to 2 classes.
To train the model, you now call model.fit (older versions used model.fit_generator), passing it the generators that you created.
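For example (15 epochs is a reasonable starting point; history is used by the plotting code below):
history = model.fit(train_generator,
                    epochs=15,
                    verbose=1,
                    validation_data=validation_generator)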
%matplotlib inline
import matplotlib.image as mpimg
import matplotlib.pyplot as plt
#-----------------------------------------------------------
# Retrieve a list of list results on training and test data
# sets for each training epoch
#-----------------------------------------------------------
acc=history.history['accuracy']
val_acc=history.history['val_accuracy']
loss=history.history['loss']
val_loss=history.history['val_loss']
epochs=range(len(acc)) # Get number of epochs
#------------------------------------------------
# Plot training and validation accuracy per epoch
#------------------------------------------------
plt.plot(epochs, acc, 'r', label="Training Accuracy")
plt.plot(epochs, val_acc, 'b', label="Validation Accuracy")
plt.title('Training and validation accuracy')
plt.legend()
plt.figure()
#------------------------------------------------
# Plot training and validation loss per epoch
#------------------------------------------------
plt.plot(epochs, loss, 'r', label="Training Loss")
plt.plot(epochs, val_loss, 'b', label="Validation Loss")
plt.title('Training and validation loss')
plt.legend()
plt.figure()
If you want to take the model for a spin, you can use the following code. Upload images to see how it
classifies them!
# Here's a codeblock just for fun. You should be able to upload an image here
# and have it classified without crashing
import numpy as np
from google.colab import files
from keras.preprocessing import image
uploaded = files.upload()
for fn in uploaded.keys():
  # predicting images
  path = '/content/' + fn
  img = image.load_img(path, target_size=(150, 150))
  x = image.img_to_array(img)
  x = np.expand_dims(x, axis=0)
  images = np.vstack([x])
  classes = model.predict(images, batch_size=10)
  print(classes[0])
  if classes[0] > 0.5:
    print(fn + " is a dog")
  else:
    print(fn + " is a cat")