Build image classifier using transfer learning - Fine-tuning MobileNet with Keras

Building a fine-tuned MobileNet model with TensorFlow's Keras API

In this episode, we'll discuss how to build a fine-tuned MobileNet model and implement this model in code using TensorFlow's Keras API.

Now that we've seen what MobileNet is all about in the last episode, let's talk about how we can fine-tune the model and use transfer learning to train it on another dataset.

If you're not already familiar with the concept of fine-tuning, that's alright because we have several other episodes on fine-tuning using the VGG16 model with Keras, as well as an episode dedicated to the concept of fine-tuning and transfer learning, so check those out first if you need to.

Alright, let's jump into the code!

Fine-tuning MobileNet with Keras

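First, make sure all the imports are in place from last time. The code in this episode uses tf (TensorFlow), along with ImageDataGenerator, Dense, and Model from Keras.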

Similar to what we previously implemented with VGG16, we're going to be fine-tuning MobileNet on images of cats and dogs. The implementation will be pretty similar, but you'll notice there will be a few differences.

Many different breeds of cats and dogs were included in the ImageNet data set on which MobileNet was originally trained, so the original model has already learned a lot about cats and dogs in general. Because of this, it won't take much tuning to get the model to perform well on this specific, narrower classification task.

In a later episode, however, we'll be fine-tuning MobileNet on a completely new data set made up of classes that the model hasn't already learned about in its original training, so stay tuned for that.

Preparing the data

Before we start tuning the model, we need to prepare the data. The data I'm using is a random subset of the cat and dog images from the Kaggle Dogs vs. Cats competition. I have the images stored on disk in a specific directory structure so that they can be used with the Keras flow_from_directory() function that we'll see in just a sec.

If you're following along, then you'll need to structure your data in the same way, and you can do that by following the episode on Image Preparation for CNNs with Keras.

We now define the path variables for where the training, validation, and test sets reside on disk.

train_path = 'data/dogs-vs-cats/train'
valid_path = 'data/dogs-vs-cats/valid'
test_path = 'data/dogs-vs-cats/test'
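As a rough sketch of the directory structure that flow_from_directory() works with, the snippet below builds a layout with one subfolder per class under each of the train, valid, and test folders, and then checks that nothing is missing. The class folder names cat and dog, the helper names, and the use of a temporary directory are all assumptions made for illustration, so adjust them to match your own data.

```python
import tempfile
from pathlib import Path

# Hypothetical class folder names: flow_from_directory() treats each
# subdirectory of the path it is given as one class.
CLASS_NAMES = ('cat', 'dog')
SPLITS = ('train', 'valid', 'test')


def make_layout(root):
    """Create root/<split>/<class> folders and return them as a list."""
    created = []
    for split in SPLITS:
        for class_name in CLASS_NAMES:
            folder = Path(root) / split / class_name
            folder.mkdir(parents=True, exist_ok=True)
            created.append(folder)
    return created


def missing_class_dirs(root):
    """Return the expected class folders that do not exist under root."""
    return [
        Path(root) / split / class_name
        for split in SPLITS
        for class_name in CLASS_NAMES
        if not (Path(root) / split / class_name).is_dir()
    ]


# Build the layout in a throwaway directory so the sketch is safe to run.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp) / 'dogs-vs-cats'
    missing_before = missing_class_dirs(root)
    created = make_layout(root)
    missing_after = missing_class_dirs(root)
    relative = sorted(p.relative_to(root).as_posix() for p in created)

print(relative)
```

Running missing_class_dirs() against your real data folder before creating the iterators is a quick way to catch a misplaced class folder.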

Then, we create directory iterators for each dataset using Keras' ImageDataGenerator.flow_from_directory() function, which yields batches of image data from the directory that we pass in with our first parameter.

train_batches = ImageDataGenerator(preprocessing_function=tf.keras.applications.mobilenet.preprocess_input).flow_from_directory(
    directory=train_path, target_size=(224,224), batch_size=10)
valid_batches = ImageDataGenerator(preprocessing_function=tf.keras.applications.mobilenet.preprocess_input).flow_from_directory(
    directory=valid_path, target_size=(224,224), batch_size=10)
test_batches = ImageDataGenerator(preprocessing_function=tf.keras.applications.mobilenet.preprocess_input).flow_from_directory(
    directory=test_path, target_size=(224,224), batch_size=10, shuffle=False)

Notice the preprocessing_function parameter we're supplying to ImageDataGenerator. We're setting it equal to tf.keras.applications.mobilenet.preprocess_input, passing the function itself rather than calling it. ImageDataGenerator then applies the necessary MobileNet preprocessing to the images obtained from flow_from_directory().

Recall that we talked about this exact function in the last episode, along with its role in preprocessing images for MobileNet.
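To make the role of the preprocessing function a little more concrete: to my knowledge, the MobileNet version of preprocess_input rescales pixel values from the range [0, 255] to the range [-1, 1]. Here's a minimal NumPy sketch of that scaling. The function name scale_to_minus_one_one is hypothetical, and this is a stand-in for illustration rather than the Keras implementation itself.

```python
import numpy as np


def scale_to_minus_one_one(pixels):
    """Map pixel values in [0, 255] to [-1, 1] (illustrative stand-in)."""
    pixels = np.asarray(pixels, dtype=np.float32)
    return pixels / 127.5 - 1.0


# Three sample pixel values: darkest, mid-gray, and brightest.
sample = np.array([0.0, 127.5, 255.0], dtype=np.float32)
scaled = scale_to_minus_one_one(sample)
print(scaled)
```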

To flow_from_directory(), we're passing in the path to the data set, the target_size for the images, and the batch_size we're choosing to use for training. We do this exact same thing for all three data sets: train, validation, and test.

For the test_batches variable, we're also supplying one additional parameter, shuffle=False, so that we can later access the corresponding non-shuffled test labels to plot a confusion matrix.
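Here's a small sketch of why the ordering matters. A confusion matrix pairs each true label with the prediction at the same position, so the two lists need to be in the same order. The label values below are made up for illustration, and confusion_counts is a hypothetical helper. In practice the true labels would come from the test iterator (for example its classes attribute) and the predictions from the trained model.

```python
def confusion_counts(true_labels, predicted_labels, num_classes):
    """Count (true, predicted) pairs; rows are true classes, columns are predictions."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for true_label, predicted_label in zip(true_labels, predicted_labels):
        matrix[true_label][predicted_label] += 1
    return matrix


# Made-up labels: 0 stands for cat, 1 stands for dog.
true_labels = [0, 0, 0, 1, 1, 1]        # order preserved because shuffle=False
predicted_labels = [0, 0, 1, 1, 1, 0]   # predictions in that same order

matrix = confusion_counts(true_labels, predicted_labels, num_classes=2)
print(matrix)
```

If the test batches were shuffled, the predictions would no longer line up position by position with the stored labels, and the counts would be meaningless.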

The data portion is now done. Next, let's move on to modifying the model.

Model modification

We import MobileNet in the same way we saw in the last episode.

mobile = tf.keras.applications.mobilenet.MobileNet()

Next, we're going to grab the output from the sixth-to-last layer of the model and store it in the variable x.

x = mobile.layers[-6].output

We'll be using this to build a new model. This new model will consist of the original MobileNet up to the sixth to last layer. We're not including the last five layers of the original MobileNet.

By looking at the summary of the original model, we can see that by not including the last five layers, we'll be including everything up to and including the last global_average_pooling layer. Run mobile.summary() yourself or watch the corresponding video to see this.

Note that the number of layers that you choose to cut off when you're fine-tuning a model will vary for each scenario, but I've found through experimentation that just removing the last 5 layers here works out well for this particular task. So with this setup, we'll be keeping the vast majority of the original MobileNet architecture, which has 88 layers total.
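If the negative indexing feels a bit abstract, here's a plain-Python sketch of what the slicing does, using a list of 88 placeholder names in place of the real layers. The names layer_0 through layer_87 are hypothetical and are not the actual MobileNet layer names.

```python
# 88 placeholder names standing in for the layers of the original model.
# These are hypothetical labels, not the real MobileNet layer names.
layers = [f'layer_{i}' for i in range(88)]

cut_point = layers[-6]   # the sixth-to-last layer, whose output we build on
dropped = layers[-5:]    # the last five layers, which are left out
kept = layers[:-5]       # everything up to and including the cut point

print(cut_point, len(kept), dropped)
```

So indexing with -6 picks the last layer we keep, and the five layers after it are the ones that get replaced by the new output layer.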

Now, we append an output layer that we're calling output, which will just be a Dense layer with 2 output nodes, for cat and dog, and we'll use the softmax activation function.

output = Dense(units=2, activation='softmax')(x)
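To give a feel for what this output layer computes, here's a minimal NumPy sketch of a dense layer with two units followed by softmax. The 1024-element feature vector and the random weights are assumptions for illustration only. In the real model, the weights are learned during training.

```python
import numpy as np


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    shifted = logits - np.max(logits)  # subtract the max for numerical stability
    exponentials = np.exp(shifted)
    return exponentials / exponentials.sum()


rng = np.random.default_rng(0)
features = rng.normal(size=1024)                  # assumed feature vector from x
weights = rng.normal(scale=0.01, size=(1024, 2))  # one column per class
bias = np.zeros(2)

# Dense layer: weighted sum plus bias, then softmax over the two classes.
probabilities = softmax(features @ weights + bias)
print(probabilities)
```

The two values can be read as the model's confidence for each of the two classes.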

Now, we construct the new fine-tuned model, which we're calling model.

model = Model(inputs=mobile.input, outputs=output)

Note that, as you can see from the Model constructor used to create our model, this model is being created with the Keras Functional API, not the Sequential API that we've worked with in previous episodes. That's why the format we're using to create the model may look a little different from what you're used to.

To build the new model, we create an instance of the Model class and specify the inputs to the model to be equal to the input of the original MobileNet, and then we define the outputs of the model to be equal to the output variable we created directly above.

This creates a new model, which is identical to the original MobileNet up to the original model's sixth to last layer. We don't have the last five original MobileNet layers included, but instead we have a new layer, the output layer we created with two output nodes.

You can compare the summary of the new model with the summary of the original MobileNet to verify these differences by calling summary() on both the old and new models. This is also shown in the corresponding video.

Now, we need to choose how many layers we actually want to be trained when we train on cats and dogs.

Here, we are freezing the weights of all the layers except for the last five layers in our new model, meaning that only the last five layers of the model will be trained.

for layer in model.layers[:-5]:
    layer.trainable = False

Because only the last five layers are trained, the weights in all the earlier layers will not be updated during training and will instead keep the ImageNet values from the original MobileNet.
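To see the effect of this kind of freezing without downloading MobileNet, here's a sketch that applies the same loop to a tiny toy model built with the Functional API. The toy architecture, layer sizes, and layer names are all hypothetical stand-ins, not the model from this episode.

```python
import tensorflow as tf

# Toy stand-in for a pretrained network (hypothetical sizes and names).
inputs = tf.keras.Input(shape=(8,))
x = inputs
for i in range(6):
    x = tf.keras.layers.Dense(4, activation='relu', name=f'hidden_{i}')(x)
outputs = tf.keras.layers.Dense(2, activation='softmax', name='output')(x)
toy_model = tf.keras.Model(inputs=inputs, outputs=outputs)

# Freeze every layer except the last five, using the same slice as above.
for layer in toy_model.layers[:-5]:
    layer.trainable = False

# Collect the names of the layers that will still be updated during training.
trainable_names = [layer.name for layer in toy_model.layers if layer.trainable]
print(trainable_names)
print(len(toy_model.trainable_weights))
```

Each trainable Dense layer contributes a kernel and a bias, so five trainable layers leave ten trainable weight tensors. Checking trainable_weights like this on your own model is a handy sanity check before training.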

Note that the number of layers that you choose to retrain is, again, one of those things that varies by situation. Since the original MobileNet model has already generally learned about cats and dogs, we don't really need to retrain many layers.

Our new model is now built, tuned, and ready to be trained on cats and dogs. Make sure you've got your model ready for training, and in the next episode we'll do that together, and we'll also see how the model holds up to predicting on new unseen images from our test set. See ya there!

resources

Pre-requisite videos:

Fine-tuning a Neural Network explained - https://youtu.be/5T-iXNNiwIs
Fine-tune VGG16 Image Classifier with Keras | Part 1: Build - https://youtu.be/oDHpqu52soI
Fine-tune VGG16 Image Classifier with Keras | Part 2: Train - https://youtu.be/INaX55V1zpY
Fine-tune VGG16 Image Classifier with Keras | Part 3: Predict - https://youtu.be/HDom7mAxCdc
Image preparation for CNN Image Classifier with Keras - https://youtu.be/LhEMXbjGV_4
updates

All relevant updates for the content on this page are listed below.

e038463

Update code for tf.keras compatibility

Committed by Mandy on June 7, 2020
