TensorFlowml~12 mins

Pre-trained models (VGG, ResNet, MobileNet) in TensorFlow - Model Pipeline Trace

Choose your learning style9 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Model Pipeline - Pre-trained models (VGG, ResNet, MobileNet)

This pipeline uses pre-trained models like VGG, ResNet, and MobileNet to recognize images. These models have already learned from large datasets, so we can use them to quickly identify new images without training from scratch.

Data Flow - 5 Stages

1Input Image

1 image x 224 x 224 x 3→Load and resize image to 224x224 pixels with 3 color channels (RGB)→1 image x 224 x 224 x 3

A photo of a cat resized to 224x224 pixels

↓

2Preprocessing

1 image x 224 x 224 x 3→Normalize pixel values and apply model-specific preprocessing (e.g., mean subtraction for VGG)→1 image x 224 x 224 x 3

Pixel values scaled and adjusted for VGG model input

↓

3Feature Extraction

1 image x 224 x 224 x 3→Pass image through pre-trained model layers (VGG, ResNet, or MobileNet) to extract features→1 vector x 1024 (example for MobileNetV2)

Feature vector representing image content

↓

4Classification Layer

1 vector x 1024→Apply final dense layer with softmax to predict class probabilities→1 vector x 1000 (ImageNet classes)

Probabilities for 1000 object categories

↓

5Prediction Output

1 vector x 1000→Select class with highest probability as predicted label→1 label

"tabby cat" with 0.85 probability

Training Trace - Epoch by Epoch


Loss: 0.45 |*****
Loss: 0.30 |****
Loss: 0.22 |***
Loss: 0.18 |**
Loss: 0.15 |*

Epoch	Loss ↓	Accuracy ↑	Observation
1	0.45	0.78	Initial fine-tuning starts with moderate loss and accuracy
2	0.30	0.85	Loss decreases and accuracy improves as model learns
3	0.22	0.90	Model converges with good accuracy and low loss
4	0.18	0.92	Further fine-tuning improves performance slightly
5	0.15	0.93	Training stabilizes with minimal loss and high accuracy

Prediction Trace - 4 Layers

Layer 1: Input Layer

Layer 2: Pre-trained Model Layers (e.g., MobileNet)

Layer 3: Dense Classification Layer with Softmax

Layer 4: Output Prediction

Model Quiz - 3 Questions

Test your understanding

What is the main advantage of using pre-trained models like VGG or ResNet?

AThey save time by reusing learned features from large datasets

BThey require training from scratch for every new task

CThey only work with black and white images

DThey do not need any preprocessing of input images

Key Insight

Pre-trained models allow us to use powerful image recognition without training from zero. They extract meaningful features from images, and fine-tuning them on new data improves accuracy efficiently.