0
0
Computer Visionml~12 mins

Augmentation policy search (AutoAugment) in Computer Vision - Model Pipeline Trace

Choose your learning style9 modes available
Model Pipeline - Augmentation policy search (AutoAugment)

This pipeline automatically finds the best ways to change images to help a model learn better. It tries different image changes, like flipping or color shifts, to improve the model's ability to recognize objects.

Data Flow - 5 Stages
1Raw Image Dataset
50000 images x 32 x 32 x 3Original CIFAR-10 images50000 images x 32 x 32 x 3
Image of a cat with RGB colors
2Augmentation Policy Search
50000 images x 32 x 32 x 3Apply different image transformations like rotate, shear, color adjust with varying strengths to find best policies50000 images x 32 x 32 x 3 (augmented)
Image rotated 15 degrees and brightness increased by 20%
3Train/Test Split
50000 images x 32 x 32 x 3Split dataset into training (45000) and validation (5000)45000 images x 32 x 32 x 3 (train), 5000 images x 32 x 32 x 3 (validation)
Training image of a dog, validation image of a truck
4Model Training
45000 images x 32 x 32 x 3Train convolutional neural network on augmented imagesTrained model weights
CNN learns to recognize objects better with augmented images
5Validation Evaluation
5000 images x 32 x 32 x 3Evaluate model accuracy on validation setAccuracy score (percentage)
Model achieves 88% accuracy on validation images
Training Trace - Epoch by Epoch
Loss
1.2 |****
1.0 |***
0.8 |**
0.6 |**
0.4 |*
0.2 |
    +----------------
     1 5 10 15 20 Epochs
EpochLoss ↓Accuracy ↑Observation
11.200.55Model starts learning with moderate accuracy
50.750.75Loss decreases and accuracy improves as model learns
100.500.85Model shows strong learning with good accuracy
150.400.89Loss continues to decrease, accuracy nearing 90%
200.350.91Model converges with high accuracy and low loss
Prediction Trace - 6 Layers
Layer 1: Input Image
Layer 2: Augmentation Policy Applied
Layer 3: Convolutional Layer
Layer 4: Pooling Layer
Layer 5: Fully Connected Layer
Layer 6: Softmax Activation
Model Quiz - 3 Questions
Test your understanding
What is the main purpose of the augmentation policy search step?
ATo split the dataset into training and validation sets
BTo find the best image changes that help the model learn better
CTo evaluate the model accuracy on test data
DTo reduce the size of images for faster training
Key Insight
AutoAugment helps the model learn better by automatically finding the best image changes. This makes the model more accurate and robust by showing it many useful variations of the images during training.