0
0
Computer Visionml~12 mins

Why augmentation multiplies training data in Computer Vision - Model Pipeline Impact

Choose your learning style9 modes available
Model Pipeline - Why augmentation multiplies training data

This pipeline shows how image augmentation creates more training data by changing images slightly. This helps the model learn better by seeing many versions of the same image.

Data Flow - 3 Stages
1Original Dataset
1000 images x 64x64 pixels x 3 color channelsStart with original images1000 images x 64x64 pixels x 3 color channels
Image of a cat, 64x64 pixels, RGB
2Augmentation
1000 images x 64x64 pixels x 3 color channelsApply random flips, rotations, and brightness changes5000 images x 64x64 pixels x 3 color channels
Original cat image flipped horizontally, rotated 15 degrees, brightness increased
3Train/Test Split
5000 images x 64x64 pixels x 3 color channelsSplit into training and testing sets4000 training images, 1000 testing images
4000 augmented cat images for training, 1000 augmented images for testing
Training Trace - Epoch by Epoch

Loss
1.2 |**************
1.0 |**********
0.8 |********
0.6 |******
0.4 |****
0.2 |**
0.0 +----------------
     1  3  5  7  10 Epochs
EpochLoss ↓Accuracy ↑Observation
11.20.45Model starts learning with high loss and low accuracy
30.80.65Loss decreases and accuracy improves as model sees more augmented data
50.50.80Model learns better features due to diverse augmented images
70.350.88Loss continues to decrease, accuracy approaches high values
100.250.92Model converges with low loss and high accuracy
Prediction Trace - 6 Layers
Layer 1: Input Image
Layer 2: Convolutional Layer
Layer 3: Activation (ReLU)
Layer 4: Pooling Layer
Layer 5: Fully Connected Layer
Layer 6: Softmax
Model Quiz - 3 Questions
Test your understanding
Why does augmentation multiply the training data?
ABecause it removes bad images from the dataset
BBecause it compresses images to smaller sizes
CBecause it creates new images by changing originals
DBecause it labels images automatically
Key Insight
Augmentation helps the model see many versions of the same image, making it learn better and generalize well. This increases the training data size without collecting new images.