Computer Visionml~12 mins

Why augmentation multiplies training data in Computer Vision - Model Pipeline Impact

Choose your learning style9 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Model Pipeline - Why augmentation multiplies training data

This pipeline shows how image augmentation creates more training data by changing images slightly. This helps the model learn better by seeing many versions of the same image.

Data Flow - 3 Stages

1Original Dataset

1000 images x 64x64 pixels x 3 color channels→Start with original images→1000 images x 64x64 pixels x 3 color channels

Image of a cat, 64x64 pixels, RGB

↓

2Augmentation

1000 images x 64x64 pixels x 3 color channels→Apply random flips, rotations, and brightness changes→5000 images x 64x64 pixels x 3 color channels

Original cat image flipped horizontally, rotated 15 degrees, brightness increased

↓

3Train/Test Split

5000 images x 64x64 pixels x 3 color channels→Split into training and testing sets→4000 training images, 1000 testing images

4000 augmented cat images for training, 1000 augmented images for testing

Training Trace - Epoch by Epoch


Loss
1.2 |**************
1.0 |**********
0.8 |********
0.6 |******
0.4 |****
0.2 |**
0.0 +----------------
     1  3  5  7  10 Epochs

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Model starts learning with high loss and low accuracy
3	0.8	0.65	Loss decreases and accuracy improves as model sees more augmented data
5	0.5	0.80	Model learns better features due to diverse augmented images
7	0.35	0.88	Loss continues to decrease, accuracy approaches high values
10	0.25	0.92	Model converges with low loss and high accuracy

Prediction Trace - 6 Layers

Layer 1: Input Image

Layer 2: Convolutional Layer

Layer 3: Activation (ReLU)

Layer 4: Pooling Layer

Layer 5: Fully Connected Layer

Layer 6: Softmax

Model Quiz - 3 Questions

Test your understanding

Why does augmentation multiply the training data?

ABecause it removes bad images from the dataset

BBecause it compresses images to smaller sizes

CBecause it creates new images by changing originals

DBecause it labels images automatically

Key Insight

Augmentation helps the model see many versions of the same image, making it learn better and generalize well. This increases the training data size without collecting new images.