Practice

(1/5)

1. Why is data augmentation important in training computer vision models?

easy

A. It increases the variety of training images to help the model generalize better.

B. It reduces the size of the training dataset to speed up training.

C. It removes noisy images from the dataset automatically.

D. It guarantees 100% accuracy on the training data.

Solution

Step 1: Understand data augmentation purpose
Data augmentation creates new images by slightly changing existing ones to increase variety.
Step 2: Connect augmentation to model learning
More variety helps the model learn features that work on new, unseen images, improving generalization.
Final Answer:
It increases the variety of training images to help the model generalize better. -> Option A
Quick Check:
Data augmentation = better generalization [OK]

Hint: Think: more image variety means better learning [OK]

Common Mistakes:

Confusing augmentation with data reduction
Believing augmentation removes bad images
Assuming augmentation guarantees perfect accuracy

2. Which of the following is a correct way to apply horizontal flip augmentation using Python's torchvision library?

easy

A. transforms.FlipHorizontal(prob=0.5)

B. transforms.HorizontalFlip(0.5)

C. transforms.RandomHorizontalFlip(p=0.5)

D. transforms.RandomFlipHorizontal()

Solution

Step 1: Recall torchvision syntax for horizontal flip
The correct transform is RandomHorizontalFlip with a probability parameter p.
Step 2: Check each option's correctness
Only transforms.RandomHorizontalFlip(p=0.5) matches the correct syntax and parameter name.
Final Answer:
transforms.RandomHorizontalFlip(p=0.5) -> Option C
Quick Check:
Correct torchvision flip syntax = transforms.RandomHorizontalFlip(p=0.5) [OK]

Hint: Look for 'RandomHorizontalFlip' with parameter p= [OK]

Common Mistakes:

Using wrong class names like HorizontalFlip
Incorrect parameter names like prob instead of p
Missing the probability parameter

3. What will be the output shape of the augmented image after applying the following PyTorch transform?

transform = transforms.Compose([
  transforms.Resize((128, 128)),
  transforms.RandomRotation(30),
  transforms.ToTensor()
])
augmented_image = transform(original_image)

medium

A. [128, 3, 128]

B. [128, 128, 3]

C. [1, 128, 128]

D. [3, 128, 128]

Solution

Step 1: Analyze the transform steps
Resize changes image to 128x128 pixels. RandomRotation keeps size same. ToTensor converts image to tensor with channels first.
Step 2: Determine tensor shape format
PyTorch tensors from images have shape [channels, height, width]. For RGB images, channels=3.
Final Answer:
[3, 128, 128] -> Option D
Quick Check:
PyTorch image tensor shape = [channels, height, width] [OK]

Hint: PyTorch image tensors are channels first: [3, H, W] [OK]

Common Mistakes:

Confusing channel order with height and width
Assuming rotation changes image size
Mixing up tensor shape formats

4. You wrote this augmentation code but get an error:

transform = transforms.Compose([
  transforms.RandomRotation(45),
  transforms.RandomHorizontalFlip(0.3),
  transforms.ToTensor()
])

What is the likely cause?

medium

A. RandomHorizontalFlip expects a keyword argument p, not a positional float.

B. RandomRotation requires integer degrees, not float.

C. ToTensor must come before RandomRotation.

D. Compose cannot combine these transforms.

Solution

Step 1: Check RandomHorizontalFlip usage
RandomHorizontalFlip requires the probability parameter as a keyword argument p=, not a positional argument.
Step 2: Verify other transform usages
RandomRotation accepts float degrees, ToTensor can be last, Compose supports these transforms.
Final Answer:
RandomHorizontalFlip expects a keyword argument p, not a positional float. -> Option A
Quick Check:
RandomHorizontalFlip(p=0.3) correct syntax [OK]

Hint: Check if transform params use correct keywords [OK]

Common Mistakes:

Passing probability as positional argument
Thinking rotation degrees must be integer
Misordering transforms in Compose

5. You have a small dataset of 100 images for a classification task. Which data augmentation strategy will most likely improve your model's ability to recognize objects in new photos?

hard

A. Only resize images to a fixed size without any other changes.

B. Apply random flips, rotations up to 30 degrees, and brightness changes during training.

C. Add Gaussian noise to all images without any geometric transforms.

D. Train without augmentation but increase model layers.

Solution

Step 1: Consider dataset size and augmentation needs
Small datasets benefit from augmentations that create varied views of images to prevent overfitting.
Step 2: Evaluate augmentation types
Random flips, rotations, and brightness changes simulate real-world variations, improving generalization better than noise alone or no augmentation.
Final Answer:
Apply random flips, rotations up to 30 degrees, and brightness changes during training. -> Option B
Quick Check:
Varied augmentations = better generalization on small data [OK]

Hint: Use varied simple transforms for small datasets [OK]

Common Mistakes:

Ignoring augmentation on small datasets
Using only noise without geometric changes
Relying on bigger models instead of data variety

Why Data augmentation importance in Computer Vision? - Purpose & Use Cases

Start learning this pattern below

Practice

Solution

Step 1: Understand data augmentation purpose

Step 2: Connect augmentation to model learning

Final Answer:

Quick Check:

Solution

Step 1: Recall torchvision syntax for horizontal flip

Step 2: Check each option's correctness

Final Answer:

Quick Check:

Solution

Step 1: Analyze the transform steps

Step 2: Determine tensor shape format

Final Answer:

Quick Check:

Solution

Step 1: Check RandomHorizontalFlip usage

Step 2: Verify other transform usages

Final Answer:

Quick Check:

Solution

Step 1: Consider dataset size and augmentation needs

Step 2: Evaluate augmentation types

Final Answer:

Quick Check: