Practice

(1/5)

1. What is the main purpose of nn.BatchNorm in PyTorch?

easy

A. To normalize the inputs of each mini-batch to stabilize learning

B. To increase the size of the neural network

C. To reduce the number of layers in the model

D. To randomly drop neurons during training

Solution

Step 1: Understand batch normalization role
Batch normalization normalizes inputs of each mini-batch to keep data balanced.
Step 2: Identify the effect on learning
This normalization stabilizes and speeds up training by reducing internal covariate shift.
Final Answer:
To normalize the inputs of each mini-batch to stabilize learning -> Option A
Quick Check:
Batch normalization = normalize mini-batch inputs [OK]

Hint: BatchNorm normalizes batch data to stabilize training [OK]

Common Mistakes:

Thinking BatchNorm increases model size
Confusing BatchNorm with dropout
Believing BatchNorm reduces layers

2. Which of the following is the correct way to create a 1D batch normalization layer for 10 features in PyTorch?

easy

A. nn.BatchNorm2d(10)

B. nn.BatchNorm(10)

C. nn.BatchNorm1d(10)

D. nn.BatchNormLayer(10)

Solution

Step 1: Recall PyTorch BatchNorm classes
PyTorch uses nn.BatchNorm1d for 1D features, nn.BatchNorm2d for images.
Step 2: Match correct syntax
For 10 features in 1D, the correct syntax is nn.BatchNorm1d(10).
Final Answer:
nn.BatchNorm1d(10) -> Option C
Quick Check:
1D batch norm uses nn.BatchNorm1d [OK]

Hint: Use BatchNorm1d for 1D feature vectors [OK]

Common Mistakes:

Using nn.BatchNorm instead of nn.BatchNorm1d
Confusing 1d and 2d batch norm classes
Using non-existent nn.BatchNormLayer

3. Consider the following code snippet:

import torch
import torch.nn as nn

batch_norm = nn.BatchNorm1d(3)
input_tensor = torch.tensor([[1.0, 2.0, 3.0],
                             [4.0, 5.0, 6.0],
                             [7.0, 8.0, 9.0]])
output = batch_norm(input_tensor)
print(output)

What will be the shape of output?

medium

A. [3, 3]

B. [1, 3]

C. [3]

D. [3, 1]

Solution

Step 1: Check input tensor shape
The input tensor has shape (3, 3) - 3 samples, each with 3 features.
Step 2: Understand BatchNorm1d output shape
BatchNorm1d normalizes each feature across the batch but keeps input shape unchanged.
Final Answer:
[3, 3] -> Option A
Quick Check:
BatchNorm1d output shape = input shape [OK]

Hint: BatchNorm1d output shape matches input shape [OK]

Common Mistakes:

Assuming BatchNorm changes tensor shape
Confusing batch size with feature size
Expecting output to be a single vector

4. You wrote this code but get a runtime error:

batch_norm = nn.BatchNorm1d(5)
input_tensor = torch.randn(10, 3)
output = batch_norm(input_tensor)

What is the likely cause of the error?

medium

A. The batch size (10) is too small

B. The input feature size (3) does not match BatchNorm1d's expected size (5)

C. BatchNorm1d cannot process random tensors

D. BatchNorm1d requires input to be 3D tensor

Solution

Step 1: Check BatchNorm1d expected feature size
BatchNorm1d(5) expects input with 5 features per sample.
Step 2: Compare input tensor shape
Input tensor shape is (10, 3), meaning 3 features per sample, which mismatches 5.
Final Answer:
The input feature size (3) does not match BatchNorm1d's expected size (5) -> Option B
Quick Check:
Feature size mismatch causes runtime error [OK]

Hint: BatchNorm feature size must match input feature dimension [OK]

Common Mistakes:

Thinking batch size causes error
Believing BatchNorm needs 3D input always
Assuming random tensors cause errors

5. You want to apply batch normalization to a convolutional layer output with shape (batch_size, 16, 32, 32). Which PyTorch batch normalization layer should you use and why?

hard

A. nn.BatchNorm1d(16), because it normalizes over 1D features

B. nn.BatchNorm(16), because it works for any input shape

C. nn.BatchNorm3d(16), because the input has 4 dimensions

D. nn.BatchNorm2d(16), because it normalizes over 2D feature maps with 16 channels

Solution

Step 1: Analyze input tensor shape
The tensor shape is (batch_size, channels=16, height=32, width=32), typical for images.
Step 2: Choose correct BatchNorm type
For 4D tensors with channels and 2D spatial dimensions, nn.BatchNorm2d is appropriate.
Final Answer:
nn.BatchNorm2d(16), because it normalizes over 2D feature maps with 16 channels -> Option D
Quick Check:
Conv output uses BatchNorm2d with channel count [OK]

Hint: Use BatchNorm2d for conv layers with 2D spatial data [OK]

Common Mistakes:

Using BatchNorm1d for image tensors
Choosing BatchNorm3d incorrectly
Assuming generic BatchNorm works for all shapes

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Initial training with high loss and low accuracy
2	0.9	0.60	Loss decreased, accuracy improved after batch normalization
3	0.7	0.72	Model continues to learn with stable normalization
4	0.5	0.80	Batch normalization helps maintain steady training progress
5	0.4	0.85	Training converges with lower loss and higher accuracy

Batch normalization (nn.BatchNorm) in PyTorch - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand batch normalization role

Step 2: Identify the effect on learning

Final Answer:

Quick Check:

Solution

Step 1: Recall PyTorch BatchNorm classes

Step 2: Match correct syntax

Final Answer:

Quick Check:

Solution

Step 1: Check input tensor shape

Step 2: Understand BatchNorm1d output shape

Final Answer:

Quick Check:

Solution

Step 1: Check BatchNorm1d expected feature size

Step 2: Compare input tensor shape

Final Answer:

Quick Check:

Solution

Step 1: Analyze input tensor shape

Step 2: Choose correct BatchNorm type

Final Answer:

Quick Check: