Challenge - 5 Problems

🎖️

Parallel Data Loading Master

Get all challenges correct to earn this badge!

Test your skills under time pressure!

🧠 Conceptual

intermediate

2:00remaining

Effect of num_workers on DataLoader speed

In PyTorch, setting num_workers in DataLoader controls how many subprocesses load data in parallel. What is the main effect of increasing num_workers from 0 to a positive number?

AIt decreases the batch size automatically to fit more data in memory.

BIt allows data loading to happen in parallel, potentially speeding up training by reducing data wait time.

CIt disables shuffling of data during training.

DIt changes the model architecture to use multiple GPUs.

Attempts:

2 left

❓ Predict Output

intermediate

2:00remaining

Output of DataLoader with num_workers=0

Consider this PyTorch code snippet:

from torch.utils.data import DataLoader, TensorDataset
import torch

data = torch.arange(10)
dataset = TensorDataset(data)
loader = DataLoader(dataset, batch_size=3, num_workers=0)

batches = [batch[0].tolist() for batch in loader]
print(batches)

What is the output?

ARaises RuntimeError due to num_workers=0

B[[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]

C[[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]

D[[0, 1, 2], [3, 4, 5], [6, 7, 8], [9]]

Attempts:

2 left

❓ Hyperparameter

advanced

2:00remaining

Choosing num_workers for best performance

You want to speed up training by setting num_workers in your PyTorch DataLoader. Which of these is the best advice?

ASet <code>num_workers</code> to the number of CPU cores available for maximum parallelism.

BAlways set <code>num_workers</code> to 0 to avoid any multiprocessing overhead.

CSet <code>num_workers</code> to a very high number like 100 to maximize data loading speed.

DSet <code>num_workers</code> to 1 regardless of CPU cores to keep things simple.

Attempts:

2 left

🔧 Debug

advanced

2:00remaining

Error caused by num_workers > 0 on Windows

On Windows, setting num_workers > 0 in DataLoader sometimes causes this error:

RuntimeError: DataLoader worker (pid(s) ...) exited unexpectedly

What is the most common cause?

AThe dataset or transform uses lambda functions or local functions that cannot be pickled.

BThe batch size is too large for the GPU memory.

CThe learning rate is set too high causing instability.

DThe model architecture is incompatible with multiprocessing.

Attempts:

2 left

❓ Metrics

expert

2:00remaining

Impact of num_workers on training throughput

You run two training sessions with identical models and data. Session 1 uses num_workers=0, Session 2 uses num_workers=4. You measure average training throughput (samples/sec) as:

Session 1: 120 samples/sec
Session 2: 180 samples/sec

What is the most accurate explanation for this difference?

AUsing 4 workers disables data shuffling, making training faster.

BUsing 4 workers increases GPU computation speed directly, improving throughput.

CUsing 4 workers allows parallel data loading, reducing data wait time and increasing throughput by 50%.

DUsing 4 workers reduces batch size, causing faster iterations.

Attempts:

2 left