YOLO (You Only Look Once) is a popular object detection model. What is its main advantage compared to older methods like sliding window or region proposal based detectors?
Think about how YOLO treats the image differently from methods that look at many small parts separately.
YOLO divides the image into a grid and predicts bounding boxes and class probabilities for all grid cells in one forward pass, making it very fast compared to methods that scan many regions separately.
YOLO divides the image into an SxS grid. Each grid cell predicts B bounding boxes and C class probabilities. Given S=7, B=2, and C=20, what is the shape of the output tensor from the final layer?
Each bounding box predicts 5 values (x, y, w, h, confidence) plus class probabilities per cell.
Each cell predicts B=2 boxes, each with 5 values (x, y, w, h, confidence), giving 2*5 = 10 values, plus C=20 class probabilities, for a total of 30 values per cell. The output shape is therefore [7, 7, 30].
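The shape arithmetic above can be sketched as a tiny helper (a hypothetical function name, not part of any YOLO implementation):

```python
def yolo_output_shape(S, B, C):
    # Each cell predicts B boxes * 5 values (x, y, w, h, confidence)
    # plus C class probabilities shared by the whole cell.
    return (S, S, B * 5 + C)

print(yolo_output_shape(7, 2, 20))  # (7, 7, 30)
```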
YOLO's final layer outputs a tensor with shape [batch_size, S, S, B*5 + C]. Which PyTorch layer is most appropriate to produce this output from a feature map?
The layer should keep spatial dimensions but change channel depth to prediction size.
A 1x1 convolution changes the number of channels per spatial location without changing height and width, perfect for predicting bounding boxes and classes per grid cell.
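A 1x1 convolution is just a per-location linear map over channels, so it can be sketched with a plain matrix multiply. The shapes below (a 1024-channel feature map on a 7x7 grid) are illustrative assumptions, not YOLO's exact architecture; in PyTorch the equivalent layer would be `nn.Conv2d(1024, 30, kernel_size=1)`.

```python
import numpy as np

S, C_in, C_out = 7, 1024, 2 * 5 + 20    # B=2 boxes, C=20 classes -> 30 channels

feat = np.random.randn(C_in, S, S)       # feature map [channels, H, W]
weight = np.random.randn(C_out, C_in)    # 1x1 kernel: one mixing vector per output channel
bias = np.zeros(C_out)

# The same channel-mixing matrix is applied at every (H, W) location,
# changing depth from 1024 to 30 while leaving the 7x7 grid untouched.
out = np.einsum('oc,chw->ohw', weight, feat) + bias[:, None, None]
print(out.shape)  # (30, 7, 7)
```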
YOLO predicts bounding boxes and class labels. Which metric is most appropriate to evaluate its detection accuracy?
Consider a metric that accounts for both localization and classification correctness.
mAP (mean Average Precision) measures how well predicted boxes match ground-truth boxes (typically via an IoU threshold) with the correct class labels, combining localization and classification performance in a single score.
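The localization half of this metric rests on Intersection-over-Union. A minimal sketch, assuming boxes given as (x1, y1, x2, y2) corners:

```python
def iou(box_a, box_b):
    """Intersection-over-Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    # Overlap rectangle: max of the top-left corners, min of the bottom-right.
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

print(iou((0, 0, 2, 2), (1, 1, 3, 3)))  # 1/7 ~= 0.1429
```

mAP then averages precision over recall levels per class (counting a prediction as correct when its IoU with a ground-truth box exceeds the threshold) and averages those values across classes.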
YOLO divides the image into a grid and predicts boxes per cell. Why might it struggle to detect very small objects?
Think about how grid size affects the ability to localize small objects.
Because each grid cell predicts only a fixed number of boxes (and, in the original YOLO, a single class distribution), several small objects whose centers fall in the same cell must compete for those few predictions; the coarse grid also limits localization precision, so small objects can be missed or absorbed by larger objects in the same cell.
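This crowding effect is easy to see by mapping object centers to grid cells. A small sketch, assuming YOLOv1's 448-pixel input and S=7 (so each cell covers 64x64 pixels); `grid_cell` is a hypothetical helper, not a real API:

```python
def grid_cell(cx, cy, S=7, img_size=448):
    """Return the (row, col) grid cell that a box center (in pixels) falls into."""
    col = min(int(cx / img_size * S), S - 1)
    row = min(int(cy / img_size * S), S - 1)
    return row, col

# Two small objects whose centers are only 10 px apart land in the same
# 64x64 cell, so they must share that one cell's B box predictions.
print(grid_cell(100, 100))  # (1, 1)
print(grid_cell(110, 100))  # (1, 1)
```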