Object detection models output bounding boxes around objects. Why is this localization important?
Think about how knowing where an object is helps in real life, like finding your keys on a table.
Bounding boxes tell you where each object is located and how big it is. This localization supports downstream tasks like counting objects or tracking them across frames.
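As a minimal sketch of how boxes enable counting and sizing, assuming the common (x_min, y_min, x_max, y_max) pixel format (the box values here are made up for illustration):

```python
import torch

# Hypothetical detections in (x_min, y_min, x_max, y_max) pixel format.
boxes = torch.tensor([[10.0, 20.0, 50.0, 80.0],
                      [60.0, 30.0, 90.0, 70.0]])

num_objects = boxes.shape[0]            # counting objects
widths = boxes[:, 2] - boxes[:, 0]      # how big each object is
heights = boxes[:, 3] - boxes[:, 1]
areas = widths * heights

print(num_objects)  # 2
print(areas)        # tensor([2400., 1200.])
```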
Choose the model architecture that outputs both object classes and bounding box coordinates.
Look for a model known for fast detection and localization in images.
YOLO predicts bounding boxes and class probabilities in one pass, making it efficient for detection and localization.
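To make "one pass" concrete, here is a toy illustration of a YOLO-style output tensor: for an S x S grid, each cell predicts B boxes (4 coordinates plus 1 objectness score each) and C class probabilities, all produced by a single forward pass. The values S=7, B=2, C=20 follow the original YOLO paper; the tensor itself is random, not a trained model's output.

```python
import torch

# Toy YOLO-style head output: each of the S x S grid cells predicts
# B boxes (4 coords + 1 objectness each) plus C class probabilities.
S, B, C = 7, 2, 20
head_output = torch.randn(1, S, S, B * 5 + C)

print(head_output.shape)  # torch.Size([1, 7, 7, 30])
```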
Given a batch of 4 images, each with 3 predicted bounding boxes, and each box represented by 4 coordinates, what is the shape of the output tensor?
import torch

batch_size = 4
num_boxes = 3
coords_per_box = 4
output = torch.randn(batch_size, num_boxes, coords_per_box)
print(output.shape)  # torch.Size([4, 3, 4])
Think about batch size first, then number of boxes, then coordinates per box.
The output tensor shape is (batch_size, number_of_boxes, coordinates_per_box), so (4, 3, 4) here.
In object detection, which metric quantifies the overlap between predicted and true bounding boxes?
It compares the area of overlap to the area of union between two boxes.
IoU measures the ratio of the overlapping area to the combined area of predicted and true boxes, indicating localization quality.
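The ratio can be computed directly from box coordinates. This sketch assumes (x_min, y_min, x_max, y_max) format; the example boxes are arbitrary:

```python
import torch

def iou(box_a, box_b):
    """IoU for boxes in (x_min, y_min, x_max, y_max) format."""
    # Corners of the intersection rectangle.
    x1 = torch.max(box_a[0], box_b[0])
    y1 = torch.max(box_a[1], box_b[1])
    x2 = torch.min(box_a[2], box_b[2])
    y2 = torch.min(box_a[3], box_b[3])
    # clamp(min=0) handles non-overlapping boxes.
    inter = (x2 - x1).clamp(min=0) * (y2 - y1).clamp(min=0)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union

pred = torch.tensor([0.0, 0.0, 2.0, 2.0])
gt = torch.tensor([1.0, 1.0, 3.0, 3.0])
print(iou(pred, gt).item())  # 1/7 ≈ 0.1429
```

For batched boxes, torchvision provides `torchvision.ops.box_iou`, which implements the same computation pairwise.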
Given this PyTorch snippet, why might the model fail to localize objects properly?
import torch
pred_boxes = torch.tensor([[0.1, 0.2, 0.3, 0.4],
[0.5, 0.6, 0.7, 0.8]])
gt_boxes = torch.tensor([[0.15, 0.25, 0.35, 0.45],
[0.55, 0.65, 0.75, 0.85]])
loss = torch.nn.functional.mse_loss(pred_boxes, gt_boxes)
print(loss.item())

What is a likely reason the model's localization is poor despite low loss?
Think about how loss functions treat bounding box coordinates and what they miss.
MSE loss treats each coordinate independently and does not consider box overlap or scale, which can lead to poor localization despite low loss values.
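This scale-blindness can be shown numerically: shifting a tiny box and a large box by the same amount yields identical MSE, yet very different overlap quality. The boxes below are made up for illustration; the `iou` helper is the standard intersection-over-union computation:

```python
import torch
import torch.nn.functional as F

def iou(a, b):
    """IoU for boxes in (x_min, y_min, x_max, y_max) format."""
    x1, y1 = torch.max(a[0], b[0]), torch.max(a[1], b[1])
    x2, y2 = torch.min(a[2], b[2]), torch.min(a[3], b[3])
    inter = (x2 - x1).clamp(min=0) * (y2 - y1).clamp(min=0)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return (inter / union).item()

shift = 0.05  # same per-coordinate error for both boxes
small_gt = torch.tensor([0.0, 0.0, 0.1, 0.1])
large_gt = torch.tensor([0.0, 0.0, 0.9, 0.9])
small_pred = small_gt + shift
large_pred = large_gt + shift

# Identical MSE for both predictions...
print(F.mse_loss(small_pred, small_gt).item())  # 0.0025
print(F.mse_loss(large_pred, large_gt).item())  # 0.0025
# ...but very different localization quality.
print(iou(small_pred, small_gt))  # ≈ 0.14
print(iou(large_pred, large_gt))  # ≈ 0.81
```

This is one motivation for IoU-based losses such as GIoU, which optimize overlap directly rather than per-coordinate error.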