Object detection models not only identify what objects are in an image but also where they are. Why do these models output bounding boxes around objects?
Think about how you would point out an object in a photo to a friend.
Bounding boxes let the model indicate exactly where each object is located by drawing a tight rectangle around it. This localization supports downstream tasks such as counting objects or tracking them across frames.
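As a minimal sketch of why boxes are useful downstream (the labels, pixel coordinates, and the common (x_min, y_min, x_max, y_max) corner convention are illustrative assumptions):

```python
from collections import Counter

# Each detection pairs a class label with a corner-format box, in pixels.
detections = [
    {"label": "dog", "box": (30, 40, 120, 160)},
    {"label": "cat", "box": (150, 60, 210, 140)},
    {"label": "dog", "box": (300, 20, 380, 110)},
]

# With localized boxes available, counting objects per class is a
# simple aggregation over the detections.
counts = Counter(d["label"] for d in detections)
print(counts["dog"])  # 2
print(counts["cat"])  # 1
```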
Among the following model types, which one is specifically designed to both detect and localize objects in images?
Look for the model known for fast detection and localization in one step.
YOLO (You Only Look Once) is a popular object detection model that predicts bounding boxes and class probabilities directly from the full image in a single evaluation, giving fast detection and localization in one step.
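To illustrate the single-pass idea behind YOLO-style detectors, the toy sketch below decodes one invented output tensor: each grid cell predicts a box plus an objectness score, and one sweep over the tensor yields both locations and confidences. The grid size, tensor layout, and values here are assumptions for illustration, not YOLO's exact format:

```python
import numpy as np

# Toy single-pass output: a 2x2 grid, each cell predicting one box as
# (x_center, y_center, w, h, objectness) in normalized coordinates.
S = 2
output = np.zeros((S, S, 5))
output[0, 1] = [0.75, 0.25, 0.3, 0.2, 0.9]  # one confident detection

# Keep boxes whose objectness exceeds a threshold -- a single pass
# over the network output produces all detections at once.
boxes = [
    tuple(output[i, j, :4])
    for i in range(S)
    for j in range(S)
    if output[i, j, 4] > 0.5
]
print(boxes)
```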
Which metric measures the accuracy of predicted bounding boxes compared to the true object locations?
It compares the overlap between predicted and true boxes.
IoU (Intersection over Union) divides the overlap area between the predicted bounding box and the ground-truth box by the area of their union, yielding a score between 0 and 1 that measures localization quality: 1 means a perfect match, 0 means no overlap.
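As a minimal sketch of the IoU computation for boxes in (x_min, y_min, x_max, y_max) corner format (the function name and example boxes are illustrative):

```python
def iou(box_a, box_b):
    """Intersection over Union for corner-format boxes (x_min, y_min, x_max, y_max)."""
    # Corners of the intersection rectangle.
    x_min = max(box_a[0], box_b[0])
    y_min = max(box_a[1], box_b[1])
    x_max = min(box_a[2], box_b[2])
    y_max = min(box_a[3], box_b[3])
    # Clamp at zero so disjoint boxes give zero intersection.
    inter = max(0, x_max - x_min) * max(0, y_max - y_min)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# A box compared with itself overlaps perfectly.
print(iou((0, 0, 10, 10), (0, 0, 10, 10)))  # 1.0
# Half-overlapping boxes of equal size score 50 / 150 = 1/3.
print(iou((0, 0, 10, 10), (5, 0, 15, 10)))
```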
Consider a detection model that predicts bounding boxes but often places them far from the actual objects. Which issue below most likely causes this localization failure?
Think about the quality of the training data.
If bounding box labels are wrong during training, the model learns incorrect locations, causing poor localization during prediction.
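To see why label quality matters, one can simulate noisy box annotations and measure how far they drift from the true box; a model trained on such targets can do no better than the labels it sees. The uniform-jitter noise model and pixel values below are assumptions for illustration:

```python
import random

def jitter(box, max_shift):
    """Shift a corner-format (x_min, y_min, x_max, y_max) box by a random offset."""
    dx = random.uniform(-max_shift, max_shift)
    dy = random.uniform(-max_shift, max_shift)
    return (box[0] + dx, box[1] + dy, box[2] + dx, box[3] + dy)

random.seed(0)
true_box = (50, 50, 150, 150)  # true center at (100, 100)
noisy_labels = [jitter(true_box, max_shift=40) for _ in range(5)]

for label in noisy_labels:
    # Center offset of the noisy label from the true box, in pixels --
    # this offset becomes the error the model is trained to reproduce.
    off_x = (label[0] + label[2]) / 2 - 100
    off_y = (label[1] + label[3]) / 2 - 100
    print(f"label center offset: ({off_x:+.1f}, {off_y:+.1f})")
```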
Given the following Python code that simulates bounding box predictions, what is the printed output?
def predict_boxes(image_shape, predictions):
    height, width = image_shape
    boxes = []
    for (x_center, y_center, w, h) in predictions:
        x_min = int(x_center * width - w * width / 2)
        y_min = int(y_center * height - h * height / 2)
        x_max = int(x_center * width + w * width / 2)
        y_max = int(y_center * height + h * height / 2)
        boxes.append((x_min, y_min, x_max, y_max))
    return boxes

image_shape = (100, 200)
predictions = [(0.5, 0.5, 0.2, 0.4), (0.1, 0.1, 0.1, 0.1)]
print(predict_boxes(image_shape, predictions))
Calculate each coordinate carefully using the formula.
The code converts normalized center coordinates and box sizes (relative to image size) into corner coordinates; note that image_shape = (100, 200) means height 100 and width 200. First box: x_min = int(0.5*200 - 0.2*200/2) = 80, y_min = int(0.5*100 - 0.4*100/2) = 30, x_max = int(0.5*200 + 0.2*200/2) = 120, y_max = int(0.5*100 + 0.4*100/2) = 70. Second box: x_min = int(0.1*200 - 0.1*200/2) = 10, y_min = int(0.1*100 - 0.1*100/2) = 5, x_max = 30, y_max = 15. Thus, the printed output is [(80, 30, 120, 70), (10, 5, 30, 15)].