YOLO is a popular model in computer vision. What is its main goal?
Think about what YOLO does differently compared to traditional object detectors.
YOLO stands for 'You Only Look Once'. It predicts bounding boxes and class probabilities in a single forward pass, making it fast and efficient for object detection.
The YOLO architecture includes several key components. Which of the following is NOT part of it?
YOLO does not use a separate step to propose regions.
Unlike some detectors, YOLO does not use a region proposal network. It predicts bounding boxes directly from the image divided into a grid.
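A minimal sketch of how one grid-cell prediction can be decoded into absolute image coordinates, assuming the YOLOv1 convention that (x, y) are offsets within the cell and (w, h) are fractions of the full image; the 448x448 input resolution matches YOLOv1, but the sample cell and box values are illustrative:

```python
grid_size = 7
img_w, img_h = 448, 448                       # YOLOv1 input resolution
cell_w, cell_h = img_w / grid_size, img_h / grid_size

# Illustrative prediction: (x, y) are offsets within the cell in [0, 1],
# (w, h) are fractions of the whole image.
i, j = 3, 2                                   # cell row, cell column
x, y, w, h = 0.5, 0.5, 0.25, 0.4

center_x = (j + x) * cell_w                   # column offset -> absolute x
center_y = (i + y) * cell_h                   # row offset -> absolute y
box_w = w * img_w
box_h = h * img_h
print(center_x, center_y, box_w, box_h)       # 160.0 224.0 112.0 179.2
```

Because each cell only predicts offsets within its own patch, the network can output all boxes at once with no separate region-proposal stage.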
YOLO divides the image into a 7x7 grid. Each grid cell predicts 2 bounding boxes and class probabilities for 20 classes. What is the shape of the output tensor?
```python
grid_size = 7
boxes_per_cell = 2
num_classes = 20
output_shape = (grid_size, grid_size, boxes_per_cell * 5 + num_classes)
print(output_shape)  # (7, 7, 30)
```
Each box predicts 5 values: 4 for coordinates and 1 for confidence.
Each grid cell predicts 2 boxes, each with 5 values (x, y, w, h, confidence), plus 20 class probabilities. The total per cell is 2*5 + 20 = 30, so the output tensor has shape (7, 7, 30).
YOLO detects objects with bounding boxes and class labels. Which metric best measures its detection quality?
Think about a metric that considers both localization and classification.
Mean Average Precision (mAP) measures how well the model detects and classifies objects, considering both bounding box overlap and class correctness.
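mAP is built on Intersection over Union (IoU) between predicted and ground-truth boxes: a detection counts as correct only if its IoU with a ground-truth box exceeds a threshold and the class matches. A minimal IoU sketch, assuming corner-format (x1, y1, x2, y2) boxes for illustration:

```python
def iou(box_a, box_b):
    """Intersection over Union for boxes given as (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    xa = max(box_a[0], box_b[0])
    ya = max(box_a[1], box_b[1])
    xb = min(box_a[2], box_b[2])
    yb = min(box_a[3], box_b[3])
    # Clamp to zero so non-overlapping boxes give zero intersection.
    inter = max(0.0, xb - xa) * max(0.0, yb - ya)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

print(iou((0, 0, 2, 2), (1, 1, 3, 3)))  # 1 unit of overlap / 7 units of union
```

With per-class precision-recall curves computed at a chosen IoU threshold, averaging precision over recall levels and then over classes yields mAP.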
Consider this Python code snippet that processes YOLO output tensor. What error will it raise?
```python
import numpy as np

output = np.zeros((7, 7, 30))
for i in range(7):
    for j in range(7):
        boxes = output[i, j, :10].reshape(2, 5)
        scores = output[i, j, 10:30]
        max_score_index = np.argmax(scores)
        print(f"Max score index: {max_score_index}")
```
Check the shapes and slicing carefully.
The slice output[i, j, :10] has 10 elements, so reshaping to (2, 5) is valid. The slice output[i, j, 10:30] has 20 elements, matching the 20 classes. The code runs without raising any error.
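In the full YOLOv1 pipeline, each box's confidence is multiplied by the cell's class probabilities to get class-specific scores before non-max suppression. A minimal sketch of that step on a toy random tensor, using the same layout as the snippet above (boxes in the first 10 channels, class probabilities in the last 20); the tensor values here are random stand-ins, not real predictions:

```python
import numpy as np

rng = np.random.default_rng(0)
output = rng.random((7, 7, 30))  # toy stand-in for a real YOLO prediction

best_score = 0.0
best_loc = None  # (cell_i, cell_j, box_index, class_index)
for i in range(7):
    for j in range(7):
        boxes = output[i, j, :10].reshape(2, 5)        # (x, y, w, h, conf) per box
        class_probs = output[i, j, 10:30]              # 20 class probabilities
        # Class-specific score: box confidence times each class probability.
        scores = boxes[:, 4:5] * class_probs[None, :]  # broadcast to shape (2, 20)
        b, c = np.unravel_index(np.argmax(scores), scores.shape)
        if scores[b, c] > best_score:
            best_score = float(scores[b, c])
            best_loc = (i, j, int(b), int(c))

print(f"Best score {best_score:.3f} at cell {best_loc[:2]}, "
      f"box {best_loc[2]}, class {best_loc[3]}")
```

The (2, 1) confidence column broadcasts against the (1, 20) class row to give a (2, 20) score matrix per cell, one score for each box/class pair.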