beginner

What does YOLO stand for in computer vision?

YOLO stands for You Only Look Once. It means the model looks at the image just once to detect objects quickly.

Click to reveal answer

beginner

How does YOLO divide an image for object detection?

YOLO splits the image into a grid of cells. Each cell predicts bounding boxes and class probabilities for objects inside it.

Click to reveal answer

intermediate

What are the main outputs of the YOLO model for each grid cell?

Each grid cell outputs bounding box coordinates, confidence scores (how sure it is about the object), and class probabilities (what object it is).

Click to reveal answer

intermediate

Why is YOLO considered fast compared to other object detectors?

YOLO processes the whole image in one pass using a single neural network, unlike others that scan multiple regions separately. This makes it very fast.

Click to reveal answer

advanced

What is a common challenge YOLO faces with small objects?

YOLO can struggle to detect small objects because each grid cell covers a large area, so small objects might be missed or merged with others.

Click to reveal answer

What is the main idea behind YOLO's approach to object detection?

AUse separate models for each object type

BDetect objects by looking at the image once

CScan the image multiple times for each object

DDetect objects only in the center of the image

How does YOLO predict objects in an image?

ABy dividing the image into a grid and predicting boxes per cell

BBy scanning the image pixel by pixel

CBy using a sliding window over the image

DBy cropping the image into small patches

Which output is NOT part of YOLO's prediction for each grid cell?

AImage resolution

BConfidence score

CClass probabilities

DBounding box coordinates

Why is YOLO faster than many other object detection methods?

AIt uses multiple passes over the image

BIt only detects one object per image

CIt ignores small objects

DIt uses a single neural network to process the whole image once

What is a limitation of YOLO when detecting small objects?

AIt detects small objects too many times

BIt merges small objects into larger ones

CIt struggles because grid cells cover large areas

DIt only detects small objects

Explain how YOLO detects objects in an image and why it is considered fast.

Describe the main outputs YOLO produces for each grid cell and their purpose.