beginner

What is a bounding box in object detection?

A bounding box is a rectangle that tightly encloses an object in an image, defined by coordinates usually as (x_min, y_min, x_max, y_max). It helps locate and identify objects.

Click to reveal answer

beginner

How do you convert bounding boxes from (x, y, width, height) format to (x_min, y_min, x_max, y_max)?

You keep x_min = x, y_min = y, then calculate x_max = x + width - 1 and y_max = y + height - 1.

Click to reveal answer

intermediate

What is Intersection over Union (IoU) in bounding box handling?

IoU measures how much two bounding boxes overlap. It is the area of overlap divided by the area of union of the two boxes. It helps evaluate detection accuracy.

Click to reveal answer

intermediate

Why is Non-Maximum Suppression (NMS) used in bounding box handling?

NMS removes duplicate bounding boxes that detect the same object by keeping only the box with the highest confidence score and discarding others with high overlap.

Click to reveal answer

beginner

How can bounding boxes be normalized for neural network input?

Bounding box coordinates can be divided by the image width and height to scale them between 0 and 1, making them independent of image size.

Click to reveal answer

What does the bounding box format (x_min, y_min, x_max, y_max) represent?

ACenter coordinates only

BCenter coordinates and width/height

CTop-left and bottom-right corners of the box

DWidth and height only

What is the main purpose of Non-Maximum Suppression (NMS)?

ATo remove overlapping boxes for the same object

BTo increase the number of bounding boxes

CTo resize images

DTo normalize bounding box coordinates

How is Intersection over Union (IoU) calculated?

ADifference between box widths

BArea of overlap divided by area of union

CSum of areas of two boxes

DDistance between box centers

Why normalize bounding box coordinates before feeding to a model?

ATo convert coordinates to pixels

BTo remove bounding boxes

CTo increase coordinate values

DTo make coordinates relative to image size

Which PyTorch tensor shape is typical for a batch of bounding boxes?

A[batch_size, num_boxes, 4]

B[4, num_boxes]

C[num_boxes, batch_size]

D[batch_size, 4]

Explain how Non-Maximum Suppression (NMS) works and why it is important in bounding box handling.

Describe the process to calculate Intersection over Union (IoU) between two bounding boxes.