beginner

What is the main purpose of torchvision detection models?

Torchvision detection models are used to find and locate objects within images. They can tell what objects are present and where they are by drawing boxes around them.

Click to reveal answer

beginner

Name two popular torchvision detection models.

Two popular torchvision detection models are Faster R-CNN and SSD (Single Shot MultiBox Detector). Both are used for detecting objects but differ in speed and accuracy.

Click to reveal answer

beginner

How do you load a pretrained Faster R-CNN model from torchvision?

You can load it using: <br><code>import torchvision.models.detection as detection<br>model = detection.fasterrcnn_resnet50_fpn(pretrained=True)</code><br>This loads a model trained on COCO dataset ready to detect common objects.

Click to reveal answer

intermediate

What kind of output do torchvision detection models produce?

They output a list of dictionaries, each dictionary contains:<br>- boxes: coordinates of detected objects<br>- labels: class IDs of objects<br>- scores: confidence scores for each detection

Click to reveal answer

intermediate

Why is it important to set the model to evaluation mode during inference?

Setting the model to evaluation mode (model.eval()) disables training behaviors like dropout and batch normalization updates, ensuring consistent and correct predictions.

Click to reveal answer

Which torchvision detection model is known for balancing speed and accuracy well?

AVGG16

BResNet-50

CAlexNet

DFaster R-CNN

What does the 'boxes' output from a detection model represent?

ACoordinates of detected objects

BClass names of objects

CImage pixel values

DModel weights

How do you prepare a torchvision detection model for inference?

ACall model.train()

BReset model weights

CCall model.eval()

DIncrease learning rate

Which dataset is commonly used to pretrain torchvision detection models?

ACOCO

BImageNet

CMNIST

DCIFAR-10

What does the 'scores' output from a detection model indicate?

ATraining loss

BConfidence level of each detected object

CSize of the image

DNumber of objects detected

Explain how to use a pretrained torchvision detection model to detect objects in a new image.

Describe the structure of the output from torchvision detection models and what each part means.