Python CV ecosystem (OpenCV, PIL, torchvision) in Computer Vision - Practice Problems & Coding Challenges

Challenge - 5 Problems

❓ Predict Output

intermediate

What is the output of this OpenCV image shape code?

Given an image loaded with OpenCV, what will be the output of the shape attribute?

import cv2
img = cv2.imread('sample.jpg')
print(img.shape)

A. (height, width, channels) tuple representing image dimensions

B. (width, height, channels) tuple representing image dimensions

C. A single integer representing total pixels

D. Raises an error because the shape attribute does not exist

❓ Model Choice

intermediate

Which torchvision model is best for image classification on ImageNet?

You want to use a pretrained model from torchvision for classifying images into 1000 classes. Which model is designed specifically for this?

A. torchvision.models.segmentation.fcn_resnet50(pretrained=True)

B. torchvision.models.resnet50(pretrained=True)

C. torchvision.models.detection.fasterrcnn_resnet50_fpn(pretrained=True)

D. torchvision.models.video.r3d_18(pretrained=True)

❓ Metrics

advanced

How do you correctly compute accuracy for a torchvision classification model?

You have model outputs as logits and true labels as integers. Which code snippet correctly computes accuracy?

A.
preds = outputs.argmax(dim=1)
accuracy = (preds == labels).sum().item()

B.
preds = outputs.max(dim=0)
accuracy = (preds == labels).float().mean().item()

C.
preds = outputs.argmax(dim=1)
accuracy = (preds == labels).float().mean().item()

D.
preds = outputs.argmax(dim=1)
accuracy = (preds == labels).float().sum().item() / len(labels)

🔧 Debug

advanced

Why does this PIL image conversion code raise an error?

from PIL import Image
img = Image.open('photo.png')
img = img.convert('HSV')

A. No error, image converts successfully

B. FileNotFoundError because 'photo.png' does not exist

C. ValueError because 'HSV' is not a supported mode in PIL

D. TypeError because convert expects an integer

🧠 Conceptual

expert

What is the main difference between OpenCV and PIL in image processing?

Choose the statement that best describes a key difference between OpenCV and PIL libraries.

A. PIL is designed for real-time computer vision; OpenCV is not

B. PIL supports video processing; OpenCV does not

C. OpenCV cannot read PNG images; PIL can

D. OpenCV uses BGR color order by default; PIL uses RGB

Practice

(1/5)

1. Which Python library is best known for fast image and video processing tasks?

easy

A. PIL (Pillow)

B. OpenCV

C. torchvision

D. matplotlib

Solution

Step 1: Understand library purposes

Step 2: Compare with other libraries

Final Answer:

Quick Check:

Solution

Step 1: Identify OpenCV image reading syntax

Step 2: Differentiate from other libraries

Final Answer:

Quick Check:

Solution

Step 1: Understand OpenCV image shape

Step 2: Know OpenCV color format
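The two steps above can be sketched without any image file. The example below is only an illustration: it uses a plain NumPy array as a stand-in for what cv2.imread would return (OpenCV images are NumPy arrays), and the 640x480 size and variable names are assumptions made for this sketch.

```python
import numpy as np

# Stand-in for the array cv2.imread would return for a 640x480 colour image.
# (With a real file, cv2.imread returns None if the image cannot be read,
# so real code should check for that before touching .shape.)
height, width = 480, 640
img_bgr = np.zeros((height, width, 3), dtype=np.uint8)
img_bgr[..., 0] = 255  # channel 0 is blue in OpenCV's BGR order

# Rows (height) come first, then columns (width), then channels.
print(img_bgr.shape)  # (480, 640, 3)

# Reversing the last axis reorders the channels from BGR to RGB.
img_rgb = img_bgr[..., ::-1]
print(img_rgb[0, 0].tolist())  # [0, 0, 255]
```

With OpenCV itself, the same channel reordering is usually written as cv2.cvtColor(img, cv2.COLOR_BGR2RGB) before handing the image to PIL or matplotlib.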

Final Answer:

Quick Check:

Solution

Step 1: Identify PIL image mode issue

Step 2: Fix by converting to RGB mode
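The practice problem this solution belongs to is not shown here, so the scenario below is an assumption: an image whose mode is not RGB (here RGBA) is converted to RGB before further processing. A small in-memory image stands in for a file opened with Image.open.

```python
from PIL import Image

# Hypothetical in-memory image standing in for Image.open('photo.png').
img = Image.new("RGBA", (4, 4), (255, 0, 0, 128))
print(img.mode)  # RGBA

# convert("RGB") returns a new image and drops the alpha channel.
rgb = img.convert("RGB")
print(rgb.mode)              # RGB
print(rgb.getpixel((0, 0)))  # (255, 0, 0)
```

Checking img.mode first is a cheap way to see which conversion, if any, is needed.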

Final Answer:

Quick Check:

Solution

Step 1: Understand torchvision transform pipeline

Step 2: Normalize tensor with mean and std
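As a rough sketch of what these two steps compute, the example below reproduces the arithmetic in NumPy rather than torchvision: scale an (H, W, C) uint8 image to a (C, H, W) float array in [0, 1], then apply (x - mean) / std per channel. The helper names and the mean and std values of 0.5 are assumptions chosen for illustration.

```python
import numpy as np

def to_chw_float(img_hwc_uint8):
    """Scale an (H, W, C) uint8 image to a (C, H, W) float32 array in [0, 1]."""
    return img_hwc_uint8.astype(np.float32).transpose(2, 0, 1) / 255.0

def normalize(chw, mean, std):
    """Apply (x - mean) / std independently to each channel."""
    mean = np.asarray(mean, dtype=np.float32).reshape(-1, 1, 1)
    std = np.asarray(std, dtype=np.float32).reshape(-1, 1, 1)
    return (chw - mean) / std

# Hypothetical 2x2 all-white image.
img = np.full((2, 2, 3), 255, dtype=np.uint8)

x = to_chw_float(img)
y = normalize(x, mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5])

print(x.shape)  # (3, 2, 2)
print(float(y.min()), float(y.max()))  # 1.0 1.0
```

In torchvision the same pipeline is typically written as transforms.Compose([transforms.ToTensor(), transforms.Normalize(mean, std)]). The order matters because Normalize operates on a tensor, not on a PIL image.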

Final Answer:

Quick Check: