Practice

(1/5)

1. Which Python library is best known for fast image and video processing tasks?

easy

A. PIL (Pillow)

B. OpenCV

C. torchvision

D. matplotlib

Solution

Step 1: Understand library purposes
OpenCV is designed for fast image and video processing, widely used in computer vision.
Step 2: Compare with other libraries
PIL is mainly for image editing, torchvision is for ML image datasets, matplotlib is for plotting.
Final Answer:
OpenCV -> Option B
Quick Check:
Fast image/video processing = OpenCV [OK]

Hint: OpenCV = fast image/video tasks, PIL = editing, torchvision = ML prep [OK]

Common Mistakes:

Confusing PIL as the fastest for video processing
Thinking torchvision handles video processing
Assuming matplotlib is for image processing

2. Which of the following is the correct way to read an image using OpenCV in Python?

easy

A. img = cv2.imread('image.jpg')

B. img = Image.open('image.jpg')

C. img = torchvision.io.read_image('image.jpg')

D. img = plt.imread('image.jpg')

Solution

Step 1: Identify OpenCV image reading syntax
OpenCV uses cv2.imread() to load images from files.
Step 2: Differentiate from other libraries
PIL uses Image.open(), torchvision uses torchvision.io.read_image(), matplotlib uses plt.imread().
Final Answer:
img = cv2.imread('image.jpg') -> Option A
Quick Check:
OpenCV image read = cv2.imread() [OK]

Hint: OpenCV reads images with cv2.imread() [OK]

Common Mistakes:

Using Image.open() which is from PIL, not OpenCV
Using plt.imread() which is for plotting, not OpenCV
Confusing torchvision's read_image with OpenCV

3. What will be the shape and color format of the image loaded by this OpenCV code?

import cv2
img = cv2.imread('image.jpg')
print(img.shape)

medium

A. (height, width, 3) with RGB color order

B. (width, height, 3) with BGR color order

C. (width, height, 3) with RGB color order

D. (height, width, 3) with BGR color order

Solution

Step 1: Understand OpenCV image shape
OpenCV loads images as NumPy arrays with shape (height, width, channels).
Step 2: Know OpenCV color format
OpenCV uses BGR color order by default, not RGB.
Final Answer:
(height, width, 3) with BGR color order -> Option D
Quick Check:
OpenCV shape = (H, W, 3), color = BGR [OK]

Hint: OpenCV images: shape (H,W,3), color BGR [OK]

Common Mistakes:

Assuming RGB color order instead of BGR
Swapping width and height in shape
Thinking OpenCV loads grayscale by default

4. This code tries to convert a PIL image to a NumPy array for OpenCV processing but causes an error:

from PIL import Image
import numpy as np
img_pil = Image.open('image.jpg')
img_cv = np.array(img_pil)

What is the likely cause and fix?

medium

A. PIL image must be converted to grayscale first

B. Color channels are in wrong order; convert RGB to BGR after np.array()

C. Use img_pil.convert('RGB') before np.array() to ensure 3 channels

D. No error; code works fine as is

Solution

Step 1: Identify PIL image mode issue
PIL images may not be in RGB mode by default; could be 'P' or 'L' mode causing np.array to have unexpected shape.
Step 2: Fix by converting to RGB mode
Use img_pil.convert('RGB') to ensure 3 color channels before converting to NumPy array.
Final Answer:
Use img_pil.convert('RGB') before np.array() to ensure 3 channels -> Option C
Quick Check:
PIL to NumPy needs RGB mode [OK]

Hint: Convert PIL image to RGB before np.array() [OK]

Common Mistakes:

Assuming np.array always works without convert()
Ignoring color channel order differences
Trying to convert to grayscale unnecessarily

5. You want to prepare an image for a PyTorch model using torchvision transforms. Which sequence correctly converts a PIL image to a tensor normalized for pretrained models?

hard

A. Use torchvision.transforms.ToTensor() then torchvision.transforms.Normalize(mean, std)

B. Use cv2.imread() then convert to tensor manually

C. Use PIL.Image.open() then convert to NumPy array and normalize manually

D. Use torchvision.transforms.Normalize() only on PIL image

Solution

Step 1: Understand torchvision transform pipeline
To prepare images for PyTorch models, convert PIL image to tensor with ToTensor(), which scales pixels to [0,1].
Step 2: Normalize tensor with mean and std
Use Normalize() with pretrained model's mean and std to standardize input.
Final Answer:
Use torchvision.transforms.ToTensor() then torchvision.transforms.Normalize(mean, std) -> Option A
Quick Check:
ToTensor + Normalize = correct PyTorch prep [OK]

Hint: Use ToTensor() then Normalize() for PyTorch image prep [OK]

Common Mistakes:

Trying to normalize PIL images directly
Using OpenCV images without conversion
Skipping normalization step

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Model starts learning with moderate loss and low accuracy
2	0.9	0.60	Loss decreases and accuracy improves as model learns features
3	0.7	0.72	Model continues to improve with better predictions
4	0.5	0.80	Loss drops further and accuracy reaches good level
5	0.4	0.85	Training converges with low loss and high accuracy

Python CV ecosystem (OpenCV, PIL, torchvision) in Computer Vision - Model Pipeline Trace

Start learning this pattern below

Practice

Solution

Step 1: Understand library purposes

Step 2: Compare with other libraries

Final Answer:

Quick Check:

Solution

Step 1: Identify OpenCV image reading syntax

Step 2: Differentiate from other libraries

Final Answer:

Quick Check:

Solution

Step 1: Understand OpenCV image shape

Step 2: Know OpenCV color format

Final Answer:

Quick Check:

Solution

Step 1: Identify PIL image mode issue

Step 2: Fix by converting to RGB mode

Final Answer:

Quick Check:

Solution

Step 1: Understand torchvision transform pipeline

Step 2: Normalize tensor with mean and std

Final Answer:

Quick Check: