Practice

1. What is the main purpose of feature extraction in computer vision?

easy

A. To increase the size of image files

B. To change image colors randomly

C. To convert images into numbers that describe important parts

D. To delete parts of the image

Solution

Step 1: Understand feature extraction goal
Feature extraction transforms an image into numerical data (feature vectors) that represent its key details, such as edges, corners, and textures.
Step 2: Compare options to this goal
Only option C, "To convert images into numbers that describe important parts", describes this process; the other options describe unrelated actions.
Final Answer:
To convert images into numbers that describe important parts -> Option C
Quick Check:
Feature extraction = convert images to numbers [OK]

Hint: Feature extraction means turning images into numbers [OK]

Common Mistakes:

Thinking feature extraction changes image colors
Confusing feature extraction with image resizing
Believing it deletes image parts
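
To make "turning images into numbers" concrete, here is a minimal sketch that uses only the Python standard library. It computes a normalized intensity histogram, one of the simplest possible feature vectors. The function name `intensity_histogram`, the bin count, and the 3x3 toy image are assumptions made for illustration; they do not come from any particular library.

```python
def intensity_histogram(image, bins=4, max_value=256):
    """Return a normalized histogram of pixel intensities as a feature vector.

    `image` is a list of rows, each row a list of intensities in [0, max_value).
    """
    counts = [0] * bins
    bin_width = max_value / bins
    total = 0
    for row in image:
        for pixel in row:
            # Map the intensity to a bin, clamping to the last bin for safety.
            index = min(int(pixel / bin_width), bins - 1)
            counts[index] += 1
            total += 1
    if total == 0:
        raise ValueError("image contains no pixels")
    # Normalize so the vector does not depend on the image size.
    return [count / total for count in counts]


# Hypothetical 3x3 grayscale image with values in 0..255.
toy_image = [
    [0, 10, 200],
    [64, 128, 255],
    [30, 130, 190],
]
features = intensity_histogram(toy_image)
print(features)
```

The image goes in as a grid of pixels and comes out as four numbers that summarize how dark or bright it is, which is the sense of "feature extraction" used in this question.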

2. Which of the following is a correct way to describe SIFT in feature extraction?

easy

A. A way to convert images to grayscale

B. A method that detects and describes local features in images

C. A technique to increase image resolution

D. A method to compress image files

Solution

Step 1: Recall what SIFT does
SIFT finds and describes important local features in images for matching and recognition.
Step 2: Match options to SIFT's function
Only option B, "A method that detects and describes local features in images", describes SIFT; the other options describe unrelated image operations.
Final Answer:
A method that detects and describes local features in images -> Option B
Quick Check:
SIFT = local feature detection [OK]

Hint: SIFT finds key points and describes them [OK]

Common Mistakes:

Confusing SIFT with image resizing
Thinking SIFT changes image colors
Believing SIFT compresses images
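
The solution above says SIFT features are used for matching. As a rough sketch of what matching means, the standard-library code below pairs each descriptor from one image with its nearest neighbor (by Euclidean distance) from another. The 3-dimensional toy descriptors and the name `match_descriptors` are assumptions for illustration; real SIFT descriptors have 128 dimensions, and practical matchers add filtering steps that are not shown here.

```python
import math


def match_descriptors(descs_a, descs_b):
    """For each descriptor in descs_a, return the index of its nearest
    neighbor in descs_b, measured by Euclidean distance."""
    matches = []
    for a in descs_a:
        distances = [math.dist(a, b) for b in descs_b]
        matches.append(distances.index(min(distances)))
    return matches


# Hypothetical toy descriptors (3-D instead of 128-D, for readability).
descs_a = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
descs_b = [[0.1, 0.9, 0.0], [0.0, 0.0, 1.0], [0.9, 0.1, 0.0]]

matches = match_descriptors(descs_a, descs_b)
print(matches)  # [2, 0]
```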

3. Given the following Python code using OpenCV, what will be the shape of the descriptor array returned by SIFT for an image in which 500 keypoints are detected?

import cv2
img = cv2.imread('image.jpg', cv2.IMREAD_GRAYSCALE)
sift = cv2.SIFT_create()
keypoints, descriptors = sift.detectAndCompute(img, None)
print(descriptors.shape)

medium

A. (None, 128)

B. (128, 500)

C. (500, 64)

D. (500, 128)

Solution

Step 1: Understand SIFT descriptor shape
Each SIFT descriptor is a 128-dimensional vector (a 4 x 4 grid of spatial cells with 8 orientation bins per cell), so the array has shape (number_of_keypoints, 128).
Step 2: Apply to given keypoints
With 500 keypoints, descriptors shape is (500, 128).
Final Answer:
(500, 128) -> Option D
Quick Check:
SIFT descriptors shape = (keypoints, 128) [OK]

Hint: SIFT descriptors = keypoints x 128 features [OK]

Common Mistakes:

Swapping dimensions of descriptors
Assuming 64 features per keypoint
Thinking the descriptor shape depends on image size
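
The 128 in the answer follows from the standard SIFT descriptor layout: a 4 x 4 grid of cells around the keypoint, each holding an 8-bin orientation histogram. The sketch below only reproduces that arithmetic; it does not call OpenCV, and `descriptor_shape` is a hypothetical helper named for this example.

```python
# Standard SIFT descriptor layout: 4 x 4 spatial cells, 8 orientation bins each.
GRID_CELLS = 4 * 4
ORIENTATION_BINS = 8
DESCRIPTOR_LENGTH = GRID_CELLS * ORIENTATION_BINS


def descriptor_shape(num_keypoints):
    """Shape of the descriptor array: one row per keypoint.

    Note that the image size does not appear anywhere in this calculation.
    """
    return (num_keypoints, DESCRIPTOR_LENGTH)


print(DESCRIPTOR_LENGTH)      # 128
print(descriptor_shape(500))  # (500, 128)
```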

4. You wrote this code to extract features using SIFT but get an error:

import cv2
img = cv2.imread('image.jpg')
sift = cv2.SIFT_create()
keypoints, descriptors = sift.detectAndCompute(img, None)
print(len(keypoints))

What is the likely cause of the error?

medium

A. The image is not loaded in grayscale, causing SIFT to fail

B. SIFT_create() is not a valid OpenCV function

C. detectAndCompute requires a mask argument

D. print(len(keypoints)) is incorrect syntax

Solution

Step 1: Check image loading method
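cv2.imread loads a 3-channel BGR color image by default, while SIFT works on single-channel intensity values, so the image is normally loaded or converted to grayscale first.
Step 2: Identify error cause
Of the listed options, only the missing grayscale step is plausible: SIFT_create() is a valid function, None is an accepted mask, and the print call is correct. In practice OpenCV's SIFT usually converts 8-bit color input to grayscale itself, so also check that cv2.imread did not return None, which it does when the file cannot be read.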
Final Answer:
The image is not loaded in grayscale, causing SIFT to fail -> Option A
Quick Check:
Load image grayscale for SIFT [OK]

Hint: Always load images in grayscale for SIFT [OK]

Common Mistakes:

Thinking SIFT_create() is invalid
Believing mask argument is mandatory
Assuming print syntax is wrong
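
As a rough illustration of the two checks discussed in this solution, the standard-library sketch below guards against a missing image and converts BGR pixels to grayscale using the common luma weights (0.299 R + 0.587 G + 0.114 B). The helper `bgr_to_gray` and the toy pixel data are assumptions for illustration. In real OpenCV code one would instead pass cv2.IMREAD_GRAYSCALE to cv2.imread or call cv2.cvtColor with cv2.COLOR_BGR2GRAY, and OpenCV's rounding may differ slightly from this sketch.

```python
def bgr_to_gray(image):
    """Convert a BGR image (rows of (b, g, r) tuples) to grayscale intensities."""
    if image is None:
        # An image loader returning None usually means the file could not be read.
        raise ValueError("image is None; check the file path")
    gray = []
    for row in image:
        # Weighted sum of the channels; note the B, G, R channel order.
        gray.append([round(0.114 * b + 0.587 * g + 0.299 * r) for (b, g, r) in row])
    return gray


# Hypothetical 2x2 BGR image: white, black, pure blue, pure red.
toy_bgr = [
    [(255, 255, 255), (0, 0, 0)],
    [(255, 0, 0), (0, 0, 255)],
]
gray = bgr_to_gray(toy_bgr)
print(gray)  # [[255, 0], [29, 76]]
```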

5. You want to extract features from images for a complex object recognition task. Which approach is best to capture detailed and high-level features?

hard

A. Use a deep learning model like a convolutional neural network (CNN)

B. Use simple edge detection filters only

C. Use random pixel values as features

D. Use image resizing without feature extraction

Solution

Step 1: Understand feature needs for complex tasks
Complex object recognition requires capturing detailed and abstract features.
Step 2: Compare methods for feature extraction
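Deep learning models such as CNNs learn a hierarchy of features directly from data, from edges in early layers to object parts in deeper ones, and typically outperform hand-crafted filters or random values on complex recognition tasks.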
Final Answer:
Use a deep learning model like a convolutional neural network (CNN) -> Option A
Quick Check:
Complex features need CNNs [OK]

Hint: Deep models capture complex features best [OK]

Common Mistakes:

Relying only on simple filters
Using random pixels as features
Skipping feature extraction by resizing only
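
To show the basic building block behind a CNN, the sketch below applies one 3x3 filter to a toy image and passes the result through ReLU, using only the Python standard library. This is an assumption-laden illustration, not a trained model: the kernel here is a hand-set vertical-edge filter, whereas a CNN learns its kernel weights from data and stacks many such layers. The names `conv2d_valid` and `relu` are chosen for this example.

```python
def conv2d_valid(image, kernel):
    """Slide the kernel over the image without padding.

    As in most deep learning frameworks, the kernel is not flipped,
    so this is technically cross-correlation.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * kernel[di][dj]
            row.append(acc)
        output.append(row)
    return output


def relu(feature_map):
    """Keep positive responses and zero out negative ones."""
    return [[max(0.0, value) for value in row] for row in feature_map]


# Hand-set vertical-edge kernel; a CNN would learn these weights.
vertical_edge_kernel = [
    [-1, 0, 1],
    [-2, 0, 2],
    [-1, 0, 1],
]
# Hypothetical 4x4 image: dark left half, bright right half.
toy_image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
feature_map = relu(conv2d_valid(toy_image, vertical_edge_kernel))
print(feature_map)  # [[4.0, 4.0], [4.0, 4.0]]
```

The filter responds strongly wherever the dark-to-bright edge falls inside its window, which is the kind of low-level feature that early CNN layers tend to pick up.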

Epoch	Loss ↓	Accuracy ↑	Observation
1	1.2	0.45	Model starts with high loss and low accuracy
2	0.9	0.60	Loss decreases, accuracy improves as model learns features
3	0.7	0.72	Model captures important patterns, accuracy rises
4	0.5	0.82	Loss continues to drop, model gets better
5	0.4	0.88	Training converges with good accuracy

Feature extraction approach in Computer Vision - Model Pipeline Trace
