SIFT (Scale-Invariant Feature Transform) features are widely used in computer vision. What is their main purpose?
Think about what SIFT helps with when matching parts of images taken from different distances or angles.
SIFT features detect key points and describe them so they can be matched even if the image is scaled or rotated. This helps in tasks like object recognition and image stitching.
What will be the output of the following Python code using OpenCV's SIFT detector?
import cv2 import numpy as np img = np.zeros((100, 100), dtype=np.uint8) sift = cv2.SIFT_create() keypoints = sift.detect(img, None) print(len(keypoints))
Consider what happens when you detect features in a completely black image.
The image is completely black with no edges or corners, so no keypoints are detected. The output is 0.
You want to match features between images taken from different distances and angles. Which feature descriptor is best suited for this?
Think about which descriptor is designed to handle changes in scale and rotation.
SIFT is specifically designed to be invariant to scale and rotation, making it ideal for matching features under such changes. Haar Cascades are for object detection, HOG is for shape, and raw pixels are not invariant.
In SIFT, the number of octave layers controls the number of scale levels per octave. What is the effect of increasing this number?
Think about how more layers per octave affect the scale space and processing.
Increasing octave layers means more scale levels are analyzed, which can detect more keypoints but requires more computation.
You matched SIFT features between two images and want to evaluate the quality of matches. Which metric best measures the ratio of correct matches to total matches?
Consider the metric that tells you how many matches are actually correct out of all matches found.
Precision measures the proportion of true positive matches among all matches found, indicating match quality. Recall measures how many true matches were found out of all possible true matches. MSE and confusion matrix are not directly used here.