In deep neural network (DNN) based face detection, what is the primary purpose of anchor boxes?
Think about how the model predicts where faces might be in an image.
Anchor boxes are predefined boxes of various sizes and shapes that help the model predict bounding boxes for faces at different scales and aspect ratios.
Consider a DNN face detector that outputs a tensor of shape (batch_size, 8732, 7). What does the number 7 represent in this output?
Recall that face detection usually involves bounding box coordinates and classification between face and background.
The 7 values typically include 4 for bounding box coordinates, 2 for class probabilities (face vs background), and 1 objectness score indicating confidence.
In a DNN-based face detector, what is the effect of setting a very high confidence threshold for detected faces?
Think about what happens if the model only accepts very confident detections.
A high confidence threshold means the model only accepts detections it is very sure about, reducing false alarms but possibly missing some real faces.
Which statement best describes how Intersection over Union (IoU) is used to evaluate face detection models?
Think about how to check if a predicted box matches the actual face location.
IoU calculates the ratio of the overlapping area to the combined area of predicted and true bounding boxes, helping decide if a detection is accurate.
A DNN face detection model trained on RGB images shows very low accuracy when tested on new images. The new images are loaded as grayscale and then converted to 3 channels by repeating the grayscale channel. What is the most likely cause of the low accuracy?
Consider how color information affects feature extraction in CNNs.
The model was trained on RGB images with color features. Repeating grayscale channels creates fake RGB input without true color, leading to poor feature extraction and low accuracy.