In object detection, we want to know not only if the object is found but also if it is correctly located. The key metric is Intersection over Union (IoU). IoU measures how much the predicted box overlaps with the true box. A higher IoU means better localization.
Besides IoU, Precision and Recall are important. Precision tells us how many detected boxes are correct, while Recall tells us how many true objects were found. The mean Average Precision (mAP) combines these by checking precision at different recall levels, considering localization quality.