For text detection in images, the main goal is to find all text areas correctly without too many false alarms. So, Recall is very important because it tells us how many real text parts the model found out of all the text parts that exist. Missing text means bad results.
Precision is also important because it shows how many detected text parts are actually text. Too many false detections confuse users.
The F1 score balances precision and recall, giving a single number to check overall quality.
Sometimes, Intersection over Union (IoU) is used to measure how well the detected boxes match the real text boxes.