In text recognition, the main goal is to correctly read characters or words from images. The key metrics are Character Error Rate (CER) and Word Error Rate (WER). These measure how many characters or words the model got wrong compared to the true text. Lower CER and WER mean better recognition.
Accuracy is also used, but CER and WER give a clearer picture because they count insertions, deletions, and substitutions of characters or words, which are common errors in text recognition.