For Named Entity Recognition (NER), the key metrics are Precision, Recall, and F1-score. These metrics tell us how well the model finds and labels entities correctly.
Precision shows how many of the entities the model found are actually correct. This matters because we want to avoid wrong information.
Recall shows how many of the real entities the model managed to find. This matters because missing important information can reduce usefulness.
F1-score balances precision and recall, giving a single number to understand overall quality.