In stemming, the main goal is to reduce words to their root form. The key metric is accuracy of normalization, which means how well the stemmer groups related words together without losing meaning.
We also look at precision and recall in the context of information retrieval or text classification tasks that use stemming. Precision measures how many of the stemmed words are correctly grouped, while recall measures how many related words are successfully captured by the stemmer.
Good stemming improves downstream tasks by reducing word variations, so metrics like F1 score on those tasks also matter.