Collaborative filtering predicts user preferences, so we want metrics that show how close predictions are to actual user choices.
Common metrics include:
- Root Mean Squared Error (RMSE): Measures average prediction error size. Lower is better.
- Mean Absolute Error (MAE): Average absolute difference between predicted and actual ratings.
- Precision and Recall: For top-N recommendations, precision shows how many recommended items were liked, recall shows how many liked items were recommended.
- F1 Score: Balances precision and recall for recommendation relevance.
- Mean Average Precision (MAP): Measures ranking quality of recommended items.
We choose metrics based on the goal: rating prediction accuracy or recommendation relevance.