Experiment - Evaluation metrics (accuracy, F1, confusion matrix)
Problem:You have trained a text classification model to identify positive and negative movie reviews. The model's training accuracy is 90%, but you want to understand how well it performs on unseen data using evaluation metrics.
Current Metrics:Training accuracy: 90%, Validation accuracy: 85%
Issue:Accuracy alone does not give a full picture of model performance, especially if classes are imbalanced. You need to compute F1 score and confusion matrix to better evaluate the model.