In fine-grained sentiment analysis with 5 classes (e.g., very negative, negative, neutral, positive, very positive), accuracy is a common metric because it shows how often the model predicts the exact sentiment correctly.
However, accuracy alone can hide problems if some classes are rare. So, we also use macro-averaged precision, recall, and F1-score. These treat each class equally, helping us see if the model struggles with any specific sentiment.
For example, if the model often misses "very negative" reviews, recall for that class will be low, signaling a problem.