When using transfer learning on small datasets, accuracy is often used to see how well the model predicts. But accuracy alone can be misleading if classes are unbalanced.
We also focus on precision and recall because small datasets can cause the model to miss important cases (low recall) or make many wrong positive predictions (low precision).
F1 score helps balance precision and recall, giving a better overall picture.