When using torchvision pre-trained models, the key metrics depend on the task. For image classification, accuracy is often used to measure how many images are correctly labeled. However, accuracy alone can be misleading if classes are unbalanced.
For more detailed insight, precision, recall, and F1 score help understand how well the model identifies each class, especially for rare classes. For example, in medical image classification, recall is critical to catch all positive cases.
In object detection or segmentation tasks, metrics like mean Average Precision (mAP) or Intersection over Union (IoU) are used, but for simple classification, accuracy, precision, recall, and F1 are the main metrics.