For image classification using CNNs, accuracy is often the first metric to check. It tells us how many images were correctly labeled out of all images.
However, if classes are uneven (some classes have many images, others few), accuracy can be misleading. Then, precision, recall, and F1 score per class become important to understand how well the model detects each category.
For example, if the model misses many images of a rare class, recall for that class will be low, even if overall accuracy is high.