When using geometric transforms like rotate, flip, and crop in computer vision, the key metric to watch is model accuracy or performance on validation data. This is because these transforms change the input images to help the model learn better. We want to see if these changes improve the model's ability to recognize objects correctly.
Also, robustness is important. This means the model should still perform well even if images are rotated or flipped in real life. So, metrics like accuracy, precision, recall, or F1 score on transformed images tell us if the model learned well from these geometric changes.