When dealing with dataset bias in vision tasks, accuracy alone can be misleading. Instead, precision, recall, and F1 score are important to understand how well the model performs across different groups or classes. This helps reveal if the model favors some categories over others due to bias in the data.
Also, confusion matrices help visualize errors per class, showing if some classes are systematically misclassified because of bias.