When using built-in datasets like those in torchvision.datasets, the key metrics depend on the task you train your model for. For example, if you train a classifier on CIFAR-10, accuracy is a simple and clear metric to see how well your model predicts the correct class.
However, if the dataset is imbalanced (some classes appear more than others), accuracy alone can be misleading. In that case, metrics like precision, recall, and F1 score become important to understand how well the model performs on each class.
In summary, the dataset provides the data, but the metric you choose depends on your model's goal and the dataset's balance.