When we load a saved model and use it to make predictions (inference), the key metric is accuracy or the metric used during training. This tells us if the model still works well after loading.
We also care about inference speed because in real life, predictions should be fast enough to be useful.
So, the main metrics are prediction correctness (like accuracy, precision, recall) and inference time.