Experiment - ONNX Runtime
Problem:You have a computer vision model trained in PyTorch that performs image classification. The model runs well but is slow during inference. You want to speed up the model inference using ONNX Runtime.
Current Metrics:Inference time per image: 120 ms, Accuracy on validation set: 85%
Issue:The model inference is too slow for real-time applications, although accuracy is good.