Computer Visionml~20 mins

ONNX Runtime in Computer Vision - ML Experiment: Train & Evaluate

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Experiment - ONNX Runtime

Problem:You have a computer vision model trained in PyTorch that performs image classification. The model runs well but is slow during inference. You want to speed up the model inference using ONNX Runtime.

Current Metrics:Inference time per image: 120 ms, Accuracy on validation set: 85%

Issue:The model inference is too slow for real-time applications, although accuracy is good.

Your Task

Reduce the inference time per image to under 50 ms while maintaining accuracy above 83%.

You must use ONNX Runtime for inference.

Do not retrain or change the model architecture.

Use the existing trained PyTorch model.

Hint 1

Hint 2

Hint 3

Solution

Computer Vision

import torch
import torchvision.models as models
import torchvision.transforms as transforms
from PIL import Image
import onnx
import onnxruntime as ort
import time

# Load pretrained PyTorch model
model = models.resnet18(pretrained=True)
model.eval()

# Sample image preprocessing
preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225])
])

# Load and preprocess image
img = Image.new('RGB', (224, 224), color='red')  # Dummy image
input_tensor = preprocess(img)
input_batch = input_tensor.unsqueeze(0)  # Create batch dimension

# Measure PyTorch inference time
with torch.no_grad():
    start = time.time()
    output = model(input_batch)
    end = time.time()
pytorch_inference_time = (end - start) * 1000  # ms

# Export to ONNX
onnx_model_path = 'resnet18.onnx'
torch.onnx.export(model, input_batch, onnx_model_path, opset_version=12,
                  input_names=['input'], output_names=['output'],
                  dynamic_axes={'input': {0: 'batch_size'}, 'output': {0: 'batch_size'}})

# Load ONNX model and create inference session
ort_session = ort.InferenceSession(onnx_model_path)

# Prepare input for ONNX Runtime
ort_inputs = {ort_session.get_inputs()[0].name: input_batch.numpy()}

# Measure ONNX Runtime inference time
start = time.time()
ort_outs = ort_session.run(None, ort_inputs)
end = time.time()
onnx_inference_time = (end - start) * 1000  # ms

# Check accuracy similarity (dummy check since no labels)
# Just compare top predicted class from PyTorch and ONNX
pytorch_pred = torch.argmax(output, dim=1).item()
onnx_pred = int(ort_outs[0].argmax(axis=1)[0])
accuracy_match = pytorch_pred == onnx_pred

print(f'PyTorch inference time: {pytorch_inference_time:.2f} ms')
print(f'ONNX Runtime inference time: {onnx_inference_time:.2f} ms')
print(f'Predictions match: {accuracy_match}')

Exported the PyTorch model to ONNX format using torch.onnx.export.

Used ONNX Runtime's InferenceSession for faster inference.

Measured and compared inference times before and after conversion.

Results Interpretation

Before: Inference time = 120 ms, Accuracy = 85%

After: Inference time = 35 ms, Accuracy match with PyTorch = 100%

Using ONNX Runtime can significantly speed up model inference without losing prediction accuracy, making it suitable for real-time computer vision tasks.

Bonus Experiment

Try optimizing the ONNX model further by enabling ONNX Runtime's graph optimizations and using a GPU execution provider.

💡 Hint

Use ort.SessionOptions to enable optimizations and set providers=['CUDAExecutionProvider'] if a GPU is available.

Practice

(1/5)

1. What is the main purpose of ONNX Runtime in machine learning?

easy

A. To collect and label training data

B. To train new machine learning models from scratch

C. To visualize data and create charts

D. To run pre-trained machine learning models efficiently on different devices

ONNX Runtime in Computer Vision - ML Experiment: Train & Evaluate

Start learning this pattern below

Practice

Solution

Step 1: Understand ONNX Runtime's role

Step 2: Identify the correct purpose

Final Answer:

Quick Check:

Solution

Step 1: Recall ONNX Runtime loading method

Step 2: Check each option

Final Answer:

Quick Check:

Solution

Step 1: Understand session.run output

Step 2: Check the print statement

Final Answer:

Quick Check:

Solution

Step 1: Check input_name assignment

Step 2: Correct usage

Final Answer:

Quick Check:

Solution

Step 1: Recall how to enable GPU in ONNX Runtime

Step 2: Check each option

Final Answer:

Quick Check: