Experiment - Latency optimization
Problem: You have a text generation model that takes too long to produce answers. The average response time is 5 seconds, which is too slow for users.
Current Metrics: Average latency: 5 seconds per request; Model accuracy: 92%
Issue: High latency causes a slow user experience, even though accuracy is good.
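Before optimizing, it helps to reproduce the latency figure with a repeatable measurement. The following is a minimal sketch of one way to do that, not the experiment's actual harness: `measure_latency` and `fake_generate` are hypothetical names, and `fake_generate` is only a stand-in for the real model call.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency(generate: Callable[[str], str], prompts: List[str]) -> Dict[str, float]:
    """Time each call to `generate` and summarize the latencies in seconds."""
    latencies = []
    for prompt in prompts:
        start = time.perf_counter()  # monotonic, high-resolution clock
        generate(prompt)
        latencies.append(time.perf_counter() - start)
    return {
        "mean_s": statistics.mean(latencies),
        "max_s": max(latencies),
        "n": len(latencies),
    }


def fake_generate(prompt: str) -> str:
    # Hypothetical stand-in for the real text generation model.
    time.sleep(0.01)
    return prompt.upper()


if __name__ == "__main__":
    stats = measure_latency(fake_generate, ["hello", "world", "latency"])
    print(f"mean latency: {stats['mean_s']:.3f} s over {stats['n']} requests")
```

Running the same measurement before and after each change gives a like-for-like comparison against the 5-second baseline; accuracy should be re-checked alongside it so a speedup does not silently cost quality.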