Experiment - Latency and cost benchmarking
Problem:You want to measure how fast your AI agent responds and how much it costs to run it. Currently, you have a model that answers questions but you don't know if it is fast enough or if it is too expensive.
Current Metrics:Average response latency: 1200 ms, Cost per 1000 requests: $5.00
Issue:The response time is slow for real-time use and the cost is high for frequent queries.
