Experiment - Regression testing for agent changes
Problem: You have an AI agent that performs tasks based on learned behavior. After updating the agent's code or model, you want to verify that its accuracy on previous tasks has not dropped.
Current Metrics: Before the update, the agent's accuracy on the test tasks was 92%. After the update, accuracy on the same tasks dropped to 78%.
Issue: The agent update caused a regression: accuracy on the old tasks fell by 14 percentage points (from 92% to 78%).
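
One way to catch a drop like this automatically is to compare per-task results from before and after the update and fail the check when accuracy falls by more than a tolerance. The following is a minimal sketch, not a definitive implementation. The function names, the 0.02 tolerance, and the 100-task suite are assumptions made for illustration; the source gives only the two accuracy figures.

```python
def accuracy(results):
    """Return the fraction of tasks passed; results maps task id -> bool."""
    if not results:
        raise ValueError("no task results to score")
    return sum(results.values()) / len(results)


def check_regression(baseline, candidate, max_drop=0.02):
    """Compare two runs over the same task set.

    max_drop is a hypothetical tolerance (absolute accuracy); tune it to
    the noise level of your own evaluation.
    """
    if baseline.keys() != candidate.keys():
        raise ValueError("baseline and candidate must cover the same tasks")
    base_acc = accuracy(baseline)
    cand_acc = accuracy(candidate)
    drop = base_acc - cand_acc
    # Tasks that passed before the update and fail after it.
    newly_failing = sorted(
        task for task in baseline if baseline[task] and not candidate[task]
    )
    return {
        "baseline_accuracy": base_acc,
        "candidate_accuracy": cand_acc,
        "drop": drop,
        "regressed": drop > max_drop,
        "newly_failing": newly_failing,
    }


# Assumed 100-task suite that reproduces the 92% and 78% figures.
baseline = {f"task_{i}": i < 92 for i in range(100)}
candidate = {f"task_{i}": i < 78 for i in range(100)}

report = check_regression(baseline, candidate)
print(f"drop: {report['drop']:.2f}, regressed: {report['regressed']}")
print(f"newly failing tasks: {len(report['newly_failing'])}")
```

Besides the aggregate drop, the report lists which tasks passed before and fail now, which is usually the first thing to inspect when tracing a regression back to a specific code or model change.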