Experiment - Intermediate result handling
Problem:You are building an AI agent that processes data in multiple steps. Currently, the agent only returns the final output, making it hard to debug or improve intermediate steps.
Current Metrics:Final output accuracy: 78%, no visibility into intermediate step performance.
Issue:Lack of intermediate result handling causes difficulty in understanding where errors occur and slows down improvements.