Experiment - Debate and consensus patterns
Problem: You want to build an AI system in which multiple agents debate different answers to a question and then reach a consensus on the best answer.
Current Metrics: Agents debate but often fail to agree, resulting in a low consensus accuracy of 60%.
Issue: The system shows poor consensus quality and low agreement among agents, which reduces overall answer accuracy.
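
The setup described above can be pictured with a minimal sketch. It is not the experiment's actual implementation: the agent interface, the names `run_debate`, `majority_vote`, and `consensus_accuracy`, the toy agents, and the `agreement_threshold` parameter are all assumptions made for illustration. Agents answer, see each other's answers, revise, and the debate stops once enough of them agree. "Consensus accuracy" is assumed here to mean the fraction of questions on which the consensus answer matches a reference answer.

```python
from collections import Counter
from typing import Callable, List, Tuple

# Hypothetical agent interface: takes the question and the peers' latest
# answers (empty on the first turn) and returns this agent's answer.
Agent = Callable[[str, List[str]], str]


def majority_vote(answers: List[str]) -> Tuple[str, float]:
    """Return the most common answer and the fraction of agents that gave it."""
    top_answer, top_count = Counter(answers).most_common(1)[0]
    return top_answer, top_count / len(answers)


def run_debate(
    question: str,
    agents: List[Agent],
    max_rounds: int = 3,
    agreement_threshold: float = 0.66,  # assumed cutoff, tune per experiment
) -> Tuple[str, float]:
    """Let agents revise their answers until enough of them agree."""
    # Opening turn: every agent answers without seeing the others.
    answers = [agent(question, []) for agent in agents]
    for _ in range(max_rounds):
        _, agreement = majority_vote(answers)
        if agreement >= agreement_threshold:
            break  # consensus reached, stop debating
        # Revision turn: every agent sees the previous round's answers.
        answers = [agent(question, answers) for agent in agents]
    return majority_vote(answers)


def consensus_accuracy(predictions: List[str], references: List[str]) -> float:
    """Fraction of questions where the consensus answer equals the reference."""
    correct = sum(p == r for p, r in zip(predictions, references))
    return correct / len(references)


# Toy agents standing in for real model calls (illustrative only).
def stubborn(answer: str) -> Agent:
    """Agent that never changes its answer."""
    return lambda question, peers: answer


def follower(initial: str) -> Agent:
    """Agent that adopts the majority answer once it has seen the peers."""
    def agent(question: str, peers: List[str]) -> str:
        return majority_vote(peers)[0] if peers else initial
    return agent


if __name__ == "__main__":
    agents = [stubborn("A"), stubborn("A"), follower("B")]
    answer, agreement = run_debate("Which option?", agents, agreement_threshold=1.0)
    print(answer, agreement)
```

Returning the agreement fraction alongside the answer makes low-agreement cases visible, so they can be counted separately from cases where agents agree on a wrong answer. In a real system each agent would wrap a model call rather than a fixed rule.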