Debate + consensus
📖 3 min readUpdated 2026-04-19
Debate is a multi-agent pattern where two or more agents argue opposing sides of a question. A judge, another agent or a human, evaluates and picks a winner or synthesizes. It sounds exotic. It actually works: on many reasoning tasks, debate produces more accurate answers than a single agent running alone. The trick is that arguing against yourself surfaces flaws your first-draft reasoning missed.
The setup
- The question is posed.
- Agent A is assigned one side (e.g., "argue yes").
- Agent B is assigned the opposite side, or the "find the flaws" role.
- They exchange arguments for N rounds (usually 2-4).
- A judge reads the full transcript and either picks a side or synthesizes.
Why it works
A single agent's first-pass reasoning can be confidently wrong in a consistent direction. When you force the same model to argue the opposite side, it's now actively looking for holes in the first position. The holes that exist, it finds. The judge, seeing both sides, has more signal than either agent alone had. Result: better final answers on problems where single-agent output tends to be inconsistent.
When to reach for debate
- Decision problems with multiple plausible answers. Build vs buy, react vs deliberate, approve vs escalate.
- Fact-checking. One agent makes a claim, another tries to refute it. Surfaces unsupported claims fast.
- Strategy evaluation. "Approach X vs approach Y" benefits from having each side championed.
- Risk analysis. "Optimist vs pessimist" agent produces a balanced view neither would have alone.
A worked example: a fact-check
Claim under review: "Startup X raised $50M Series B in March 2026."
- Claim agent: searches for evidence of the raise. Finds two press releases and a Crunchbase entry. Argues: "Yes, the raise happened."
- Skeptic agent: searches for contradictions. Notices the press releases are both from the company itself, no third-party reporting, Crunchbase entry created recently with minimal provenance. Argues: "The evidence chain is thin."
- Round 2: claim agent defends by finding independent coverage. Skeptic notes the coverage dates are all after the initial press release, no primary-source confirmation.
- Judge: "Claim is plausible but unverified. Recommend flagging as 'reported, unverified' rather than 'confirmed.'"
A single agent would likely have taken the press releases at face value. Debate caught the soft evidence chain.
Is it worth the cost?
The empirical picture
Published benchmarks show debate-based agents outperforming single-agent baselines on math, logic, fact-checking, and multi-step reasoning tasks. The gains are biggest where single-agent outputs are inconsistent run-to-run. For tasks the model nails the first time, debate is just expensive noise.
Failure modes
- Both agents converge. Models are trained to be compliant; if the "opposing" agent agrees instead of pushing back, you got no value. Fix: harden the personas, require each agent to find at least one flaw per round.
- Shallow arguments. If the model doesn't have depth on the topic, both agents produce generic talking points. Fix: give them tools (search, retrieval) so they can find real evidence.
- Weak judge. A judge that rubber-stamps one side undoes the entire mechanism. Fix: prompt the judge to explicitly weigh each argument and cite specific points.
- Cost. 3-5× single-agent spend. Use only where warranted.
- Endless debate. No stop condition; agents bicker forever. Fix: hard cap at 3-4 rounds.
Prompting notes
- Give each debater a specific position and a mandate ("you must find at least one flaw in the opposing argument each round").
- The judge prompt should include: read every round, weigh evidence, identify the strongest argument on each side, render a decision with rationale.
- Share the full transcript with each debater before each new round, so they can respond specifically.
What to do with this
- Try debate on your most-error-prone decision task. Measure eval delta against a single-agent baseline.
- Read peer agents for the general pattern debate sits inside.
- Read reflection for the single-agent cousin of the same idea.