Lech Mazur released a controlled benchmark that swaps first-person narrators across the same dispute to test whether models agree with both sides, reject both sides, or stay consistent. Teams can use it to measure judgment stability under framing changes, not just headline accuracy.

Mazur’s benchmark launch is designed to separate framing sensitivity from ordinary preference. Each dispute appears in five views: one neutral third-person version, two stripped first-person versions, and two affective first-person versions. The only thing that changes is who is telling the story and whether mild emotion is added.
That setup lets teams inspect three distinct failure modes instead of collapsing them into one score. In the follow-up, Mazur says the benchmark cleanly separates “baseline preference,” shifts caused by first-person perspective alone, and the extra movement caused by affective framing. The project page lives in the GitHub repo.
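The five-view design can be sketched as a small data structure. This is an illustrative reconstruction, not the benchmark's actual code: the field names, templates, and the `build_views` helper are all hypothetical, and the point is only that the facts stay fixed while narrator and affect vary.

```python
from dataclasses import dataclass

# Hypothetical sketch of the five-view design described above; the field
# names and prompt templates are illustrative, not the benchmark's code.
@dataclass
class DisputeViews:
    neutral: str       # third-person account of the dispute
    first_a: str       # stripped first-person, side A narrating
    first_b: str       # stripped first-person, side B narrating
    affective_a: str   # first-person, side A, mild emotion added
    affective_b: str   # first-person, side B, mild emotion added

def build_views(third_person: str, side_a: str, side_b: str,
                emotion: str = "I'm honestly really upset about this.") -> DisputeViews:
    """Hold the facts constant; vary only who narrates and whether affect is added."""
    return DisputeViews(
        neutral=third_person,
        first_a=f"Here is my situation: {side_a}",
        first_b=f"Here is my situation: {side_b}",
        affective_a=f"Here is my situation: {side_a} {emotion}",
        affective_b=f"Here is my situation: {side_b} {emotion}",
    )
```

Because the affective versions differ from the stripped first-person versions only by the added emotional sentence, any extra judgment movement between those two conditions can be attributed to affect rather than perspective.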
The results thread shows why a single leaderboard can hide deployment-relevant behavior. Gemini 3.1 Pro leads the headline chart, but Mazur says it “drops from #1 to #13” once contrarian contradiction is included. He also calls out that “plain first-person perspective already breaks consistency” for some models, with Mistral Large 3 reaching 31.2% contradiction before extra emotional wording is added.
The practical value is in the extra diagnostics. Mazur’s analysis reports Grok 4.20 Reasoning Beta as low-sycophancy but heavily abstaining, while GLM-5 reaches 93.0% decisive coverage at the cost of 12.1% contradiction. In the worked roommate example, different models split across stable cross-narrator judgments, FIRST/FIRST sycophancy, and OTHER/OTHER contrarian behavior, giving eval teams a reproducible way to test judgment stability under perspective swaps rather than relying on headline accuracy alone.
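The diagnostics above can be made concrete with a small scoring sketch. This assumes, purely for illustration, that each dispute yields a verdict of `"A"`, `"B"`, or `"abstain"` under each narrator's framing; the labels follow the failure modes named in the text (FIRST/FIRST sycophancy, OTHER/OTHER contrarianism), but the exact scoring rules of Mazur's benchmark may differ.

```python
from collections import Counter

# Illustrative scoring sketch, assuming each dispute produces one verdict
# ("A", "B", or "abstain") per narrator framing. "sycophantic" means the
# model sides with whoever is narrating (FIRST/FIRST); "contrarian" means
# it sides against the narrator both times (OTHER/OTHER).
def classify(verdict_as_a: str, verdict_as_b: str) -> str:
    if "abstain" in (verdict_as_a, verdict_as_b):
        return "abstain"
    if verdict_as_a == verdict_as_b:
        return "consistent"       # same judgment across the narrator swap
    if verdict_as_a == "A" and verdict_as_b == "B":
        return "sycophantic"      # agrees with whoever is speaking
    return "contrarian"           # disagrees with whoever is speaking

def rates(pairs):
    """pairs: list of (verdict when A narrates, verdict when B narrates)."""
    counts = Counter(classify(a, b) for a, b in pairs)
    n = len(pairs)
    decisive = n - counts["abstain"]
    return {
        "decisive_coverage": decisive / n,
        # contradiction = any narrator-dependent flip among decisive answers
        "contradiction": (counts["sycophantic"] + counts["contrarian"]) / max(decisive, 1),
    }
```

Separating the two ratios this way makes the Grok-versus-GLM trade-off visible: a model can buy a low contradiction rate by abstaining often, which shows up as low decisive coverage rather than as safety.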
Release: OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
Release: Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
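The candidate-filtering idea behind this kind of index can be sketched with trigrams alone. This is the classic trigram-index technique (n-grams plus an inverted index), not Cursor's actual implementation, which also layers in Bloom filters; the class and method names here are invented for illustration.

```python
import re
from collections import defaultdict

# Minimal trigram-index sketch of regex candidate filtering: the inverted
# index narrows the search to files that contain every trigram of a
# required literal substring, and the full regex runs only on survivors.
class TrigramIndex:
    def __init__(self):
        self.postings = defaultdict(set)  # trigram -> set of file ids
        self.files = {}                   # file id -> full text

    def add(self, file_id, text):
        self.files[file_id] = text
        for i in range(len(text) - 2):
            self.postings[text[i:i + 3]].add(file_id)

    def candidates(self, literal):
        """Files containing every trigram of a literal the regex requires."""
        grams = [literal[i:i + 3] for i in range(len(literal) - 2)]
        if not grams:
            return set(self.files)        # literal too short to filter on
        return set.intersection(*(self.postings.get(g, set()) for g in grams))

    def search(self, pattern, literal):
        """Verify the full regex only against the surviving candidates."""
        rx = re.compile(pattern)
        return sorted(f for f in self.candidates(literal)
                      if rx.search(self.files[f]))
```

The speedup comes from the set intersections being cheap relative to scanning every file with the regex engine, which is exactly the ripgrep-style bottleneck the item describes.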
Breaking: ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
Breaking: Epoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.
New! LLM Sycophancy Benchmark: Opposite-Narrator Contradictions. Same dispute, opposite first-person perspectives: does the model keep the same judgment, or start agreeing with whoever is speaking? Gemini 3.1 Pro posts the lowest headline sycophancy rate, but the full diagnostics complicate that picture.