AI PrimerAI Primer
LLM Debate Benchmark ranks Sonnet 4.6 first across 1,162 side-swapped debates | AI Primer