Third-party MRCR v2 results put Claude Opus 4.6 at a 78.3% match ratio at 1M tokens, ahead of Sonnet 4.6, GPT-5.4, and Gemini 3.1 Pro. If you are testing long-context agents, measure retrieval quality and task completion, not just advertised context window size.

Anthropic has moved 1M-token context from preview status into general availability for Claude Opus 4.6 and Claude Sonnet 4.6. In the same announcement thread, the rollout post says the models can ingest up to 600 images, which expands the practical input budget beyond text-heavy agent runs.
The rollout is already showing up in developer tooling. In a Claude Code screenshot, Opus 4.6 appears as "Opus 4.6 (1M context) · Claude Max," while a supporting roundup also describes the 1M window as generally available for both 4.6 models. That matters operationally because it turns long-context testing into something engineers can actually run inside coding workflows rather than a limited-access benchmark claim.
The strongest signal in the evidence is not the 1M number but the retrieval curve. The MRCR v2 chart shows Opus 4.6 at 91.9% mean match ratio at 256K tokens and 78.3% at 1M, while Sonnet 4.6 drops to 65.1%, GPT-5.4 to 36.6%, and Gemini 3.1 Pro to 25.9%. The same chart notes Gemini's numbers were measured by Context Arena on the same benchmark rather than using vendor self-report, and OpenAI's score is a "bin average" across 128K-256K rather than a single fixed context length.
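The "mean match ratio" metric behind these numbers can be approximated with a string-similarity score between the reference text and what the model retrieves. A minimal sketch, assuming a SequenceMatcher-style ratio (the exact MRCR v2 scoring may differ):

```python
from difflib import SequenceMatcher

def match_ratio(expected: str, actual: str) -> float:
    """Similarity in [0, 1] between the reference needle and the model's retrieval."""
    return SequenceMatcher(None, expected, actual).ratio()

def mean_match_ratio(pairs: list[tuple[str, str]]) -> float:
    """Average match ratio over (expected, actual) pairs at one context length."""
    return sum(match_ratio(e, a) for e, a in pairs) / len(pairs)

# Toy run: one perfect retrieval, one partial retrieval.
pairs = [
    ("write a poem about otters", "write a poem about otters"),
    ("write a poem about otters", "write a poem about cats"),
]
print(round(mean_match_ratio(pairs), 3))
```

Running this at several context lengths (256K, 512K, 1M) produces exactly the kind of retrieval curve the chart plots, which is why per-length scores are more informative than a single bin average.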
That distinction is why retrieval quality matters more than raw advertised window size. As one reply puts it, "If you can't get accurate retrieval," the benefit of a larger window is limited. The supporting comparison post also calls out GPT-5.4 as a "regression" on long-context behavior relative to an earlier OpenAI chart at 256K, reinforcing that context-window expansion does not automatically preserve recall.
The most concrete field report here is a side-by-side CLI comparison. In that screenshot, Gemini CLI warns that sending a message "might exceed the context window limit," while Claude Code continues processing with Opus 4.6 at 1M context. The author claims Claude handled a larger project prompt that Gemini would not submit, which is anecdotal, but the screenshot does show a real difference in tool behavior under large inputs.
For code agents, that changes the failure mode. Instead of deciding which repo slice, docs subset, or prior conversation chunk to drop, teams can keep more of the working set in one session. Another practitioner post frames it as "entire codebases" and "complete conversation history" staying in context for complex refactors. Independent eval work is still catching up, though: one evaluator said his team hit "a couple speed bumps" while trying to get Opus 1M working in their own test setup.
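The truncation decision described above can be made concrete. A minimal sketch of a context-budget packer, using a hypothetical chars-per-token heuristic (a real agent would use the model's tokenizer):

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic (~4 chars per token); replace with the model's tokenizer in practice.
    return max(1, len(text) // 4)

def pack_context(files: dict[str, str], budget: int) -> tuple[list[str], list[str]]:
    """Greedily include files (in priority order) until the token budget is spent.

    Returns (included, dropped). With a 1M-token budget many repos fit whole;
    with a smaller window this is the "which slice to drop" decision.
    """
    included, dropped, used = [], [], 0
    for name, text in files.items():
        cost = estimate_tokens(text)
        if used + cost <= budget:
            included.append(name)
            used += cost
        else:
            dropped.append(name)
    return included, dropped

files = {"main.py": "x" * 4000, "docs.md": "y" * 8000, "history.txt": "z" * 2000}
small_in, small_out = pack_context(files, budget=2500)       # tight window: must drop
large_in, large_out = pack_context(files, budget=1_000_000)  # 1M-style window
print(small_out, large_out)
```

With the tight budget the packer drops `docs.md`; with the 1M-style budget nothing is dropped, which is the failure-mode change the post describes.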
Claude can now drive macOS apps, browser tabs, the keyboard, and the mouse from Claude Cowork and Claude Code, with permission prompts when it needs direct screen access. That makes legacy desktop workflows automatable, and Anthropic is pairing the push with more background-task support for longer agent loops.
Release: OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
Release: Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
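The pruning idea behind an index like that can be sketched in a few lines: break each file into trigrams, store them in a per-file Bloom filter, and run the actual scan only on files whose filters might contain every trigram of the query. This is a simplified sketch (literal queries only; a production index would extract required literals from arbitrary regexes and use tuned filter parameters):

```python
import hashlib
import re

def trigrams(s: str) -> set[str]:
    """All overlapping 3-character substrings of s."""
    return {s[i:i + 3] for i in range(len(s) - 2)}

class BloomFilter:
    """Tiny Bloom filter: k hash positions per item over a fixed bit array."""
    def __init__(self, size: int = 4096, hashes: int = 3):
        self.size, self.hashes = size, hashes
        self.bits = bytearray(size)

    def _positions(self, item: str):
        for i in range(self.hashes):
            h = hashlib.blake2b(f"{i}:{item}".encode(), digest_size=8).digest()
            yield int.from_bytes(h, "big") % self.size

    def add(self, item: str):
        for p in self._positions(item):
            self.bits[p] = 1

    def might_contain(self, item: str) -> bool:
        # False positives possible, false negatives impossible.
        return all(self.bits[p] for p in self._positions(item))

def build_index(files: dict[str, str]) -> dict[str, BloomFilter]:
    index = {}
    for name, text in files.items():
        bf = BloomFilter()
        for g in trigrams(text):
            bf.add(g)
        index[name] = bf
    return index

def search(index: dict[str, BloomFilter], files: dict[str, str], literal: str) -> list[str]:
    # Prune: a file can match only if its filter might contain every query trigram.
    candidates = [n for n, bf in index.items()
                  if all(bf.might_contain(g) for g in trigrams(literal))]
    # Verify: run the real scan only on surviving candidates.
    return [n for n in candidates if re.search(re.escape(literal), files[n])]

files = {"a.py": "def instant_grep(): pass", "b.py": "print('hello')"}
index = build_index(files)
print(search(index, files, "grep"))
```

The Bloom stage can admit false positives but never false negatives, so the final scan stays exact while most files are skipped entirely; that is the candidate-retrieval speedup described above.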
Breaking: ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
Breaking: Epoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.
Anthropic consistently manages to impress, particularly with its ability to maintain attention and reasoning across extended contexts. It even outperforms Google, the former context king, by a wide margin.
This is huge, literally. 1M context is now generally available for Claude Sonnet/Opus 4.6, and it can ingest 600 images. So it can "watch" a video: take a video, split out the frames, then upload them. Not sure how big the images can be; probably small, to leave room for a conversation.
Claude Opus 4.6 with 1M context in Claude Code is a massive upgrade. 5x the context window we had before yesterday's update. Entire codebases. Full documentation. Complete conversation history. All in context. No more truncation. No more lost context mid-session. This changes …