Kilo said MiniMax M2.7 placed fifth on PinchBench, 1.2 points behind Opus 4.6 at much lower input cost, while community tests showed strong multi-loop agent behavior on graphics tasks. If you route coding-agent traffic by price, M2.7 looks worth a controlled bake-off.

Kilo's benchmark writeup says MiniMax M2.7 scored 86.2% on PinchBench, a benchmark for agentic coding tasks, putting it fifth among 50 tested models and 1.2 points behind Opus 4.6. The same post says it outperformed several established competitors while improving 3.7 points over M2.5.
The second benchmark matters more for autonomous coding behavior. In Kilo Bench, an 89-task eval, Kilo reports M2.7 finished second overall with a 47% pass rate and says the model often spends longer exploring a codebase before making changes. Kilo's launch thread claims that pattern let it solve tasks that no other tested model completed, but the writeup also says the same behavior can drive up token use and cause timeouts on harder jobs.
The first practitioner reports are less about raw benchmark rank and more about orchestration style. In one shared setup, the author says they used "Opus 4.6 as a reviewer/planner" with "4 worker minimax M2.7 agent" instances and a loop count of five to iteratively improve generated scenes (workflow thread). The attached demo shows a Three.js-generated "premium interactive isometric 3D cozy room" built in a single HTML block through repeated refinement (cozy room demo).
A follow-up voxel-art test used the same five-loop pattern, and the author advises to "always use minimax in agentic form," adding that the full five-loop run still cost less than Sonnet 4.6 (cost claim). The evidence here is anecdotal, but it lines up with Kilo's benchmark note that M2.7 appears strongest when given room to explore, review, and iterate rather than being treated as a cheap one-pass coding model (benchmark details).
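The reviewer/worker arrangement described in these reports is straightforward to sketch. The following is a minimal, hypothetical Python skeleton of that loop; every function name here is a placeholder standing in for a real model call, not a MiniMax, Anthropic, or Kilo API, and the candidate-selection step is deliberately simplified:

```python
# Hypothetical sketch of a planner/worker refinement loop:
# one reviewer model critiques, N worker models regenerate, repeated `loops` times.

def call_planner(artifact: str) -> str:
    """Placeholder for the reviewer/planner model (an Opus-class model in the report)."""
    return f"critique of: {artifact[:30]}"

def call_worker(task: str, critique: str, worker_id: int) -> str:
    """Placeholder for one worker model (a MiniMax-class model in the report)."""
    return f"<html><!-- {task} | revised per '{critique}' | worker {worker_id} --></html>"

def refine(task: str, n_workers: int = 4, loops: int = 5) -> str:
    """Run `loops` rounds: the planner critiques the current best artifact,
    then each worker regenerates against that critique."""
    best = call_worker(task, "initial draft", worker_id=0)
    for _ in range(loops):
        critique = call_planner(best)
        candidates = [call_worker(task, critique, i) for i in range(n_workers)]
        # Naive selection: take the first candidate. A real setup would
        # score candidates (e.g. have the planner rank them) before choosing.
        best = candidates[0]
    return best
```

The interesting design choice in the reported setup is that iteration count, not model size, drives quality: the expensive model is called once per loop as a critic, while the cheap model does all generation.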
Vals AI switched SWE-Bench Verified from SWE-Agent to the bash-only mini-swe-agent harness, aligning results more closely with the official benchmark setup. Top score dipped slightly to 78.8%, but the change reduces harness-specific confounds when comparing models.
Release: OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
Release: Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
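Cursor has not published full implementation details, but the core candidate-filtering idea behind an n-gram index is well known: a file can only match a query if it contains every trigram of the query's literal part, so posting lists are intersected first and the full regex runs only on the survivors. A toy Python sketch under those assumptions (class and method names are illustrative, not Cursor's):

```python
import re
from collections import defaultdict

def trigrams(s: str) -> set:
    """All overlapping 3-character substrings of s."""
    return {s[i:i + 3] for i in range(len(s) - 2)}

class TrigramIndex:
    """Toy inverted index mapping trigrams to file ids. A production
    version would add Bloom filters per file to skip posting lists cheaply."""

    def __init__(self):
        self.postings = defaultdict(set)  # trigram -> {file ids}
        self.files = {}                   # file id -> contents

    def add(self, file_id: str, text: str) -> None:
        self.files[file_id] = text
        for g in trigrams(text):
            self.postings[g].add(file_id)

    def search(self, literal: str, pattern: str) -> list:
        """Intersect posting lists for the query's literal trigrams,
        then run the real regex only over the candidate files."""
        grams = trigrams(literal)
        if grams:
            candidates = set.intersection(*(self.postings[g] for g in grams))
        else:
            candidates = set(self.files)  # literal too short to prune
        rx = re.compile(pattern)
        return sorted(f for f in candidates if rx.search(self.files[f]))
```

The speedup comes from the intersection step: most files share no trigrams with the query, so the expensive regex scan touches only a tiny candidate set instead of the whole repository.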
Breaking: ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
Breaking: Epoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.