AI PrimerAI Primer
Vals benchmarks Grok 4.20 Beta: ProofBench rises to 14% while legal tasks regress | AI Primer