Andrej Karpathy open-sourced autoresearch, a minimal agent loop for automated ML research, and reported roughly 20 additive changes that reduced nanochat’s Time to GPT-2 from 2.02 hours to 1.80 hours. Research teams can use it as a concrete recipe for closed-loop experimentation on any metric that admits cheap proxy evaluations.

Karpathy's release thread positions autoresearch less as a polished product than as a reusable loop: give an agent a measurable objective, let it modify the training code, run full experiments, score the result, and preserve wins. The repo is available on GitHub, and a widely shared early walkthrough distilled the operating model to "~630 lines of code," "single GPU," and short training cycles.
That matters because the contribution is procedural. Instead of promising autonomous science in the abstract, autoresearch packages the bread-and-butter ML tuning workflow Karpathy describes doing manually for "2 decades" into an agentic closed loop that can keep iterating while humans refine prompts and constraints.
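Stripped to its essentials, the closed loop described above can be sketched in a few lines. This is a toy illustration, not the repo's actual API: `propose_change` and `train_and_eval` are hypothetical stand-ins for the LLM agent's code edits and a short real training run.

```python
import random

def propose_change(history):
    # Placeholder: in autoresearch an LLM agent reads the history of
    # results and emits a code edit; here we just perturb one knob.
    lr = history[-1]["lr"] * random.choice([0.5, 0.8, 1.25, 2.0])
    return {"lr": lr}

def train_and_eval(cfg):
    # Placeholder for a short training run scored on validation loss;
    # a toy convex bowl around lr=3e-4 stands in. Lower is better.
    return abs(cfg["lr"] - 3e-4) / 3e-4

def research_loop(budget=50):
    best = {"lr": 1e-3}
    best_loss = train_and_eval(best)
    history = [{"lr": best["lr"], "loss": best_loss}]
    for _ in range(budget):
        cfg = propose_change(history)
        loss = train_and_eval(cfg)
        history.append({"lr": cfg["lr"], "loss": loss})
        if loss < best_loss:          # preserve only the wins
            best, best_loss = cfg, loss
    return best, best_loss
```

The essential structure is the `if loss < best_loss` gate: candidate changes are cheap to generate, and only measured improvements survive onto the running-best path.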
The strongest evidence is the nanochat run itself. Karpathy says a roughly two-day run on a depth-12 model found about 20 validation-loss improvements, and that every one he tested was additive and transferred to larger depth-24 models. Stacked together, those changes moved Time to GPT-2 from 2.02 hours to 1.80 hours, which he says becomes the new leaderboard entry.
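As a quick sanity check on the headline numbers, the 2.02-to-1.80-hour move works out to roughly an 11% wall-clock reduction:

```python
baseline, improved = 2.02, 1.80   # hours, Time to GPT-2
saved = baseline - improved       # 0.22 hours, about 13 minutes
speedup = baseline / improved     # throughput-style speedup factor
reduction = saved / baseline      # fraction of wall-clock time removed
print(f"{speedup:.3f}x speedup, {reduction:.1%} reduction")
# → 1.122x speedup, 10.9% reduction
```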
The results plot shows 276 experiments with 29 kept improvements on the running-best path, while the thread says the broader process worked through about 700 autonomous changes. The retained fixes included sharper attention from adding a missing QK-norm scaler, regularization for value embeddings, less conservative banded attention, corrected AdamW betas, a tuned weight-decay schedule, and improved initialization.
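To make the first of those fixes concrete: QK norm normalizes queries and keys before the dot product, bounding attention logits, and the reported fix adds back a learnable scaler on top. This is my own minimal single-head sketch in NumPy, not the nanochat code:

```python
import numpy as np

def qk_norm_attention(q, k, v, scale=1.0, eps=1e-6):
    """Single-head attention with RMS-normalized queries and keys.

    Normalizing q and k keeps the logits bounded, which stabilizes
    training; `scale` plays the role of the learnable scaler the fix
    reportedly restores. Shapes: q, k, v are (seq_len, head_dim).
    """
    q = q / (np.sqrt((q ** 2).mean(-1, keepdims=True)) + eps)
    k = k / (np.sqrt((k ** 2).mean(-1, keepdims=True)) + eps)
    logits = scale * (q @ k.T)
    logits -= logits.max(-1, keepdims=True)   # numerical stability
    probs = np.exp(logits)
    probs /= probs.sum(-1, keepdims=True)
    return probs @ v
```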
Karpathy also says the agent "looked at the sequence of results of experiments and used that to plan the next ones," which is the more important engineering claim than raw benchmark movement: the loop is doing sequential experimental design, not just grid search. Meanwhile, the result spread quickly, with one reposted copy passing 1,000 reposts, signaling that this specific benchmark delta landed as more than a niche repo drop.
Karpathy's framing is blunt: "All LLM frontier labs will do this," and scaling it is "just engineering." His proposed path is a swarm model: agents tune smaller systems cheaply, promising ideas get promoted to larger scales, and humans stay on the edges for supervision and problem selection.
The practical boundary condition is also clear in the thread. This works best where the target metric is cheap to score directly, or where a smaller model or proxy objective gives a fast signal. That's why nanochat is a plausible first target and why the same pattern could extend to inference, training, or system-level metrics that can be evaluated repeatedly without expensive human review.
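The promotion pattern this implies is simple to state in code. In this sketch, `cheap_proxy_score` and `expensive_full_score` are hypothetical stand-ins for, say, a depth-12 validation run and a depth-24 confirmation run; neither name comes from the repo:

```python
def promote_if_promising(changes, cheap_proxy_score, expensive_full_score,
                         baseline_proxy, baseline_full, margin=0.0):
    """Gate expensive evaluations behind a cheap proxy metric.

    Each candidate change is first scored on the fast proxy (e.g. a
    small model's validation loss); only changes that beat the proxy
    baseline by `margin` earn a full-scale run. Lower is better.
    """
    kept = []
    for change in changes:
        if cheap_proxy_score(change) < baseline_proxy - margin:
            if expensive_full_score(change) < baseline_full:
                kept.append(change)   # an additive win at full scale too
    return kept
```

The economics only work when the proxy is much cheaper than the full run and correlated enough with it that few promotions are wasted.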
A useful read from practitioners is that the hard part may shift from execution to research design. In one engineer's reaction, the interesting work becomes setting hypotheses, building verification methods, and using "contracts" so longer-horizon agents improve systems without drifting off-task.
Claude can now drive macOS apps, browser tabs, the keyboard, and the mouse from Claude Cowork and Claude Code, with permission prompts when it needs direct screen access. That makes legacy desktop workflows automatable, and Anthropic is pairing the push with more background-task support for longer agent loops.
Release: OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
Release: Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
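The general trick behind such indexes can be sketched independently of Cursor's implementation: extract the literal trigrams a query requires, intersect their posting lists to get a small candidate set, then run the real regex only on those files. A toy version, assuming literal queries:

```python
import re
from collections import defaultdict

def trigrams(s):
    return {s[i:i + 3] for i in range(len(s) - 2)}

class TrigramIndex:
    """Toy inverted index mapping trigram -> set of file ids."""

    def __init__(self):
        self.postings = defaultdict(set)
        self.files = {}

    def add(self, file_id, text):
        self.files[file_id] = text
        for g in trigrams(text):
            self.postings[g].add(file_id)

    def search_literal(self, pattern):
        # Every trigram of a literal pattern must appear in a matching
        # file, so intersecting posting lists prunes the search space;
        # the regex then confirms true matches on the survivors only.
        grams = trigrams(pattern)
        if not grams:
            candidates = set(self.files)      # too short to prefilter
        else:
            candidates = set.intersection(*(self.postings[g] for g in grams))
        rx = re.compile(re.escape(pattern))
        return sorted(f for f in candidates if rx.search(self.files[f]))
```

Production systems layer Bloom filters and n-gram statistics on top of this idea to keep the index compact and the false-positive rate low, but the intersect-then-verify shape is the same.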
breakingChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
breakingEpoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.
Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes, …
Andrej Karpathy just dropped something absurdly insane. An open-source repo where an AI agent runs its own ML research loop. While you sleep. The setup is almost absurdly simple: -~630 lines of code -single GPU -5-minute training runs But here’s the twist. The human …
I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the
"All LLM frontier labs will do this. It's the final boss battle... Doing it is 'just engineering' and it's going to work. You spin up a swarm of agents, you have them collaborate to tune smaller models, you promote the most promising ideas to increasingly larger scales, and …