OpenAI opened its first Parameter Golf challenge, asking participants to train the best language model that fits inside a 16 MB artifact and trains in under 10 minutes on eight H100s. Engineers get a concrete optimization target, an automated GitHub leaderboard, and a public benchmark for training-efficiency tricks.

Parameter Golf is a constrained training contest, not a general model launch. The core rule in OpenAI’s repo screenshot is unusually tight: the entire artifact must stay within 16 MB, including weights and training code, and the model must train in less than 10 minutes on 8×H100s. OpenAI’s launch post says submissions are evaluated on FineWeb validation compression, measured in bits per byte, which avoids tying the benchmark to any single tokenizer.
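Bits per byte falls out of ordinary cross-entropy: sum the model's negative log-likelihood in nats over the validation text, convert to bits, and divide by the raw UTF-8 byte count. A minimal sketch of that conversion, with an illustrative helper name and made-up numbers (the contest's exact evaluation harness may differ):

```python
import math

def bits_per_byte(total_nll_nats: float, total_bytes: int) -> float:
    """Convert a summed negative log-likelihood (in nats, over all tokens)
    into bits per byte of the underlying UTF-8 text. Dividing by the raw
    byte count rather than the token count keeps the metric comparable
    across models with different tokenizers."""
    return total_nll_nats / (total_bytes * math.log(2))

# e.g. an average loss of 1.2 nats/token over 1,000 tokens of 4,000-byte text
print(round(bits_per_byte(1.2 * 1000, 4000), 4))  # ≈ 0.4328
```

Because the denominator is bytes of text, a model using a large-vocabulary tokenizer gets no free win from producing fewer, more expensive tokens.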
The challenge design is also explicit about what kinds of techniques might matter. The repo screenshot lists ideas such as “aggressive parameter tying,” “depth recurrence,” “low-rank training,” “low precision,” QAT, bitnets, “novel tokenizers,” and even “test-time training” or “megakernels.” That makes this closer to an engineering sandbox for model-efficiency tricks than a narrow architecture bake-off.
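That list reads like a budget problem. As a rough illustration of why tricks such as parameter tying and low precision matter at a 16 MB cap, here is a back-of-envelope parameter counter; the `artifact_mb` helper and the 12·d² per-block estimate are assumptions for this sketch, not the contest's own accounting:

```python
def artifact_mb(vocab, d_model, layers, tied=True, bits=16):
    """Back-of-envelope artifact size (MiB) for a decoder-only transformer.
    Assumes ~12*d^2 weights per block (attention + MLP) and ignores
    norms, biases, and the training code's share of the 16 MB budget."""
    emb = vocab * d_model                      # token embedding table
    head = 0 if tied else vocab * d_model      # tying shares the table with lm_head
    total = emb + head + layers * 12 * d_model ** 2
    return total * bits / 8 / 2 ** 20          # params -> bytes -> MiB

# 16k vocab, 384-dim, 6 layers: fp16 untied vs int8 tied
print(artifact_mb(16_384, 384, 6, tied=False, bits=16))  # well over 16 MiB
print(artifact_mb(16_384, 384, 6, tied=True, bits=8))    # roughly at the cap
```

Even this toy arithmetic shows why the idea list skews toward weight sharing and quantization: at these scales the embedding table and weight precision dominate the artifact size.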
OpenAI is positioning it as part of the same family as the NanoGPT Speedrun, but with a different objective. In the repo screenshot, the company describes Parameter Golf as optimizing for the lowest loss under a fixed parameter budget, rather than primarily for elapsed training time or dataset size.
OpenAI’s challenge page points participants to a public repo and leaderboard, giving engineers a concrete implementation path instead of a vague call for ideas. The page summary says the repository includes baseline models, datasets, and evaluation scripts, so entrants can fork a working setup and iterate on architecture, compression, or training-system changes without rebuilding the benchmark harness from scratch.
The operational piece is also part of the launch. OpenAI engineer Will Depue wrote that the company is “covering $1M of compute” and “working with runpod,” which offers a dedicated Parameter Golf template. For practitioners, that matters because the rules depend on a specific hardware envelope, and a shared template reduces ambiguity around whether a result actually fits the contest’s 8×H100, sub-10-minute constraints.
The timebox is short. Per the challenge thread, the contest runs from March 18 through April 30, with hiring language attached for top performers, which makes the public leaderboard both a benchmark and a visible screening surface.
Epoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.
release: OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
release: Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
breaking: ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
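Cursor hasn't published Instant Grep's internals beyond those ingredients, but the candidate-filtering idea behind trigram-style indexes can be sketched as follows (plain Python sets stand in for Bloom filters and posting-list compression; all names here are illustrative):

```python
from collections import defaultdict

def trigrams(s: str) -> set:
    """All 3-character substrings of s."""
    return {s[i:i + 3] for i in range(len(s) - 2)}

class TrigramIndex:
    """Maps each trigram to the set of files containing it. A literal
    query can only match files that contain ALL of its trigrams, so
    intersecting posting lists yields a small candidate set; only those
    files need a full regex/ripgrep-style scan."""
    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, file_id: int, text: str):
        for g in trigrams(text):
            self.postings[g].add(file_id)

    def candidates(self, literal: str) -> set:
        grams = trigrams(literal)
        if not grams:  # query shorter than 3 chars: fall back to scanning all
            return set.union(*self.postings.values()) if self.postings else set()
        sets = [self.postings.get(g, set()) for g in grams]
        return set.intersection(*sets) if all(sets) else set()

idx = TrigramIndex()
idx.add(0, "def parse_config(path):")
idx.add(1, "class HttpServer:")
print(idx.candidates("parse"))  # → {0}
```

The speedup comes from the index pruning the candidate set before any file is opened; the final scan still runs the real regex, so the index only has to be conservative, never exact.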
OpenAI just released "Parameter Golf," a new challenge to train the best language model that fits in a 16 MB artifact and trains in under 10 minutes on 8×H100s. There's also a leaderboard, and top performers may get hired. The challenge is open from March 18th to April 30th.
Are you up for a challenge? openai.com/parameter-golf