Skip to content
AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Filters

Category

Tags

Breaking

Epoch releases MirrorCode with 25 long-horizon SWE tasks and a 56% score

Epoch introduced MirrorCode, a benchmark where models reimplement real programs from specs with no internet and hidden held-out tests; the best current score is 56%. The setup matters because it scales inference into multi-day runs and targets software jobs estimated to take humans weeks.

Epoch releases MirrorCode with 25 long-horizon SWE tasks and a 56% score
New
Benchmarks·26th June·5 min read
Breaking

Google AI Studio adds Design Variations for one-click UI layout proposals

Google AI Studio shipped Design Variations, which generates multiple UI directions from an existing build and lets users apply one directly. It matters because builders can branch app presentation without rewriting aesthetic prompts or manually rebuilding layouts.

Google AI Studio adds Design Variations for one-click UI layout proposals
New
DX Tooling·26th June·3 min read
Breaking

Perceptron adds video_frames to Mk1 and cuts 1080p time-to-first-token from ~42s to ~4s

Perceptron launched a video_frames input for Mk1 that accepts pre-decoded frames with timestamps instead of forcing clip re-encoding. The change matters for edge and sparse-footage pipelines because 10 minutes of 1080p video can start returning tokens roughly ten times faster.

Perceptron adds video_frames to Mk1 and cuts 1080p time-to-first-token from ~42s to ~4s
New
Multimodal·26th June·3 min read
See all stories →
🤖Agentic Engineering(18)
🧩Agent Development(5)
🧠Models & APIs(4)
Inference & Infrastructure(6)
🔒Security & Reliability(3)
🔬Research & Benchmarks(5)
📊Business & Policy(3)

Top storiesthis week

Breaking

OpenAI reports Codex drives 99.8% of internal AI output tokens

OpenAI published usage data showing Codex now generates 99.8% of its internal AI output tokens, with sharp growth in legal, support, recruiting, and finance. The report measures agent adoption as delegated parallel work, not just chat inside engineering.

OpenAI reports Codex drives 99.8% of internal AI output tokens
New
Codex·25th June·6 min read
See all stories →
AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.