AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Release

NVIDIA releases Nemotron 3 Ultra: 550B MoE, 1M context

NVIDIA shipped Nemotron 3 Ultra, a hybrid Mamba-Transformer MoE with 550B total and 55B active parameters, released with open weights, data, and recipe, plus broad runtime and host support. It matters because the model pairs frontier open benchmarks with immediate agent-serving options, though local use still needs heavy quantization or large-memory hardware.

LLM Serving·4th June·6 min read
Breaking

Anthropic reports Claude wrote 80% of merged code

Anthropic published internal metrics showing Claude wrote 80% of merged code, with 8x engineer output and 52x training-code speedups in Mythos Preview. The post matters because it gives a rare lab-side look at AI-assisted engineering gains, while still saying research judgment remains a bottleneck and recursive self-improvement is unproven.

Claude Code·4th June·6 min read
Breaking

Cognition launches Devin Productivity Guarantee with $10M cap

Cognition said it will fund Devin usage up to $10 million when measured engineering value falls below cost, and published a technical writeup estimating productive engineering hours per session. It matters because the company is shifting agent pricing from tokens to claimed output and extending coding evaluation toward much longer task horizons.

Agent Product Launch·4th June·5 min read
🤖Agentic Engineering(25)
🧩Agent Development(6)
🧠Models & APIs(8)
Inference & Infrastructure(1)
🔒Security & Reliability(4)
📊Business & Policy(1)
📌Other(2)

Top stories this week

Breaking

Gemma 4 12B ships encoder-free multimodal local model with 16GB target and 256K context

Google released Gemma 4 12B, an Apache 2.0 encoder-free multimodal model with native audio and vision for 16GB-class laptops. Day-zero support in llama.cpp, vLLM, Ollama, MLX, and SGLang should make local agents and on-device apps easier to deploy immediately.

Gemma·3rd June·5 min read
AI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.