Skip to content
AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Filters

Category

Tags

Breaking

Kimi K2.6 launches with 58.6 SWE-Bench Pro and 4,000-tool-call agent runs

Moonshot open-sourced Kimi K2.6, a 1T-parameter MoE with 32B active parameters, 256K context, multimodal input, and larger agent swarms. It now sits near frontier closed models for long-horizon coding and tool use, so teams can try it for agent workflows.

Kimi K2.6 launches with 58.6 SWE-Bench Pro and 4,000-tool-call agent runs
New
Benchmarks·20th April·7 min read
Breaking

Kimi K2.6 adds day-one support across vLLM, SGLang, Ollama, and OpenRouter

Kimi K2.6 shipped across vLLM, SGLang, OpenRouter, Baseten, Ollama, OpenCode, Hermes Agent, and Droid within hours of launch. That cuts the usual lag between model release and production trials, so mixed-provider agent stacks can test it sooner.

Kimi K2.6 adds day-one support across vLLM, SGLang, Ollama, and OpenRouter
New
LLM Serving·20th April·4 min read
Breaking

Claude Code 2.1.116 adds 67% faster /resume and safer sandbox rm checks

Claude Code 2.1.116 shipped 24 CLI changes, including faster resume on large sessions, stricter guardrails around rm and rmdir, and automatic plugin dependency installs. It also updates terminal input behavior and model surface area for agent workflows, so teams should upgrade if they rely on the CLI.

Claude Code 2.1.116 adds 67% faster /resume and safer sandbox rm checks
New
Claude Code·20th April·3 min read
See all stories →
🤖Agentic Engineering(22)
🧩Agent Development(2)
🧠Models & APIs(4)
Inference & Infrastructure(5)
🔒Security & Reliability(2)
💰Cost & Operations(2)
🔬Research & Benchmarks(2)
📊Business & Policy(2)
📌Other(1)

Top storiesthis week

Opus 4.7 users report 1.46x tokenization and faster limit burn

Four days after the Opus 4.7 launch, independent tests measured about 1.35-1.46x more text tokens than 4.6 while users kept reporting faster limit burn and weaker coding. That can change effective cost and session economics in Claude Code even if list prices stay flat.

Opus 4.7 users report 1.46x tokenization and faster limit burn
Claude Code·19th April·6 min read
See all stories →
AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.