AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Breaking

Claude Code 2.1.154 adds Dynamic Workflows for hundreds of parallel subagents

Claude Code 2.1.154 added Dynamic Workflows, a research-preview mode that writes orchestration scripts and runs hundreds of subagents in one session. Anthropic also shipped 2.1.156 to fix Opus 4.8 thinking-block API errors, so teams should watch for workflow and API stability.

Claude Code · 28th May · 7 min read
Release

Hermes Agent v0.15.0 adds skill bundles and makes session search 750x faster

Nous Research released Hermes Agent v0.15.0 with skill bundles, MCP Catalog, new model support, and major performance and security work. The update cuts load times 50%, speeds session search 750x, and adds Bitwarden plus prompt-injection defenses.

Hermes Agent · 28th May · 5 min read
Breaking

Artificial Analysis launches AA-WER Streaming with Cartesia Ink-2 at 3.7% WER

Artificial Analysis launched AA-WER Streaming to benchmark streaming speech-to-text models on accuracy and latency for voice agents. The first leaderboard puts Cartesia Ink-2 and ElevenLabs Scribe v2 on the price-latency frontier, so teams should compare cost against latency before choosing a model.

Voice Agents · 28th May · 4 min read
🤖Agentic Engineering(22)
🧩Agent Development(3)
🧠Models & APIs(4)
Inference & Infrastructure(5)
🔒Security & Reliability(1)
🔬Research & Benchmarks(3)
📊Business & Policy(1)

Top stories this week

Breaking

DeepSWE benchmarks GPT-5.5 at 70% on 113 tasks across 91 repos

DeepSWE launched a coding benchmark built from 113 original tasks across 91 repos and five languages, with GPT-5.5 leading at 70%. The setup is meant to better reflect repo search, multi-file edits, and verification in real agent workflows.

Benchmarks · 27th May · 5 min read
AI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.