Fresh stories

Fable 5 users report Opus 4.8 fallbacks, refusals, and $321 sessions
Users posted mixed reports after Anthropic brought Fable 5 back: some sessions stayed on Fable, while others routed most work to Opus 4.8 or stalled mid-run. Watch for routing changes and cost spikes, since reports also mention refusals on ordinary tasks and ad hoc multi-model workarounds.
xAI launches Voice Agent Builder with $0.05/min pricing and SIP routing
xAI opened a no-code builder for Grok Voice agents with phone numbers, SIP routing, call recording, MCP and API connections, and 80+ built-in voices. The beta prices audio at $0.05 per minute, plus $0.01 per minute for xAI-provided telephony.
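As a rough illustration of how the two per-minute rates combine, here is a minimal sketch. It assumes billing is linear per minute with no rounding or minimums, and that routing over your own SIP trunk avoids the telephony add-on; neither is confirmed by the report. The helper name `call_cost` is hypothetical.

```python
from decimal import Decimal

# Rates as reported for the beta; everything else here is an assumption.
AUDIO_PER_MIN = Decimal("0.05")      # audio price per minute
TELEPHONY_PER_MIN = Decimal("0.01")  # add-on for xAI-provided telephony

def call_cost(minutes, use_xai_telephony=True):
    """Estimated USD cost for a given number of call minutes (hypothetical helper)."""
    rate = AUDIO_PER_MIN
    if use_xai_telephony:
        rate += TELEPHONY_PER_MIN
    return rate * Decimal(minutes)

print(call_cost(1000))         # 1,000 minutes with xAI-provided telephony
print(call_cost(1000, False))  # same volume, assuming no telephony add-on
```

Under these assumptions, 1,000 minutes comes to about $60 with xAI telephony and $50 without it.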

Ramp introduces PorTAL with half-cost LoRA porting across Qwen and Gemma models
Ramp published PorTAL, a method that learns a reusable task representation once and recalibrates only a thin converter when moving that task to a new base model. In reported Qwen and Gemma experiments, it matched per-task LoRA accuracy while cutting data and cost roughly in half.


Devin launches Security Swarm with Agentic MapReduce and 36/50 GHSA hits
Cognition introduced Devin Security Swarm, a repo-wide vulnerability scanner built on an Agentic MapReduce architecture that fans out over code shards and verifies findings in sandboxes. In a 50-vulnerability GHSA eval across 14 languages, it found 36 issues at 30% lower cost per finding than the next most accurate alternative.

Claude Sonnet 5 ranks #3 on Vals and hits 183 turns on AA-Briefcase
Vals and Artificial Analysis published independent Sonnet 5 results a day after launch, placing it just behind Opus 4.8 and Fable 5 while using far more turns than Sonnet 4.6. Lower token pricing did not make agentic tasks cheaper, and some finance benchmarks still triggered refusals.

Claude Code 2.1.198 adds background agents, Chrome sessions, and eval CLI
Anthropic shipped Claude Code 2.1.198 with Claude in Chrome, background agents that auto-commit and open draft PRs, and a new eval command with ablation and judge-model options. The release also adds AWS upstream failover and retries transient mid-response network drops instead of aborting turns.
GLM 5.2 supports Amp, dcode, and Next.js workflows after Composio tops 41 tool tasks
Letta Agent launches persistent digital coworkers with Slack, Discord, and BYOK state
Top stories this week
Anthropic launches Claude Sonnet 5 with 1M context and adaptive thinking
Anthropic launched Claude Sonnet 5 across Claude, the API, and Claude Code with 1M context, adaptive thinking, and $2/$10 intro pricing through Aug. 31. Independent evals place it near Opus 4.8 on coding and tool use, so teams should benchmark it against Opus before switching.


US Commerce removes Fable 5 export controls; Anthropic restores access July 1
The US Commerce Department removed export controls on Fable 5 and Mythos 5, and Anthropic said access starts returning July 1. Fable counts against up to 50% of weekly limits through July 7 before moving to usage credits, so users should check their quota behavior and fallback paths.

Google releases Nano Banana 2 Lite and Gemini Omni Flash
Google shipped Nano Banana 2 Lite for image generation and Gemini Omni Flash for conversational video generation and editing in the Gemini API and AI Studio. The release sets image generation at about 4 seconds and $0.034 per 1K image, while Omni Flash adds multi-turn video edits at $0.10 per second.

The Information reports OpenAI cuts inference costs by more than 50% on some models
Multiple summaries of The Information report said OpenAI found inference optimizations that more than halved costs on some existing models. If that holds, it changes the margin, pricing, and usage-limit math behind ChatGPT and API serving even before new model releases arrive.

Vercel adds Dockerfile Functions and Services with VCR registry
Vercel added Dockerfile-based Functions, a Services model for multi-framework apps in one project, and a VCR registry for container images at Ship NYC. The release lets teams deploy OCI images and collocated services with atomic rollbacks, private networking, and active-CPU billing, so Docker-based apps can move without single-runtime constraints.








