Fresh stories
Cognition launches Devin Fusion with mid-session routing and 35% lower Fable-class cost
Cognition launched Devin Fusion, a hybrid coding harness that reroutes work mid-task and says it cuts Fable-class cost by 35%. Use it when upfront routing misses late complexity; the router can re-evaluate after investigation starts.

Vercel adds useRealtime, generateSpeech, and transcribe to AI Gateway
Vercel shipped realtime speech and transcription support in AI Gateway and AI SDK 7, then added Grok voice models through the same interface. The update puts voice agents on the same gateway, WebSocket, and AI SDK stack Vercel already uses for text models.

Snowflake releases Arctic RL with ZoRRo: Text2SQL-R2 training drops to ~36 hours
Snowflake open-sourced Arctic RL and said its ZoRRo optimization delivers up to 6x actor-update speedup and 3.5x end-to-end gains. The repo packages those gains into VeRL and SkyRL integrations plus open Text2SQL and multi-hop QA recipes.


Cognition launches Devin Fusion with mid-session routing and 35% lower Fable-class cost
Cognition launched Devin Fusion, a hybrid coding harness that reroutes work mid-task and says it cuts Fable-class cost by 35%. Use it when upfront routing misses late complexity; the router can re-evaluate after investigation starts.

Meituan releases LongCat 2.0: 1.6T MoE on domestic chips
Meituan disclosed LongCat 2.0, a 1.6T-parameter MoE with about 48B active parameters, 1M context, and 35T training tokens on domestic hardware. The release ties a near-frontier open model to a Chinese domestic compute stack and a custom sparse-attention design.

Vercel raises Functions package limit to 5 GB on Fluid compute
Vercel raised the maximum package size for Functions on Fluid compute from 250 MB to 5 GB, a 20x increase. The change removes a common deployment blocker for browser automation, larger Python AI stacks, image processing, and heavier backend workloads.

Vercel adds useRealtime, generateSpeech, and transcribe to AI Gateway
Vercel shipped realtime speech and transcription support in AI Gateway and AI SDK 7, then added Grok voice models through the same interface. The update puts voice agents on the same gateway, WebSocket, and AI SDK stack Vercel already uses for text models.
Claude Code 2.1.196 adds org default model and pending approval for repo-local MCP
Next.js 16.3 Preview cuts Turbopack memory up to 90% and warms builds 5.5x
Snowflake releases Arctic RL with ZoRRo: Text2SQL-R2 training drops to ~36 hours
Codex fixes usage overcounting with one extra banked reset and auto-review rollback
Top storiesthis week
Codex users report /goal, /rewind, and /compact workflows after launch
A day after /goal and thread automations landed in Codex, practitioners started standardizing on /goal specs, /fork or /side detours, and /rewind plus /compact recovery. The pattern matters because verifier design and compaction timing now control how well long runs hold together.


Plannotator v0.21.3 adds file-scoped review comments and Codex app-server support
Plannotator v0.21.3 shipped file-scoped comments, a unified review UX, default per-file Ask AI chats, and a more reliable Codex app-server path. It matters because guided reviews and plan checks can now plug into agent workflows with less custom glue.

Google limits Meta's Gemini use after capacity shortages
The FT reported that Google capped Meta's Gemini usage after Meta asked for more model capacity than Google could supply, affecting internal safety, support, ad, and coding projects. The restriction matters because model access is now constrained by chip, memory, and networking capacity as much as by API contracts.

Microsoft opens SkillOpt with batch eval loops for agent SOP files
Microsoft open-sourced SkillOpt, a system that treats agent skill documents as tunable artifacts and improves them against measured task batches. It matters because practitioners are already standardizing shared /research, QA, and packageable skills across harnesses, turning skill files into a new optimization surface alongside models.

xAI tests Grok 4.5 private beta on a 1.5T V9 model with Cursor data
Multiple trackers said Grok 4.5 is in private beta at SpaceX and Tesla, built on a 1.5T V9 base with supplemental Cursor data and compared internally against an unspecified Opus model. The claims matter because xAI is signaling a faster release cadence, but the reported performance is still unverified.






