Fresh stories

Claude Code users report 5-minute cache TTL and quota-meter regressions after March updates
GitHub issues and Hacker News threads added fresh evidence that Claude Code sessions still burn quota unexpectedly after the cache TTL change, with some users seeing usage before a prompt is sent and others recovering capacity by rolling back to 2.1.34. Watch cache reuse and metering behavior closely if you rely on long-running sessions.

Windsurf 2.0 integrates Devin for cloud agents that keep running after the IDE closes
Windsurf 2.0 launched with Devin embedded into the product, combining local agents with cloud agents that can continue across codebases after you close the laptop. The IDE now acts as a handoff layer between interactive edits and long-running remote execution.

OpenAI opens GPT-5.4-Cyber to Trusted Access for Cyber tiers
OpenAI expanded Trusted Access for Cyber and added GPT-5.4-Cyber, a fine-tuned variant with fewer restrictions for verified defenders. The rollout shifts advanced defensive workflows into identity-gated tiers instead of a broadly available API.


OpenAI Agents SDK adds sandbox execution and memory controls with Vercel, Modal, E2B and Daytona
OpenAI updated the Agents SDK with sandbox execution, memory controls and run snapshotting, and launch partners Vercel, Modal, E2B and Daytona shipped integrations. Long-running agents can now keep files, credentials and execution state in isolated runtimes instead of wiring harness, compute and storage layers together manually.

Gemini 3.1 Flash TTS launches with Audio Tags, 70+ languages and API preview
Google released Gemini 3.1 Flash TTS with inline Audio Tags, multi-speaker control and 70+ languages, and opened preview access through the Gemini API and AI Studio with rollout to Vertex AI and Google Vids. Independent evals ranked it near the top of current speech leaderboards, but it runs slower and costs more than the leading system.

Parcae claims 1.3B Transformer quality from a 770M looped model
Together AI and UCSD released Parcae, a looped model that reuses layers with a constrained recurrent dynamic and reports stronger results than parameter-matched Transformers from 140M to 1.3B scales. The released models and code suggest recurrence can trade memory for quality under fixed FLOP budgets instead of scaling parameters alone.
Claude Code updates desktop app with side-by-side sessions and integrated terminal
Google DeepMind releases Gemini Robotics-ER 1.6 with 93% instrument reading
Top stories this week
AISI reports Claude Mythos completes a 32-step corporate attack range
Anthropic's Mythos system card says the model completed the AI Security Institute's 32-step corporate attack range in about 20 human hours. The benchmark matters as a cyber capability signal, but the range is easier than a real defended enterprise network.


Claude Code users report a 5-minute cache TTL and 5x Pro Max quota burn in 1.5 hours
Anthropic acknowledged a March 6 cache optimization change, and Pro Max users report that the shorter TTL plus hidden session context now burns through Claude Code quota much faster. Watch for 500 errors and stalled streams, and apply the 2.1.105 patch if your UI hangs.

Hermes Agent releases v0.9.0 with a local dashboard and monitoring APIs
Nous Research shipped Hermes Agent v0.9.0 with a local web dashboard, new monitoring APIs, and broader platform updates. Teams using multi-agent workflows should test the new controls for profile cloning and long-running dashboard-managed sessions.

Cursor updates Cursor 3 with split agents and 87% fewer dropped frames
Cursor 3 adds split-agent panes, tighter cloud-agent controls, voice input fixes, and an 87% reduction in dropped frames during large edits. The update makes the IDE easier to use as a mixed local-cloud agent workspace, while keeping editor navigation and diff review intact.

Open Agents launches a browser-based cloud coding platform with parallel sessions
Open Agents open-sources a browser-based cloud coding platform that keeps sessions running in parallel after a laptop closes. Use the reference stack if you want sandboxed VMs, model routing, and durable execution for internal coding-agent systems.



