Fresh stories

Claude Code users report 5-minute cache TTL and quota-meter regressions after March updates
GitHub issues and Hacker News threads added fresh evidence that Claude Code sessions still burn quota unexpectedly after the cache TTL change, with some users seeing usage before a prompt is sent and others recovering capacity by rolling back to 2.1.34. Watch cache reuse and metering behavior closely if you rely on long-running sessions.
Windsurf 2.0 integrates Devin for cloud agents that keep running after the IDE closes
Windsurf 2.0 launched with Devin embedded into the product, combining local agents with cloud agents that can continue across codebases after you close the laptop. The IDE now acts as a handoff layer between interactive edits and long-running remote execution.


OpenAI Agents SDK adds sandbox execution and memory controls with Vercel, Modal, E2B and Daytona
OpenAI updated the Agents SDK with sandbox execution, memory controls and run snapshotting, and launch partners Vercel, Modal, E2B and Daytona shipped integrations. Long-running agents can now keep files, credentials and execution state in isolated runtimes instead of wiring harness, compute and storage layers together manually.

Claude Code users report 5-minute cache TTL and quota-meter regressions after March updates
GitHub issues and Hacker News threads added fresh evidence that Claude Code sessions still burn quota unexpectedly after the cache TTL change, with some users seeing usage before a prompt is sent and others recovering capacity by rolling back to 2.1.34. Watch cache reuse and metering behavior closely if you rely on long-running sessions.

Gemini 3.1 Flash TTS launches with Audio Tags, 70+ languages and API preview
Google released Gemini 3.1 Flash TTS with inline Audio Tags, multi-speaker control and 70+ languages, and opened preview access through the Gemini API and AI Studio with rollout to Vertex AI and Google Vids. Independent evals ranked it near the top of current speech leaderboards, but it runs slower and costs more than the leading system.
Top storiesthis week
Claude Code updates desktop app with side-by-side sessions and integrated terminal
Anthropic rebuilt Claude Code on desktop into a drag-and-drop multi-session workspace with file editing, HTML and PDF preview, and sidebar session management. The same rollout also shipped 2.1.108 features, including an optional 1-hour cache TTL, recap, and new built-ins that affect cost and session handoff.


Google DeepMind releases Gemini Robotics-ER 1.6 with 93% instrument reading
Google DeepMind shipped Gemini Robotics-ER 1.6 to the Gemini API and AI Studio with better visual-spatial reasoning, multi-view success detection, and gauge reading. The model's 93% instrument-reading score targets robots that need to reason over cluttered scenes and physical constraints.

OpenAI opens GPT-5.4-Cyber to Trusted Access for Cyber tiers
OpenAI expanded Trusted Access for Cyber and added GPT-5.4-Cyber, a fine-tuned variant with fewer restrictions for verified defenders. The rollout shifts advanced defensive workflows into identity-gated tiers instead of a broadly available API.

Claude Code ships Routines in research preview with API and webhook triggers
Anthropic introduced Claude Code Routines, a cloud-run automation layer that can execute on schedules, API calls, and GitHub events. The rollout moves scheduling from local runs to hosted, persistent automation and adds new trigger surfaces for plan-wide use.

Hugging Face Hub launches Kernels with 1.7x-2.5x PyTorch speedups
Hugging Face introduced Kernels on the Hub to publish pre-compiled GPU kernels matched to GPU, PyTorch version, and OS. The packaging makes kernel optimizations shareable and claims 1.7x to 2.5x speedups over PyTorch baselines with torch.compile compatibility.





