Fresh stories

GLM-5.2 adds Perplexity Agent API and Droid support on Baseten at >280 TPS
GLM-5.2 added Perplexity Agent API, Droid, and more hosting options, while Baseten reported over 280 TPS and sub-0.8s TTFT. Builders should watch the cost and benchmark data as it moves into production agent stacks.
Hermes Agent adds Windows and Linux GUI computer use via TryCua
Hermes Agent added GUI computer-use support for Windows and Linux through TryCua drivers, extending beyond existing macOS support. Teams running desktop automation across mixed operating systems should test the new coverage.

Claude Code 2.1.186 adds claude mcp login and auto-replies after ! shell commands
Claude Code 2.1.186 adds CLI-based MCP auth, automatic assistant replies after ! shell commands, and tighter named-subagent permission checks. The update cuts interactive setup for remote MCP servers and tightens policy-heavy agent workflows.


GLM-5.2 adds Perplexity Agent API and Droid support on Baseten at >280 TPS
GLM-5.2 added Perplexity Agent API, Droid, and more hosting options, while Baseten reported over 280 TPS and sub-0.8s TTFT. Builders should watch the cost and benchmark data as it moves into production agent stacks.

Fugu Ultra testers report 30-minute runs and 17x GLM cost after launch
Sakana launched Fugu Ultra on AI Gateway and published a technical report, with early testers sharing mixed results. Reports mention polished outputs on some tasks, but also 30-minute runs, uneven coding quality, and much higher cost than GLM-5.2.

Google ships Interactions API in GA as Gemini default with background agents
Google put the Interactions API into GA as the new default for Gemini, adding background execution, managed agents, remote sandboxes, and multimodal tools. Builders now get one stateful interface for models, long-running jobs, and future Gemini Omni support.

Vercel supports Claude Design one-click deploys
Claude Design now deploys directly to Vercel with one click. The integration turns design output into a live previewable app without leaving the design flow, extending Claude Design beyond imports and code sync.
Hermes Agent adds Windows and Linux GUI computer use via TryCua
Vercel supports WebSockets in Fluid with Socket.IO and 30-minute reconnects
Files SDK 2.0 adds files-sdk/api gateway and React, Vue, Svelte clients
Claude Code 2.1.186 adds claude mcp login and auto-replies after ! shell commands
Top storiesthis week
Sakana Fugu launches one-API orchestration with Fable benchmark claims
Sakana AI launched Fugu and Fugu Ultra as OpenAI-compatible orchestration models that route, verify, and synthesize across multiple models. The release matters because Sakana is selling multi-agent coordination as a single endpoint, but it has not fully disclosed model mix or pass-through costs.


Human-on-the-Bridge compares reusable eval assets with LLM judges and human review
A new Human-on-the-Bridge paper argued for front-loading expert judgment into reusable evaluation assets, while practitioners also shared double-run and multi-model review setups. The cluster matters because teams tuning agent harnesses need repeatable ways to measure behavior beyond one-off benchmark scores or subjective PR review.

sqlite-utils 4.0rc1 adds migrations and nested transactions
Simon Willison released the first sqlite-utils 4.0 release candidate with a built-in migrations system and nested transactions. The RC adds minor backward incompatibilities while expanding SQLite workflow automation for scripts and apps.

Hermes Agent adds self-hosted Mem0 and headless desktop connections
Hermes Agent can now self-host Mem0, and the desktop client can attach to headless Hermes instances or start one with the hermes desktop command. The change expands always-on memory and remote control setups outside a laptop session.

Morph supports Qwen, GLM-5.2, MiniMax M3, DeepSeek v4 with 20-35% higher code acceptance
Morph said its code-serving stack now exposes Qwen, GLM-5.2, MiniMax M3, and DeepSeek v4 with code-tuned speculative decoding. It claims 20-35% higher acceptance than Eagle 3.1 or DFlash, plus kernels for cheaper hardware.







