Fresh stories

GLM-5.2 benchmarks at 97.6% tool-calling and 2,626 tok/s on MI355X
Kilo, Composio, Together, and Wafer posted GLM-5.2 measurements including 40/41 tool tasks, 7/10 code review, and 2,626 tok/s on MI355X. Try it for lower-cost coding and tool use, but validate cross-file reasoning and latency on your workload.
Vercel adds FUSE Sandbox mounts and Agent Runs MCP/CLI access
Vercel shipped FUSE-based Sandbox mounts for S3 and network filesystems and opened Agent Runs through MCP and CLI. Use it to connect remote state, sandbox execution, and agent-readable Eve traces for self-improving workflows.

Condense.chat opens Adeline 1 proxy for 9% agent-loop compaction
Condense.chat opened a compression proxy that strips tokens with Helene 1 and compacts settled agent loops with Adeline 1 to about 9% of their size. The service claims 100M saved tokens and 3× plan extension for Claude or Codex users, so test it on non-sensitive workflows first.


GLM-5.2 benchmarks at 97.6% tool-calling and 2,626 tok/s on MI355X
Kilo, Composio, Together, and Wafer posted GLM-5.2 measurements including 40/41 tool tasks, 7/10 code review, and 2,626 tok/s on MI355X. Try it for lower-cost coding and tool use, but validate cross-file reasoning and latency on your workload.

Fable 5 users report Opus 4.8 fallbacks and $600 Max quota rotations
Fable 5 users reported Opus 4.8 fallbacks, $600 Max-account rotations, slow browser automation, and token-saving subagents. Watch routing opacity, quota burn, and latency before relying on it for long-running agent work.

Codex app reportedly leaks GPT-5.6 Sol, Terra, and Luna model names
Codex app code now references GPT-5.6 Sol, Terra, and Luna, while posts claim Sol Ultra reaches 91.9% on TerminalBench at lower cost. Treat release timing, limits, and benchmark claims as unofficial until OpenAI publishes details.

Vercel adds FUSE Sandbox mounts and Agent Runs MCP/CLI access
Vercel shipped FUSE-based Sandbox mounts for S3 and network filesystems and opened Agent Runs through MCP and CLI. Use it to connect remote state, sandbox execution, and agent-readable Eve traces for self-improving workflows.
AI SDK adds HarnessAgent for Pi, Claude, Codex, and OpenCode
OpenUI integrates Mastra, CopilotKit, and Eve through AG-UI generative components
Condense.chat opens Adeline 1 proxy for 9% agent-loop compaction
harbor exec launches agentic-map-reduce CLI via npx skills add harbor-exec
Top storiesthis week
Grok Build adds /voice dictation with Ctrl+Space transcription
Grok Build added speech-to-text dictation for coding agents through /voice or Ctrl+Space. Try it to bring Grok-powered real-time voice input into CLI coding workflows.


MinerU supports local PDF-to-Markdown OCR with 109 languages and MCP
MinerU was documented as a local OCR pipeline for PDF, Office, and image-to-Markdown with LaTeX formulas, tables, and 109 languages. The workflow adds mineru -p, mineru-api, Gradio, and an MCP server for Claude Desktop or Cursor.

Fable 5 users report Opus 4.8 fallbacks, refusals, and $321 sessions
Users posted mixed reports after Anthropic brought Fable 5 back: some sessions stayed on Fable, while others routed most work to Opus 4.8 or stalled mid-run. Watch for routing changes and cost spikes, since reports also mention refusals on ordinary tasks and ad hoc multi-model workarounds.

Claude Code releases 2.1.200/2.1.201 with Manual approval fixes
Claude Code 2.1.200 changed Manual permission defaults and fixed background-agent crash and recovery paths; 2.1.201 removed mid-conversation Sonnet 5 harness reminders. Update to reduce accidental advances and repeated reminders in stalled sessions.

Devin launches Security Swarm with Agentic MapReduce and 36/50 GHSA hits
Cognition introduced Devin Security Swarm, a repo-wide vulnerability scanner built on an Agentic MapReduce architecture that fans out over code shards and verifies findings in sandboxes. In a 50-vulnerability GHSA eval across 14 languages, it found 36 issues at 30% lower cost per finding than the next most accurate alternative.




