Fresh stories
Gemini desktop leaks Stream to Cursor, Spark local files, and Omni ahead of I/O
Leak videos and tester reports pointed to a larger Gemini desktop app with Stream to Cursor, Spark local-file access, Live, and Omni ahead of I/O. Independent testers also reported faster 3.2 and 3.5 Flash checkpoints, but Google had not announced the features publicly.

llama.cpp adds MTP for Qwen3.6 and pushes local 27B decode to 70-160 tok/s
llama.cpp added multi-token prediction for the Qwen3.6 family, and Unsloth published MTP GGUFs claiming about 1.4-2.2x faster local generation. The update moves Qwen3.6 closer to daily-driver speeds on commodity hardware, though results still vary by ROCm build and quant.

Claude Code 2.1.144 adds `/resume` for background sessions and fixes 75s startup hangs
Claude Code 2.1.144 shipped background-session `/resume`, elapsed completion notifications, exact string replacements, and grep-based system search. It also fixes startup hangs, resize corruption, and long-session terminal glitches that affected reproducibility.


Cursor ships Composer 2.5 with 2x included usage and a 10x-compute follow-on model
Cursor released Composer 2.5 in its editor and says it is stronger on long-running tasks, with included usage doubled for a week. Early comparisons place it near Opus 4.7-class coding, and Cursor says a much larger model is still training with 10x more compute.

Devin launches Auto-Triage with long-term memory for bugs, alerts, and incidents
Cognition launched Devin Auto-Triage to watch issues across Slack, Linear, GitHub, schedules, webhooks, and observability tools. Teams can use it as an always-on investigation flow that returns context, next steps, or a PR.

Anthropic reports Stainless deal for SDKs, CLIs, and MCP servers across TypeScript, Python, Go, Java, and Kotlin
Anthropic said it is acquiring Stainless, the SDK and MCP server platform behind Anthropic’s own official SDKs across major languages. The deal matters because Anthropic is bringing a key part of its API and agent-connectivity toolchain in-house while developers reassess alternative codegen stacks.
Claude Console adds prompt cache-miss diagnostics with per-message and per-tool token costs
Qwen opens 3.7 Max Preview and Plus Preview on Arena with a #10 coding rank
Top stories this week
Manus introduces Scheduled Tasks 2.0 with task continuation and self-updating web apps
Manus upgraded scheduled work so recurring jobs can continue inside the same task and drive background updates in Manus-built web apps. That matters because long-lived automations can retain context between runs instead of rebuilding state each time.


Files SDK 1.4 adds 9 storage adapters, an agent CLI, and optional peer deps
Files SDK 1.4 shipped nine new storage adapters, a CLI for agents, an installable skill, and optional peer dependencies. The update broadens storage coverage while sharply shrinking install weight, though adapter dependencies now need explicit installation.

Vercel cuts firewall-mitigated request charges to $0 for denied, challenged, and rate-limited traffic
Vercel stopped billing for requests blocked, challenged, or rate-limited by Vercel Firewall, extending free mitigation beyond DDoS and system rules. Teams can tighten custom edge protections without paying for attack traffic they reject.

Hermes Agent ships v0.14.0 with Grok subscriptions, Codex runtime, and Windows beta
Nous Research shipped Hermes Agent v0.14.0 with Grok subscription access, Codex as an OpenAI runtime, LINE, native video generation, and a Windows beta. This matters because Hermes is moving beyond point integrations into a broader agent runtime with new access paths and deployment surfaces.

Gemini users report Canvas and Fast mode routing to 3.2 variants ahead of I/O
Multiple users posted reproducible steps and videos showing Gemini app UI changes, Thinking Level rollout, and Fast mode or Canvas sessions that look like 3.2 or 3.5-class routing. This matters because Google appears to be testing new model paths and app surfaces in production ahead of I/O, though the exact model names remain unconfirmed.
