Fresh stories
Anthropic launches Claude Managed Agents with Dreaming, Outcomes, and multiagent orchestration
Anthropic added Dreaming in research preview plus public-beta Outcomes, multiagent orchestration, and webhooks to Claude Managed Agents. Teams should try the new grader loops and shared-container sub-agents if they want more control over long-running agent work.

OpenAI opens Multipath Reliable Connection for 100,000-plus GPU training clusters
OpenAI and partners released Multipath Reliable Connection, an RDMA transport that spreads training traffic across multiple network paths and is already deployed on the company's largest clusters. The protocol targets congestion and failure recovery in giant GPU trainings, and teams building similar clusters should track the Open Compute Project release.

Zed launches Business plan with org-wide AI controls and $30 per-seat pricing
Zed v1.1 introduced a Business plan with org-wide model controls, spend tracking, and enforceable data policies, alongside BYOK or Zed-hosted AI. Admins can use it to govern agent features and model access centrally instead of per-user settings.


Anthropic launches Claude Managed Agents with Dreaming, Outcomes, and multiagent orchestration
Anthropic added Dreaming in research preview plus public-beta Outcomes, multiagent orchestration, and webhooks to Claude Managed Agents. Teams should try the new grader loops and shared-container sub-agents if they want more control over long-running agent work.

Anthropic doubles Claude Code 5-hour limits after SpaceX Colossus 1 compute deal
Anthropic said a SpaceX compute deal will add 300+ MW and 220,000+ NVIDIA GPUs, and it doubled Claude Code 5-hour limits across paid plans. It also raised Opus API ceilings; users should still watch the unchanged weekly caps.

Zyphra releases ZAYA1-8B with <1B active params and Markovian RSA reasoning
Zyphra released ZAYA1-8B, an Apache-2.0 reasoning MoE with compressed-convolutional attention and bounded-context Markovian RSA test-time compute. The model targets math and coding workloads while keeping the active parameter count below 1B.

OpenAI opens Multipath Reliable Connection for 100,000-plus GPU training clusters
OpenAI and partners released Multipath Reliable Connection, an RDMA transport that spreads training traffic across multiple network paths and is already deployed on the company's largest clusters. The protocol targets congestion and failure recovery in giant GPU trainings, and teams building similar clusters should track the Open Compute Project release.
Claude Code 2.1.132 adds CLAUDE_CODE_DISABLE_ALTERNATE_SCREEN and session_id hooks
Firecrawl adds Question format to /scrape with grounded answers and 100x fewer tokens
OpenCode adds minimal mode with native scrollback and plugin tracing
Zed launches Business plan with org-wide AI controls and $30 per-seat pricing

Navigator n1.5 claims web computer-use Pareto gains on accuracy, latency, and cost

Perplexity adds Finance Search to Agent API with live data and FinSearchComp T1 lead

Braintrust reports unauthorized AWS-account access and tells customers to rotate provider keys
Top storiesthis week
Gemma 4 adds MTP drafters for up to 3x faster decoding
Google released Multi-Token Prediction drafters for Gemma 4 and says decoding can run up to 3x faster without output-quality loss. vLLM and SGLang support shipped day one, so local and server deployments can try the speedup immediately.


ChatGPT ships GPT-5.5 Instant by default with Memory Sources
OpenAI is rolling GPT-5.5 Instant into ChatGPT as the default model and exposing it as gpt-5.5-chat-latest, alongside Memory Sources for personalized replies. The model also claims 52.5% fewer high-stakes hallucinations, so watch for behavior changes in production prompts.

ProgramBench reports 0% on ffmpeg, SQLite, and ripgrep rebuilds without internet
The SWE-Bench team released ProgramBench, which asks models to rebuild real software from executables alone, and the initial complete-pass score is 0% across models. It matters as a harsher long-horizon coding benchmark, though its all-tests-pass metric and simpler harness make it a stress test rather than a direct proxy for production agents.

Anthropic launches 10 finance agent templates for Claude Code
Anthropic released ready-to-run finance templates for pitchbooks, valuation reviews, KYC, and month-end close across Cowork, Claude Code, and Managed Agents. Use them to start with bundled connectors, skills, and subagents instead of building each workflow from scratch.

Realtime TTS-2 releases with sub-200 ms TTFA and 100+ languages
Realtime TTS-2 ships as a low-latency speech model that conditions on prior audio turns, not just text, and claims sub-200 ms time-to-first-audio across 100+ languages. The release matters for voice-agent stacks because Replicate and LiveKit are already exposing it for real-time integration work.







