Fresh stories
OpenAI launches Codex Chrome extension for background tabs and logged-in sites
OpenAI shipped a Chrome extension for Codex on macOS and Windows that can work across logged-in sites and multiple background tabs. It should speed up testing, data entry, and other web app tasks by letting Codex run more parallel browser work.

ElevenLabs cuts Flash TTS 55%, Scribe 45%, and Agents 20% with pay-as-you-go billing
ElevenLabs lowered self-serve pricing for ElevenAPI and ElevenAgents and added pay-as-you-go billing. The biggest listed drops are to $0.05 per 1,000 tokens for Flash TTS, $0.22 for Scribe v2 speech-to-text, and $0.08 per minute for agent calls.

Google updates Gemini Interactions API with steps schema and Api-Revision 2026-05-26
Google is replacing the Gemini Interactions API’s older outputs-and-roles structure with a steps schema for multi-step agent workflows. The change matters because SDK upgrades, migration work, and schema assumptions in existing tooling may break before the new interface reaches GA.


OpenAI adds GPT-Realtime-2, Translate, and Whisper to the Realtime API
OpenAI added GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper to the Realtime API. The update gives voice agents live reasoning, translation, and transcription, but it remains API-only rather than part of ChatGPT voice mode.

OpenAI launches Codex Chrome extension for background tabs and logged-in sites
OpenAI shipped a Chrome extension for Codex on macOS and Windows that can work across logged-in sites and multiple background tabs. It should speed up testing, data entry, and other web app tasks by letting Codex run more parallel browser work.

Mozilla reports Claude Mythos Preview fixed more Firefox bugs in April than the prior 15 months
Mozilla says Claude Mythos Preview helped it fix more Firefox security bugs in April than in the previous 15 months combined. Teams building large codebases should watch this as a strong production example of frontier models accelerating defensive vulnerability work.

Anthropic introduces Natural Language Autoencoders for Claude activations
Anthropic introduced Natural Language Autoencoders, a two-model method that translates Claude activations into text explanations and reconstructs them back. The system exposed hidden rhyme planning and evaluation awareness in Claude, but Anthropic says the explanations are useful rather than guaranteed faithful.
ElevenLabs cuts Flash TTS 55%, Scribe 45%, and Agents 20% with pay-as-you-go billing
Claude Code 2.1.133 removes per-action confirmations and adds worktree.baseRef
Ramp Sheets launches Fast Ask RL subagent with +4% exact-match gain over Opus at Haiku latency
Google updates Gemini Interactions API with steps schema and Api-Revision 2026-05-26

Google releases Gemini 3.1 Flash Lite GA with 1M context and $0.25 input pricing

OpenAI rolls out GPT-5.5-Cyber limited preview for critical-infrastructure defenders

Hermes Agent v0.13.0 adds /goal, Kanban orchestration, and custom LLM providers

Perplexity releases Personal Computer Mac app for local files and native app control
Top storiesthis week
Anthropic doubles Claude Code 5-hour limits after SpaceX Colossus 1 compute deal
Anthropic said a SpaceX compute deal will add 300+ MW and 220,000+ NVIDIA GPUs, and it doubled Claude Code 5-hour limits across paid plans. It also raised Opus API ceilings; users should still watch the unchanged weekly caps.


Anthropic launches Claude Managed Agents with Dreaming, Outcomes, and multiagent orchestration
Anthropic added Dreaming in research preview plus public-beta Outcomes, multiagent orchestration, and webhooks to Claude Managed Agents. Teams should try the new grader loops and shared-container sub-agents if they want more control over long-running agent work.

Zyphra releases ZAYA1-8B with <1B active params and Markovian RSA reasoning
Zyphra released ZAYA1-8B, an Apache-2.0 reasoning MoE with compressed-convolutional attention and bounded-context Markovian RSA test-time compute. The model targets math and coding workloads while keeping the active parameter count below 1B.

OpenAI opens Multipath Reliable Connection for 100,000-plus GPU training clusters
OpenAI and partners released Multipath Reliable Connection, an RDMA transport that spreads training traffic across multiple network paths and is already deployed on the company's largest clusters. The protocol targets congestion and failure recovery in giant GPU trainings, and teams building similar clusters should track the Open Compute Project release.

Navigator n1.5 claims web computer-use Pareto gains on accuracy, latency, and cost
Yutori rolled out Navigator n1.5 as a web computer-use model and said it improves the tradeoff between accuracy, latency, and cost for browser tasks. The launch matters because related environment-generation work is aimed at the long-horizon web workflows that make computer-use agents expensive and brittle.






