Fresh stories
OpenAI Codex adds Windows computer use and ChatGPT mobile remote control
OpenAI added computer use to Codex on Windows and lets ChatGPT mobile steer tasks running on Windows PCs. The update extends Codex to existing Windows dev machines and adds remote review and debugging from mobile.

Step 3.7 Flash launches with day-one support in Kilo, Modal, SGLang, Hermes, and DesignArena
Step 3.7 Flash landed immediately across Kilo, Modal, SGLang, Hermes-linked tooling, and DesignArena as the model’s 198B MoE, 256K-context release spread through the stack. The breadth of day-one support gives engineers multiple ways to serve, benchmark, and wire the new open-weight multimodal model into agents.

llama.cpp launches official site with one-line installer and unified `llama` CLI
llama.cpp now has an official website and a single-line installer that provides one `llama` entrypoint for running, serving, and agent integrations. The packaging change simplifies local setup while reusing GGUF models already on disk.


OpenAI Codex adds Windows computer use and ChatGPT mobile remote control
OpenAI added computer use to Codex on Windows and lets ChatGPT mobile steer tasks running on Windows PCs. The update extends Codex to existing Windows dev machines and adds remote review and debugging from mobile.

Opus 4.8 users report false greens, token burn, and mixed benchmark gains
A day after launch, users and third-party evals reported false verified claims, million-token loops, and mixed task results despite strong headline wins. Watch task-by-task results and token cost closely because reliability varied sharply by effort setting and harness.

Conductor, CC Mirror, and Codex add Claude-style Dynamic Workflows
A day after Claude Code introduced Dynamic Workflows, builders shipped ports and clones for Codex, Conductor, and GLM-backed CC Mirror. The rapid ports turn the feature into a reusable orchestration pattern rather than an Anthropic-only runtime.

Step 3.7 Flash launches with day-one support in Kilo, Modal, SGLang, Hermes, and DesignArena
Step 3.7 Flash landed immediately across Kilo, Modal, SGLang, Hermes-linked tooling, and DesignArena as the model’s 198B MoE, 256K-context release spread through the stack. The breadth of day-one support gives engineers multiple ways to serve, benchmark, and wire the new open-weight multimodal model into agents.
Claude Opus 4.8 adds mid-conversation system messages without breaking prompt cache
Gemini API adds Managed Agents with sandboxed Linux, web access, and file I/O
Hermes Agent adds Tool Search to load only needed MCP and plugin tools
llama.cpp launches official site with one-line installer and unified `llama` CLI

Cursor adds auto-review mode with classifier subagent and fewer approval prompts

Claude Code 2.1.158 adds auto mode for Bedrock, Vertex, and Foundry

Codex iOS adds /side conversations, diff summaries, and Spotlight shortcuts

Vercel Sandbox adds Docker support with persistent images and isolated container runs
Top storiesthis week
Claude Opus 4.8 ships with 69.2% SWE-Bench Pro and 2.5x Fast mode
Anthropic released Claude Opus 4.8 across Claude, the API, and major clouds with higher coding scores and a cheaper 2.5x-speed Fast mode. Use it for coding workloads that want better benchmark performance without a price increase over 4.7.


Claude Code 2.1.154 adds Dynamic Workflows for hundreds of parallel subagents
Claude Code 2.1.154 added Dynamic Workflows, a research-preview mode that writes orchestration scripts and runs hundreds of subagents in one session. Anthropic also shipped 2.1.156 to fix Opus 4.8 thinking-block API errors, so teams should watch for workflow and API stability.

Agent tools add Claude Opus 4.8 to Cursor, Warp, OpenRouter, and Perplexity on day one
Independent IDEs, gateways, and agent runtimes rolled out Claude Opus 4.8 within hours of launch, including Cursor, Warp, OpenRouter, and Perplexity. That matters because teams can benchmark or swap the model into existing workflows without waiting for connector lag.

OpenAI updates GPT-5.5 Instant with writing blocks and less bullet-heavy replies
OpenAI rolled a new GPT-5.5 Instant into ChatGPT and the API with less bullet-heavy output, better pacing, and higher multilingual quality. The update also replaces Canvas in GPT-5.5 Instant and Thinking with in-chat writing and code blocks, so users should migrate workflows while legacy models still keep Canvas temporarily.

Google makes Nano Banana 2 and Nano Banana Pro GA with video input and $0.045/$0.134 pricing
Google moved Nano Banana 2 and Nano Banana Pro to GA in AI Studio and the Gemini Enterprise Agent Platform. Nano Banana 2 also takes video as input, giving image pipelines published per-image pricing and a production API.







