Fresh stories
OpenAI releases GPT-5.4, GPT-5.5, and Codex on Amazon Bedrock
OpenAI made GPT-5.4, GPT-5.5, and Codex generally available through Amazon Bedrock. AWS shops can now use OpenAI models inside existing IAM, compliance, and procurement workflows instead of adopting a separate vendor stack.


MiniMax M3 adds OpenCode, Hermes Agent, Atomic Chat, and Vercel AI Gateway support
A day after MiniMax M3 launched, OpenCode, Hermes Agent, Flowith, Atomic Chat, Kilo Code, Cloudflare AI Gateway, and Vercel AI Gateway shipped support. That breadth shows M3 plugged into agent harnesses and routing layers immediately, not just its own API.
Lovable introduces TanStack Start output with SSR, server functions, and type safety
Lovable moved newly generated apps onto TanStack Start, adding route-level SSR, SSG, CSR, server functions, and stricter type-safe boundaries to its generated stack. The migration matters because framework primitives become guardrails for both generated-code quality and deploy-anywhere app behavior.


OpenAI releases GPT-5.4, GPT-5.5, and Codex on Amazon Bedrock
OpenAI made GPT-5.4, GPT-5.5, and Codex generally available through Amazon Bedrock. AWS shops can now use OpenAI models inside existing IAM, compliance, and procurement workflows instead of adopting a separate vendor stack.

NVIDIA launches Cosmos 3 open 16B and 64B omnimodels with datasets and SGLang support
NVIDIA released Cosmos 3 as an open omnimodel family with 16B and 64B variants, plus code, datasets, and a coalition around physical AI. The release matters because it ships with serving support and top open-weight image and video rankings, so teams can use it beyond a research teaser.

MiniMax M3 adds OpenCode, Hermes Agent, Atomic Chat, and Vercel AI Gateway support
A day after MiniMax M3 launched, OpenCode, Hermes Agent, Flowith, Atomic Chat, Kilo Code, Cloudflare AI Gateway, and Vercel AI Gateway shipped support. That breadth shows M3 plugged into agent harnesses and routing layers immediately, not just its own API.

Perplexity launches Search as Code in Agent API with WANDR 0.386 and Python search pipelines
Perplexity replaced one-shot search calls with Search as Code, a Python-based search runtime in its Agent API that is also now the default in Computer. The change matters because agents can batch, rank, filter, and aggregate search steps inside code, and Perplexity says the system scored 0.386 on WANDR versus 0.152 for the next system.
Microsoft and NVIDIA launch RTX Spark PCs with 128GB unified memory and 1 PFLOP FP4
MiniMax M3 users report slow runs and broken code after launch
Claude Code resets 5-hour and weekly limits after Opus 4.8 parallel-tool bug
Lovable introduces TanStack Start output with SSR, server functions, and type safety

Qwen releases Qwen 3.7 Plus with multimodal agent mode and browser demos

Cursor raises Teams usage limits and adds Premium seats with 5x usage

Codex releases Python SDK with thread control, session resume, and sandbox access

Browser Use launches browser infrastructure at $0.02/hour with subsecond cold starts

NVIDIA claims Nemotron 3 Ultra 550B runs 5x faster and 30% cheaper

Files SDK 1.7 adds resumable uploads, provider sync, and read-only clients
Top storiesthis week
Codex raises weekly and hourly limits to 100% after 5 million users
OpenAI restored Codex weekly and hourly quotas across paid ChatGPT plans after Tibo Sottiaux said the product hit 5 million users. Watch for long-running QA loops, migration PRs, and remote desktop sessions that can still burn through quotas fast.


Opus 4.8 users report token burn, failed tool calls, and DeepSWE gaps
Three days after Opus 4.8 launched, new tests and field reports added failed tool calls, Bash-specific breakdowns, and higher token burn to the complaint list. Users report materially worse cost and stability in long coding sessions, while DeepSWE and GBA Eval point in different directions.

MiniMax M3 launches with 1M context and 59.0 SWE-Bench Pro
MiniMax shipped M3 with a 1M-token context window, native multimodal input, and frontier coding claims across SWE-Bench Pro, Terminal Bench, and MCP Atlas. It also appeared on OpenRouter, Ollama Cloud, Venice, Hermes, Cline, Together, and Arena on day one.

Coding-agent builders add shared memory, provider routing, and app launchers
Independent developers shipped sidecars that let Claude Code, Cursor, and Codex share memory, hot-swap model providers, package local projects as apps, and automate browser QA. Try these reusable tools if you want memory, routing, QA automation, and app packaging outside editor-specific features.

Developers report Codex beats Claude Code on DeepSWE, token burn, and multi-hour /goal sessions
Independent users compared GPT-5.5/Codex with Opus 4.8/Claude Code using DeepSWE cost charts, GBA Eval runs, and long coding sessions. The split matters because engineers choosing a daily coding stack now have external quality-versus-cost evidence instead of only vendor launch claims.







