Fresh stories
Z.ai releases GLM-5.2 open weights with 1M context and 46.2% DeepSWE
Z.ai released GLM-5.2 MIT-licensed open weights with 1M context and broad runtime support. Vendor and arena results put it near frontier closed models on long-horizon coding.

Cursor reports a $60B all-stock deal with SpaceX
Cursor said it agreed to a $60B all-stock deal with SpaceX, with closing targeted for Q3 and Cursor remaining a wholly owned subsidiary. The deal ties a major coding-agent channel to SpaceX compute and gives Cursor a new strategic owner.

Flue releases 1.0 Beta with agents, workflows, and channel connectors
Flue 1.0 Beta reorganizes the framework around workflows, autonomous agents, and channel connectors while keeping model-agnostic deployment. The release gives TypeScript teams a more opinionated base for durable, long-running agents.


Commerce Department limits Claude Fable 5 exports worldwide, including foreign nationals in the U.S.
BIS and new reporting show Fable 5 restrictions now apply worldwide and can cover foreign nationals in the U.S. Teams should treat the pause as a broader access risk for allied markets and global deployments.

OpenAI opens Codex Computer Use and Chrome extension in the EEA, UK, and Switzerland
OpenAI expanded Codex in Europe with Computer Use, the Chrome extension, Memory, and Chronicle. The rollout broadens browser and desktop automation outside the U.S., though some memory features remain opt-in or preview-only.

Anthropic reports Claude Code task success stays within 7 points of software engineering across occupations
Anthropic published data from 400,000 Claude Code sessions, finding average task value rose 27% and verifiable success across occupations stayed within seven points of software engineering. The report gives teams a concrete baseline for where coding agents already generalize and where domain expertise still changes outcomes.
Exa launches Agent API at less than half the cost of GPT-5.5 and Opus
ENPIRE launches 8-agent Codex robot fleet for physical autoresearch
Top stories this week
Moonshot releases Kimi K2.7 Code HighSpeed at 180 tok/s with 2x API pricing
Moonshot rolled out HighSpeed for Kimi K2.7 Code, claiming about 180 tok/s on coding tasks, up to 260 tok/s on shorter contexts, and roughly 6x speedups. Watch the tight capacity limits and mixed benchmark results, and budget for the 2x pricing if you want the faster mode.


Report: Trump talks end without lifting Claude Fable 5 jailbreak restrictions
Talks between Anthropic and the Trump administration ended without restoring Claude Fable 5 access, and reporting said consumer access may still hinge on fixing the cited jailbreak issue. Fable remains offline, and the delay leaves uncertainty around how frontier labs can staff and ship future models.

SGLang adds DFlash and Spec V2 with 4.3x Qwen3.5-397B-A17B throughput
LMSYS and Modal shipped DFlash plus Spec V2 in SGLang, claiming 4.3x the baseline throughput and 1.5x that of native MTP on Qwen3.5-397B-A17B. If the numbers hold, the update cuts latency and serving cost for very large open models.

Anthropic delays Claude Agent SDK credit shift for claude -p and third-party apps
Anthropic paused a same-day policy change that would have moved Claude Agent SDK, claude -p, and third-party SDK apps onto separate monthly credits. Existing subscription-backed workflows continue unchanged for now, but teams should watch for the redesigned billing plan.

TryCua launches Cua-Bench for KiCad; GPT-5.5 clears 6 of 25 tasks
TryCua and Snorkel opened Cua-Bench, a computer-use benchmark with 25 expert-authored KiCad tasks graded by exact netlist matches. The early results show frontier models still struggle with GUI execution, wiring completion, and self-checking, so treat benchmark wins as incomplete evidence of readiness for real computer-use work.








