Fresh stories
OpenAI updates Codex with locked-Mac control and Appshots
OpenAI shipped a Codex update that lets the mobile app control a locked Mac, adds Appshots for screen context, and graduates `/goal`. It also adds browser annotation tools, team plugin sharing, and expanded analytics for business users.

Cognition adds Windows VMs to Devin for MSBuild, IIS, and .NET migrations
Cognition added native Windows VMs to Devin so it can build, run, and test Windows applications with MSBuild, IIS, PowerShell, and SQL Server. The rollout lets Devin handle enterprise codebases where Linux sandboxes are not enough.

Claude Code 2.1.147 adds Workflow tool and `/code-review` effort levels
Claude Code 2.1.147 added a deterministic Workflow tool, renamed `/simplify` to `/code-review`, and tightened sandboxing; 2.1.148 followed with a fix for the Bash 127 regression. The release matters because it changes multi-agent orchestration and review behavior while restoring automation reliability for existing Claude Code setups.


Qwen3.7 Max launches with 1M context, 35-hour autonomy, and 56.6 AA Index
Alibaba launched Qwen3.7 Max as its new flagship agent model with 1M context, stronger coding and reasoning scores, and cross-harness benchmarks. OpenRouter, Together, AI Gateway, and Kilo support it on day one, making it ready for immediate deployment.

LangChain opens Managed Deep Agents private beta with deepagents deploy and auth proxy
LangChain opened a private beta for Managed Deep Agents, a model-agnostic deployment layer built on deepagents with durable execution, sandboxes, and a context hub. The release turns deep-agent rollout into a single config-and-deploy flow and adds an auth proxy boundary for agent actions.

Turbopuffer reports $100M run-rate and a 95% Cursor code-search cost cut
Google AI Studio opens iOS pre-registration for a July 1 mobile app launch
Datasette Agent releases 0.1a3 with SQL chat, charts, and Fly sandbox plugins
Top stories this week
OpenAI reports internal reasoning model disproves Erdős's 1946 unit-distance conjecture
OpenAI said an internal general-purpose reasoning model disproved Erdős's 1946 unit-distance conjecture without a math-specific scaffold or Lean. If the linked proof and expert commentary hold up, it shifts frontier-model discussion toward original research, not just benchmark performance.


Gemini 3.5 Flash users report 3x price hikes and broken tool chains one day after launch
Users reported failed harness runs, benchmark misses, broken Calendar and video-editing flows, and later a tripled Antigravity rate limit after Gemini 3.5 Flash launched. Watch real agent workflows closely, because the speed gains are arriving with higher spend and unstable behavior.

Lovable adds is_stuck pipeline with Overflow retrieval to cut stuck rate 5%
Lovable described a production loop where an is_stuck classifier detects repeated failures, Overflow injects past solution pairs, and send_feedback escalates real tool failures. The system lowered stuck rate 5% and raised publish rate 2%, so teams can use the same signal to debug outages and agent frustration.

GitHub reports 3,800 internal repos breached via poisoned VS Code extension
Posts reported that GitHub contained a breach after a poisoned VS Code extension compromised an employee device, and that the attacker's claim of roughly 3,800 internal repos matches the investigation. Related Shai-Hulud payload reports are pushing teams to audit `pull_request_target`, extension trust, and secret rotation.

Cohere releases Command A+ under Apache 2.0 with 25B active params and 2x H100 deployment
Cohere open-sourced Command A+, a 218B MoE multimodal model with 25B active parameters, 48-language support, and deployment starting at two H100s. Artificial Analysis put it at 37 on its Intelligence Index and 281 tok/s, and vLLM plus Transformers added support.










