Fresh stories
DeepSeek removes visual-primitives repo after 90-KV vision details
DeepSeek briefly published a paper and threads on point-and-bbox reasoning, about 90 KV entries per 800² image, and RL-trained vision experts, then removed the repo and related mentions. The technique looked like a low-token path to computer use and multimodal reasoning in V4-Flash, but availability and reproducibility are now unclear.

Claude Security opens public beta with Opus 4.7 repo scans
Anthropic opened Claude Security to Claude Enterprise customers, letting teams scan repositories, validate findings, and review suggested patches inside Claude. The beta also adds scheduled scans, directory targeting, exports, and webhook alerts for recurring codebase reviews.

OpenAI adds Advanced Account Security with passkeys
OpenAI added an opt-in security mode for ChatGPT and Codex that disables password-based recovery, shortens sessions, and requires passkeys or physical keys. Higher-risk accounts get stronger phishing resistance and automatic exclusion from model training when the mode is enabled.


DeepSeek removes visual-primitives repo after 90-KV vision details
DeepSeek briefly published a paper and threads on point-and-bbox reasoning, about 90 KV entries per 800² image, and RL-trained vision experts, then removed the repo and related mentions. The technique looked like a low-token path to computer use and multimodal reasoning in V4-Flash, but availability and reproducibility are now unclear.

Codex adds `/goal`, role-based workflows, and 20% faster browser use
OpenAI expanded Codex with role-based work-flows, app connections, in-app previews, and the `/goal` command, while also improving browser use by about 20%. The update lets Codex keep working across docs, slides, spreadsheets, and web actions instead of staying in a single coding thread.

GPT-5.5 ranks at 71.4% on UK AISI cyber eval with 2/10 TLO completions
Multiple summaries of the UK AISI report say GPT-5.5 roughly matches Claude Mythos Preview on long-horizon cyber tasks, including 2 of 10 end-to-end TLO completions. That matters because the model is broadly usable today, shifting cyber-workflow choices toward availability and mitigations rather than gated access alone.

Claude Security opens public beta with Opus 4.7 repo scans
Anthropic opened Claude Security to Claude Enterprise customers, letting teams scan repositories, validate findings, and review suggested patches inside Claude. The beta also adds scheduled scans, directory targeting, exports, and webhook alerts for recurring codebase reviews.
Cursor SDK supports 11 integrations across Gmail, Chrome, CI, and multi-repo agents
White House blocks Mythos expansion from ~50 to ~120 organizations
Qwen-Scope releases SAE toolkit for Qwen3.5-27B steering
OpenAI adds Advanced Account Security with passkeys
Top storiesthis week
Cursor releases SDK for CI/CD, local or cloud agents, and starter apps
Cursor shipped a TypeScript SDK that exposes its runtime, harness, and models for CI/CD jobs, background automations, and embedded agents. The launch lets teams treat Cursor as programmable agent infrastructure, though it still depends on Cursor API access.


Opus 4.7 users report OpenClaw refusals, cache TTL spikes, and billing lockouts after launch
A day after Opus 4.7 launched, users reported OpenClaw-linked refusals, cache TTL cost spikes, and billing failures in Claude Code. Anthropic appears to have eased some limits, but behavior and spend still vary sharply across agent-heavy sessions.

Electric Agents introduces sync-based multi-agent platform with shared sessions and forking
ElectricSQL launched Electric Agents, treating agents as long-lived data entities that sync across shared coding sessions, swarms, and branches. The release matters for teams building collaborative agent systems that need durable state and coordination primitives, not just one-shot task runners.

Stripe Projects adds OpenRouter, Daytona, Vercel, and Render provisioning commands
Stripe Projects added agent-friendly provisioning commands for OpenRouter, Daytona, Vercel, Render, and related tools. That lets agents buy model access, sandboxes, and hosting from the terminal instead of dashboard-driven setup.

Mistral releases Medium 3.5 with 128B weights, 256K context, and Work Mode
Mistral shipped Medium 3.5 as a 128B dense model with 256K context, configurable reasoning, remote agents in Vibe, and Work Mode in Le Chat. The release broadens Mistral’s agent stack, though early comparisons question its price-performance against newer open rivals.










