Fresh stories
Qwen3.6-27B releases with 77.2 SWE-Bench Verified and Apache 2.0
Alibaba released Qwen3.6-27B, a dense open model with multimodal input and thinking or non-thinking modes that beats Qwen3.5-397B-A17B across major coding benchmarks. Day-one support across vLLM, SGLang, Ollama, llama.cpp, GGUF, and MLX makes it ready for local and hosted coding agents.

OpenAI releases Privacy Filter with 128K context and Apache 2.0 PII redaction
OpenAI open-sourced Privacy Filter, a small open-weight model for detecting and masking personally identifiable information in long text locally. Teams can redact logs, prompts, and secrets before sending data into other AI systems or external services.

Claude Code 2.1.118 adds Vim visual mode, MCP hooks, and DISABLE_UPDATES
Anthropic shipped Claude Code 2.1.118 with Vim v and V selection, hooks that can invoke MCP tools, merged usage commands, and an environment variable that blocks updates. The CLI gains tighter terminal editing and better fleet control for teams that pin versions or manage plugin and MCP behavior.


Qwen3.6-27B releases with 77.2 SWE-Bench Verified and Apache 2.0
Alibaba released Qwen3.6-27B, a dense open model with multimodal input and thinking or non-thinking modes that beats Qwen3.5-397B-A17B across major coding benchmarks. Day-one support across vLLM, SGLang, Ollama, llama.cpp, GGUF, and MLX makes it ready for local and hosted coding agents.

Google launches Gemini Enterprise Agent Platform with Agent Studio and 200+ models
Google introduced Gemini Enterprise Agent Platform as the evolution of Vertex AI, with Agent Studio, shared agent management, and Model Garden access to 200-plus models. Enterprises now get one stack for building, governing, and deploying agents across Gemini and Workspace surfaces.

Google launches TPU 8t and TPU 8i with 3x pod compute and 1,152-chip inference pods
Google unveiled eighth-generation TPUs split into TPU 8t for training and TPU 8i for inference, saying 8t delivers nearly 3x per-pod compute over Ironwood while 8i links 1,152 chips in a pod. Google is tuning its hardware stack for larger training runs and lower-latency agent inference at cloud scale.

Kimi K2.6 adds free Hermes and Cline access plus Replicate, Perplexity, and Together support
A day after Kimi K2.6’s launch, providers and tools opened new access paths including temporary free use in Hermes and Cline plus availability on Replicate, Together, Perplexity, and Tinker. Engineers can test the open model across agent harnesses and hosted runtimes without standing up their own stack first.
OpenAI releases Privacy Filter with 128K context and Apache 2.0 PII redaction
OpenAI launches ChatGPT for Clinicians and HealthBench Professional in U.S. preview
GitHub Copilot adds bring-your-own keys across Free, Pro, Business, and Enterprise
Claude Code 2.1.118 adds Vim visual mode, MCP hooks, and DISABLE_UPDATES
Top storiesthis week
OpenAI launches GPT Image 2 with thinking, 2K outputs, and text rendering gains
OpenAI released GPT Image 2 in ChatGPT, Codex, and the API with thinking mode and 2K outputs. Early tests and Arena scores suggest it is usable for slides, UI mockups, and dense infographic layouts.


Google launches Deep Research Max with MCP, native charts, and 85.9% BrowseComp
Google added Deep Research and Deep Research Max to the Gemini API with collaborative planning, multimodal inputs, MCP support, and native charts. The agents push cited web-plus-private-data reports into developer workflows, and Max is tuned for slower overnight runs.

LightOn releases LateOn and DenseOn at 149M params with BEIR 57.22
LightOn open-sourced DenseOn and LateOn plus the training pipeline behind them, including 1.4 billion query-document pairs and decontaminated BEIR results. Teams can use the small open retrieval models and reproduced data mixtures instead of opaque closed-data baselines.

Anthropic tests removing Claude Code from Pro, then restores the pricing page
Anthropic briefly removed Claude Code from new Pro signups on its pricing page, then staff said it was a small test and the page was reverted. Watch subscription pages closely if you rely on entry-level access to Anthropic's coding agent.

OpenRouter adds Firecrawl web search with full-page markdown grounding
OpenRouter added Firecrawl as a search provider, letting models ground responses in scraped full web pages instead of snippet-only search. The launch folds crawling into the existing plugin settings flow and includes a capped free plan on the Firecrawl side.






