Fresh stories
DeepSeek cuts V4 Pro pricing 75% to $0.435 input and $0.87 output
DeepSeek made the temporary 75% V4 Pro discount permanent, cutting first-party pricing to $0.435 per million input tokens and $0.87 output. Artificial Analysis now places it on the cost-performance frontier, but practitioners still question per-task efficiency on harder coding work.

Hermes Agent adds Bitwarden Secrets Manager for key rotation and team access
Hermes Agent now supports Bitwarden Secrets Manager, giving users a managed way to store, rotate, and share agent credentials. That matters because secret handling becomes a real operational problem once agents move beyond solo local setups.

Antigravity updates Gemini 3.5 Flash with permanent 3x quotas and 2x context
A day after Antigravity raised weekly Gemini quotas, the team said the 3x increase is permanent and doubled Gemini 3.5 Flash max context in AGY. The same update batch also clarified the IDE split and shipped Windows fixes, changing day-to-day limits and workflow behavior for developers.


DeepSeek cuts V4 Pro pricing 75% to $0.435 input and $0.87 output
DeepSeek made the temporary 75% V4 Pro discount permanent, cutting first-party pricing to $0.435 per million input tokens and $0.87 output. Artificial Analysis now places it on the cost-performance frontier, but practitioners still question per-task efficiency on harder coding work.

Anthropic reports 10,000 high-severity flaws in Project Glasswing
Anthropic said Project Glasswing has found more than 10,000 high- or critical-severity issues across open-source software since launch. Mythos-class models could reach general release after stronger safeguards, so teams should watch patching and disclosure timelines.

Qwen 3.7 Max users report 5-minute cache creation, $43 vibe-coding bills, and uneven task quality
A day after Qwen 3.7 Max launched, users posted both standout benchmark wins and rough real-work reports, including 5-minute cache creation and $43 in 15 minutes of vibe coding. That matters because teams evaluating coding agents are seeing a gap between leaderboard strength and per-task reliability.

Codex users report better compaction and Colab control after v0.133.0
Developers say Codex v0.133.0 improved compaction, remote-control workflows, and Chrome-driven Colab runs after `/goal` became default. The same update window also brought easier skill discovery and new diff options, though some users saw approval-pause regressions in full-access mode.
Hermes Agent adds Bitwarden Secrets Manager for key rotation and team access
Cursor releases Composer 2.5 SDK for Python and TypeScript
Letta Code adds embedded local server with Ollama and LM Studio support
Antigravity updates Gemini 3.5 Flash with permanent 3x quotas and 2x context

Perplexity launches Bumblebee scanner for macOS and Linux developer machines

Warp adds BYOK to Warp Agent with OpenAI-compatible endpoints

Claude Code releases 2.1.149 with `/usage` breakdown and PowerShell cwd fix

MCP releases 2026-07-28 candidate with stateless requests and no session IDs
Top storiesthis week
OpenAI updates Codex with locked-Mac control and Appshots
OpenAI shipped a Codex update that lets the mobile app control a locked Mac, adds Appshots for screen context, and graduates /goal. It also adds browser annotation tools, team plugin sharing, and expanded analytics for business users.


Qwen3.7 Max launches with 1M context, 35-hour autonomy, and 56.6 AA Index
Alibaba launched Qwen3.7 Max as its new flagship agent model with 1M context, stronger coding and reasoning scores, and cross-harness benchmarks. OpenRouter, Together, AI Gateway, and Kilo support it on day one, making it ready for immediate deployment.

LangChain opens Managed Deep Agents private beta with deepagents deploy and auth proxy
LangChain opened a private beta for Managed Deep Agents, a model-agnostic deployment layer built on deepagents with durable execution, sandboxes, and a context hub. The release turns deep-agent rollout into a single config-and-deploy flow and adds an auth proxy boundary for agent actions.

Cognition adds Windows VMs to Devin for MSBuild, IIS, and .NET migrations
Cognition added native Windows VMs to Devin so it can build, run, and test Windows applications with MSBuild, IIS, PowerShell, and SQL Server. The rollout lets Devin handle enterprise codebases where Linux sandboxes are not enough.

Google AI Studio opens iOS pre-registration for a July 1 mobile app launch
Google opened iOS pre-registration for the AI Studio mobile app and confirmed native iOS and Android clients for AI Studio workflows. The rollout matters because it extends Google’s developer-facing Gemini environment beyond the browser into a mobile form factor for prototyping and testing.





