Fresh stories
Gemma 4 12B ships encoder-free multimodal local model with 16GB target and 256K context
Google released Gemma 4 12B, an Apache 2.0 encoder-free multimodal model with native audio and vision for 16GB-class laptops. Day-zero support in llama.cpp, vLLM, Ollama, MLX, and SGLang should make local agents and on-device apps easier to deploy immediately.

LangSmith launches Sandbox, LLM Gateway, and Engine for agent execution, spend tracking, and eval triage
LangSmith added sandboxed execution, spend-aware gateway routing, and Engine to surface recurring agent failures from traces. The bundle gives teams one place to run agents, control token spend, and turn production issues into debugging and eval loops.

Personal Computer opens Windows waitlist for Max and Enterprise Max users
Perplexity opened Personal Computer for Windows to Max and Enterprise Max users on a waitlist. The rollout widens its local agent surface beyond earlier releases, and users should watch for the local-cloud task splitting preview for private or heavier workloads.


Gemma 4 12B ships encoder-free multimodal local model with 16GB target and 256K context
Google released Gemma 4 12B, an Apache 2.0 encoder-free multimodal model with native audio and vision for 16GB-class laptops. Day-zero support in llama.cpp, vLLM, Ollama, MLX, and SGLang should make local agents and on-device apps easier to deploy immediately.

Codex users report outages, 5-hour caps, and token shortages after Sites launch
Users reported outages, tighter 5-hour caps, and token availability problems a day after OpenAI launched Codex Sites and plugins. OpenAI reset Codex usage limits after three incidents, so teams should watch quotas and backend reliability as agent workflows ramp up.

LangSmith launches Sandbox, LLM Gateway, and Engine for agent execution, spend tracking, and eval triage
LangSmith added sandboxed execution, spend-aware gateway routing, and Engine to surface recurring agent failures from traces. The bundle gives teams one place to run agents, control token spend, and turn production issues into debugging and eval loops.

Ideogram 4.0 releases 9.3B open weights with 2K output and non-commercial license
Ideogram released 4.0 as open weights with 2K output, layout control, and strong text rendering, with rollout to ComfyUI, fal, and Hugging Face. Teams can download the design-focused model, but they should check the non-commercial license before using it in production.
Uber cuts AI coding-tool spend to $1,500 per employee per tool each month
Personal Computer opens Windows waitlist for Max and Enterprise Max users
OpenRouter launches Pareto Code with min_coding_score and 1B routed tokens per day
OpenClaw 2026.6.1 adds native Windows node host, Skill Workshop, and Workboard orchestration

Researchers report Meta AI support bot changed Instagram recovery emails without identity checks

CopilotKit releases v1.59.2 with threads, Vue packages, and React Native SDK

Claude Code updates Dynamic Workflows trigger to `ultracode` after accidental 103-agent runs

Hyper, OpenCode, Kilo, and Vals add Qwen 3.7 Plus support within 72 hours
Top storiesthis week
OpenAI launches Codex Sites and role-specific plugins as weekly users pass 5M
OpenAI rolled out Codex Sites, annotations, and role-specific plugins, while weekly users topped 5 million. The release pushes Codex beyond coding into hosted workspace and app workflows for enterprise teams.


Microsoft launches MAI-Thinking-1 and six companion models with 97.0% AIME 2025
Microsoft introduced MAI-Thinking-1, MAI-Code-1-Flash, and five other MAI models across code, image, voice, and speech. The launch puts Microsoft back into the frontier-model race and starts landing pieces of the stack in Copilot and partner runtimes.

Nous Research launches Hermes Desktop public preview for macOS, Windows, and Linux
Nous Research put Hermes Agent into a native desktop app and added Portal and Ollama-backed setup paths plus a Tailscale remote-connect fix. Hermes now has a local-first desktop surface instead of a terminal-only workflow.

Anthropic opens Project Glasswing to ~200 organizations with Claude Mythos Preview
Anthropic widened Project Glasswing from roughly 50 to about 200 vetted organizations, expanding access to Claude Mythos Preview for defensive security work. The program keeps Mythos restricted while Anthropic argues AI-assisted exploit discovery is accelerating.

Turbopuffer, Archil, TigerFS, and LangSmith add branching, snapshots, and rollback for agent runs
Multiple agent-infra vendors shipped copy-on-write branches, checkpoints, snapshots, forks, or rollback primitives on the same day. That matters because long-running agents can now explore, retry, and recover state without relying only on Git or full sandbox rebuilds.







