Fresh stories

Opus 4.7 users report 1.46x tokenization and faster limit burn
Four days after the Opus 4.7 launch, independent tests measured about 1.35-1.46x more text tokens than 4.6 while users kept reporting faster limit burn and weaker coding. That can change effective cost and session economics in Claude Code even if list prices stay flat.

ChatGPT Pro users report GPT-5.4 Pro with faster SVG and UI generation
Multiple Pro users said GPT-5.4 Pro started producing richer front-end and SVG outputs with much faster runtimes, despite no formal OpenAI announcement. The reports matter because they affect whether long visual and code-generation tasks are practical inside ChatGPT.

CopilotKit releases A2UI Composer with AG-UI theater and JSON export
CopilotKit shipped an interactive A2UI Composer for building widgets, inspecting AG-UI event streams, and generating reusable A2UI JSON. Teams can now prototype and copy agent-facing UI components without hand-authoring every widget schema.


Vercel reports OAuth-linked breach via compromised AI tool
Vercel disclosed unauthorized access to internal systems affecting a limited subset of customers and said a compromised Google Workspace OAuth app at a third-party AI tool was the entry point. Some non-sensitive environment variables may have been exposed, so teams should review SaaS integrations and secret handling now.

Codex users report subagent, MCP, and canary deploy workflows
Practitioners shared repeatable Codex workflows for long-lived threads, background subagents, computer-use access through MCP, and canary rollouts. Codex is being used less as a one-shot assistant and more as a persistent automation harness.

Gemma 4 ecosystem ships 60+ on-device demos and local agent benchmarks
Qwen3.6-35B-A3B benchmarks 40 tok/s on M3 Ultra with Strix Halo follow-ups
Claude Design users report HTML and JSX exports for Claude Code handoffs

Top stories this week
Opus 4.7 users report 1.47x token overhead and web-search refusals two days after launch
Users and analysts say Opus 4.7 is using more tokens, refusing web search, and missing orchestration steps in Claude Code-style workflows. If you rely on xhigh defaults or tokenizer-sensitive prompts, watch token costs and regression reports closely.


Claude Code users report unexplained bans and no appeal path
A 60-seat org and multiple T3 Code users said Anthropic blocked Claude access without warning, and one restored account still lacks a public explanation. Teams that depend on Claude Code should plan for sudden access disruption and keep a fallback workflow ready.

Grok launches STT and TTS APIs with WebSocket streaming and 25-plus languages
Grok added standalone speech-to-text and text-to-speech APIs with WebSocket streaming, word timestamps, diarization, and support for 25-plus languages. Developers building realtime audio apps can now call Grok Voice infrastructure directly instead of wiring it through the app UI.

Moonshot claims 1.54x throughput and 64% lower P90 TTFT with cross-datacenter prefill
Moonshot says its Prefill-as-a-Service setup makes prefill/decode disaggregation practical across datacenters and mixed hardware by shrinking the KV cache with Kimi Linear. The paper reports 1.54x throughput and a 64% drop in P90 time-to-first-token; these are vendor-reported figures, so benchmark the approach before planning production adoption.

OpenCode 1.4.11 adds workspace support for git worktrees and remote environments
OpenCode 1.4.11 beta lets sessions run inside git worktrees or remote environments, with a remote server that keeps sessions alive and resyncs locally after reconnects. Use it if you run multi-session agent work across machines or plugin-defined runtimes.







