Fresh stories
Gemma 4 adds MTP drafters for up to 3x faster decoding
Google released Multi-Token Prediction drafters for Gemma 4 and says decoding can run up to 3x faster without output-quality loss. vLLM and SGLang support shipped on day one, so local and server deployments can try the speedup immediately.

Anthropic launches 10 finance agent templates for Claude Code
Anthropic released ready-to-run finance templates for pitchbooks, valuation reviews, KYC, and month-end close across Cowork, Claude Code, and Managed Agents. Use them to start with bundled connectors, skills, and subagents instead of building each workflow from scratch.

Gemini API adds multimodal File Search with page citations
Google expanded Gemini API File Search to index text and images together, add custom metadata filtering, and return page-level citations. RAG builders can use it for tighter retrieval control and more auditable answers.


ChatGPT ships GPT-5.5 Instant by default with Memory Sources
OpenAI is rolling out GPT-5.5 Instant as the default ChatGPT model and exposing it as gpt-5.5-chat-latest, alongside Memory Sources for personalized replies. OpenAI also claims 52.5% fewer high-stakes hallucinations, so watch for behavior changes in production prompts.

OpenClaw ships 2026.5.4 with faster Gateway startup
OpenClaw 2026.5.4 adds cleaner plugin installs, faster Gateway startup paths, sharper doctor repairs, and fixes for Windows and Discord. The release is aimed at reliability, so update if you rely on the Gateway or plugin tooling.

ProgramBench reports 0% on ffmpeg, SQLite, and ripgrep rebuilds without internet

Cursor adds always-on CI agents that open fix PRs

Claude Code 2.1.129 adds --plugin-url and fixes cache TTL to 5 minutes

Perplexity Computer launches Professional Finance with 35 workflows and licensed data

Realtime TTS-2 releases with sub-200 ms TTFA and 100+ languages

AI Studio adds edit mode and Nano Banana image assets

OpenAI Agents SDK adds TypeScript support and sandbox agents

Raindrop launches Triage for Slack digests and trace search
Top stories this week
OpenClaw 2026.5.3 adds /steer, /side, and paired-node file transfer
OpenClaw 2026.5.3 shipped paired-node file transfer, live /steer nudges, and /side side-questions, plus hardened plugin installs. The release changes how long-running agents are controlled and moves files with policy-gated 16 MB hops instead of stdout hacks.


Zyphra releases folded TSP with 173M tok/s on 1,024 MI300X GPUs
Zyphra published folded Tensor and Sequence Parallelism, claiming 173M tok/s versus 86M for matched TP+SP on 1,024 MI300X GPUs. The design keeps more replicas inside a node, reducing per-GPU memory pressure and cross-node communication.

deepsec launches CLI-first security harness with sandbox fanout for large repos
Vercel released deepsec, a CLI-first coding-security harness that runs agent reviews locally or fans out across sandbox workers for large repos. Early comparisons against Warden suggest a cheaper but less exhaustive scan profile, so teams should weigh coverage against cost.

Zyphra Inference launches MI355X endpoints for DeepSeek V3.2, Kimi K2.6, and GLM 5.1
Zyphra launched serverless inference on AMD MI355X for DeepSeek V3.2, Kimi K2.6, and GLM 5.1, aimed at long-horizon agent workloads. The service leans on high-HBM nodes to keep more long-context sessions resident and reduce queueing.

Cursor releases Team Kit with /verify-this, /loop-on-ci, and harness skills
Cursor's Team Kit packages internal skills like /verify-this, CLI and UI automation harnesses, PR cleanup, and /loop-on-ci, installable with /add-plugin cursor-team-kit. It turns several internal review and validation habits into reusable commands for agent-driven coding workflows.






