Fresh stories
Zyphra releases folded TSP with 173M tok/s on 1,024 MI300X GPUs
Zyphra published folded Tensor and Sequence Parallelism, claiming 173M tok/s versus 86M for matched TP+SP on 1,024 MI300X GPUs. The design keeps more replicas inside a node, reducing per-GPU memory pressure and cross-node communication.

Genspark launches sb-git with agent-native Git, diff, and 1 GB free storage
Genspark released sb-git, a Git server for agents with clone, push, diff, blame, rollback, and branch semantics plus 1 GB free storage. The service strips GitHub account setup out of agent workflows while preserving normal Git operations.

Claude Code 2.1.128 fixes EnterWorktree branching, OTEL leaks, and MCP reconnect floods
Claude Code 2.1.128 shipped 37 CLI changes, including local-HEAD worktree branching, OTEL env isolation for subprocesses, and summarized MCP reconnect announcements. The update reduces accidental tracing, preserves unpushed commits in worktree flows, and trims noisy tool re-announcements in long sessions.


Zyphra releases folded TSP with 173M tok/s on 1,024 MI300X GPUs
Zyphra published folded Tensor and Sequence Parallelism, claiming 173M tok/s versus 86M for matched TP+SP on 1,024 MI300X GPUs. The design keeps more replicas inside a node, reducing per-GPU memory pressure and cross-node communication.

OpenClaw 2026.5.3 adds /steer, /side, and paired-node file transfer
OpenClaw 2026.5.3 shipped paired-node file transfer, live /steer nudges, and /side side-questions, plus hardened plugin installs. The release changes how long-running agents are controlled and moves files with policy-gated 16 MB hops instead of stdout hacks.

deepsec launches CLI-first security harness with sandbox fanout for large repos
Vercel released deepsec, a CLI-first coding-security harness that runs agent reviews locally or fans out across sandbox workers for large repos. Early comparisons against Warden suggest a cheaper but less exhaustive scan profile, so teams should weigh coverage against cost.

Copilot users report $221 for 15 GPT-5.5 messages before June 1 billing switch
Ahead of GitHub Copilot's June 1 usage-based billing switch, users documented GPT-5.5 sessions hitting 60M tokens and $221 across 15 messages on the legacy per-message plan. The examples show why flat message buckets break once single requests can run for hours and consume extreme token counts.
Genspark launches sb-git with agent-native Git, diff, and 1 GB free storage
Cursor releases Team Kit with /verify-this, /loop-on-ci, and harness skills
Warp opens docs repo after 285 Oz agents migrated its CMS in 3 hours
Claude Code 2.1.128 fixes EnterWorktree branching, OTEL leaks, and MCP reconnect floods

Gemini API ships Webhooks and field-level Interactions errors

TinyFish opens Search and Fetch for free with MCP, CLI, and <0.5 s p50

Perplexity Computer integrates with Microsoft Teams via Marketplace launch

Goodfire reports eval awareness raises Fortress refusals 16% and cuts StereoSet stereotypes 20%

Zyphra Inference launches MI355X endpoints for DeepSeek V3.2, Kimi K2.6, and GLM 5.1
Top storiesthis week
Codex users report `/goal` sessions with 70-minute Stripe fixes and a 4,000-prompt cap
Users posted long-running Codex `/goal` sessions with auto-continuations, `pause`/`resume`, and file-backed goals. Watch the 4,000-prompt startup cap and early-stop drift if you plan to run longer agent loops.


Hermes Agent v0.12.0 adds Kanban boards for multi-agent workspaces
Nous Research added a Kanban workflow where specialized agents claim linked tasks, share files, and persist progress in SQLite-backed workspaces. The update moves Hermes from a single-agent loop to coordinated queues with human comments, heartbeats, and crash recovery.

Codex updates Auto-Review to default with ~200x fewer approvals
OpenAI said Auto-Review is now the default inside Codex after an internal rollout cut needed approvals by about 200x. The shift moves more coding-agent work into guarded review loops with policy and egress controls.

Codex community ships Security plugin, Plannotator, and `dcg` hooks as third-party tooling forms
Independent builders shipped a Codex security-review pack, planning and annotation integration, and `dcg` safety-hook support in the same window. The burst matters because review, guardrail, and workflow tooling is forming around Codex beyond OpenAI’s own releases.

ClawSweeper 0.2.0 adds guarded PR repair and automerge loops
ClawSweeper 0.2.0 turns OpenClaw repo maintenance into an issue-to-PR loop with build checks, repair passes, re-review, and conservative automerge. The release packages a Codex-driven maintenance bot that other repositories can fork instead of wiring their own triage stack.





