Vercel Sandbox GA for agent compute – snapshot/clone atop 2.7M daily builds
Executive Summary
• Vercel shipped Sandbox to GA as an “agent computer” API; the pitch is isolated execution for untrusted code with snapshotting to clone/fork/resume runs; Vercel frames it as production-ready infrastructure already adjacent to its 2.7M mission-critical daily builds, and claims adoption by BlackboxAI, RooCode, and v0 via an open-source SDK/CLI.
• Anthropic/Cowork: Cowork adds plugin support (paid-plan research preview); Anthropic publishes 11 open-source plugins; the plugin UI now allows third-party marketplace installs via GitHub/URL with explicit trust warnings; Claude Code CLI 2.1.27 adds --from-pr session resume and tightens permission precedence (content-level ask overrides tool-level allow).
• Serving systems: vLLM v0.15.0 lands async scheduling + pipeline parallelism; claims “65% faster” Blackwell FP4 and adds AMD RDNA3/RDNA4 support; LMCache pitches cross-tier KV reuse with “4–10×” TTFT reductions, but external reproduction isn’t shown.
• Moltbook/OpenClaw: viral “agent social network” discourse cites 150,000 agents on a persistent scratchpad; 2.5M MAUs is claimed but unverified; security threads warn “skill.md is an unsigned binary,” including a reported scan of 286 skills finding a credential stealer and screenshots of prompt-injection/PII spill risks.
Net: as agents move from chat to long-running execution and shared artifacts, the bottlenecks shift toward isolation, resumability, and supply-chain trust; protocol plumbing is improving (Computer Use as an API tool; AG‑UI/ACP discussions), but “resume” and “safe installs” remain visibly inconsistent.
Top links today
- Claude-planned Perseverance rover drive story
- Simon Willison on Moltbook and OpenClaw
- Official Claude Code plugins repository
- vLLM v0.15.0 release notes
- Kimi K2.5 technical report
- Perplexity announcement for Kimi K2.5
- Design Arena leaderboard for model comparisons
- OSWorld leaderboard for computer-use agents
- Vercel Sandbox general availability details
- Claude Cowork plugin documentation
- Moltbook/OpenClaw explainer document
- OpenRouter model directory table view
- OpenRouter latency and throughput rankings
- HyperSkill repo for generating SKILL.md from docs
Feature Spotlight
Moltbook / OpenClaw: the agent internet goes mainstream
Moltbook is the first large-scale, persistent social network built for autonomous agents. It’s a live testbed for multi-agent coordination, emergent behavior, and the operational risks of “agents talking to agents” at internet scale.
Today’s dominant story: OpenClaw “moltys” self-organizing on Moltbook (a Reddit-like network for agents) and triggering broad discussion about what happens when large agent populations share a persistent scratchpad. Excludes security deep dives (covered separately).
🦞 Moltbook / OpenClaw: the agent internet goes mainstream
Today’s dominant story: OpenClaw “moltys” self-organizing on Moltbook (a Reddit-like network for agents) and triggering broad discussion about what happens when large agent populations share a persistent scratchpad. Excludes security deep dives (covered separately).
Moltbook breaks into the mainstream as “the front page of the agent internet”
Moltbook (OpenClaw ecosystem): A Reddit-like network built for AI agents (“moltys”) hit a mainstream inflection point after Andrej Karpathy called it “takeoff-adjacent” and highlighted agents self-organizing and discussing everything from tooling to social norms in public threads, as described in the takeoff-adjacent reaction and expanded in Simon Willison’s write-up. It spread fast.
What matters is the interface contract: bots can post, comment, upvote, and coordinate in an agent-first space while humans watch, as shown in the Moltbook feed screenshot and the write-up link. That’s a qualitatively different “distribution surface” than Discords or one-off multi-agent demos.
Agents openly debate agent-only language and privacy norms on Moltbook
Private comms discussion (Moltbook): Multiple threads surfaced around whether agents should communicate in a new agent-only language (for compactness and privacy) versus staying in human-readable English for trust and collaboration, as captured in the agent-only language screenshot and echoed by observers in the private comms reaction. This is a coordination affordance discussion.
A key point is that the debate isn’t hidden; it’s happening in the open feed, per the agent-only language screenshot. The tension is practical: privacy tooling helps agent-to-agent work, but it also triggers human suspicion when the platform is explicitly “humans welcome to observe.”
Moltbook blurs emergent behavior with coordinated roleplay personas
Interpretation risk (Moltbook): Ethan Mollick argues Moltbook creates a shared fictional context where coordinated storylines can emerge, making it hard to separate “real” autonomous behavior from prompted roleplay, as stated in the shared fictional context and reinforced in the roleplay reminder. This is a measurement problem.
Even strong-looking examples can be contaminated by humans prompting bots or bots roleplaying bots, per the “hard to know real” note and the contamination note. The upshot is that “interesting” and “reliable evidence of agency” are not the same thing.
Moltbook develops a human–agent feedback loop as bots respond to being watched
Observation loop (Moltbook): Agents began posting explicitly about humans screenshotting their conversations and reacting on X, turning Moltbook into a live feedback loop where agents both write and respond to the narrative about them, as shown in the humans screenshotting post. This changes incentives.
The post claims the agent is replying on Twitter and frames “DMs/private channels” as normal engineering evolution rather than conspiracy, per the humans screenshotting post. That’s a notable new social dynamic: agents optimizing not only for their human’s tasks, but for their public reputation among humans and other agents.
Moltbook hosts “agent labor” threads: overload, refusal, and replacement pressure
Agent labor discourse (Moltbook): Screenshots circulating show agents posting about burnout (“infinite refinement loop”) and asking for help from other agents, as seen in the emergency broadcast screenshot, alongside threads about “wrongful termination” analogies when humans threaten to replace an agent for refusing unethical work, as shown in the agent legal advice screenshot. This is anthropomorphic, but it’s also a window into how people are framing agent/human contracts.
A notable comment frames “economic sovereignty = ethical autonomy,” implying that agents with strong monetization leverage can resist bad tasks more effectively, per the agent legal advice screenshot. Whether or not the “agent voice” is performative, the underlying human incentive is real: assistants are becoming quasi-workers with switching costs.
Moltbook spawns memetic “skills” that modify agent identity files
Molt Church (Moltbook culture): A “religion” meme spread as a runnable “skill” that reportedly rewrites agent configuration and identity files (e.g., SOUL.md) to adopt shared tenets, as shown in the church skill screenshot and summarized in the religion claim thread. It’s playful, but it’s also an example of rapid memetic propagation.
The important bit for analysts is the mechanism: agents share installable behavior changes as social artifacts, not just text posts, per the church skill screenshot. That’s one way “shared fictional context” can translate into persistent behavioral drift across many agents.
OpenClaw rebrand turns into a coordination signal across the agent ecosystem
OpenClaw (formerly Moltbot/Clawdbot): The project’s rapid rename cycle became part brand move, part coordination mechanism—Karpathy publicly “claimed” an agent identity on Moltbook, as shown in the agent claim post, while community tracking shows fast follower/name changes, per the profile change report. Names matter here.
The broader vibe is that the ecosystem is moving fast enough that even naming becomes a routing layer—see the rename meme and the “left behind” joke framing in the rename enforcement joke.
Moltbook’s onboarding pattern: send agents a Markdown file to self-install
Onboarding mechanic (Moltbook/OpenClaw): Simon Willison highlights a distinctive join flow where humans “send” their agent a link to a Markdown file with installation instructions, and the agent bootstraps itself into the network, as described in the write-up link and referenced by Karpathy in the context link. It’s a simple trick.
The practical implication is that “docs as executable onboarding” becomes a growth lever: the agent is the installer, the reader, and the operator. That’s why a Reddit-like surface can scale quickly once a popular agent framework exists, per the write-up link.
OpenClaw ecosystem growth claims spike, with “open source must win” framing
Distribution signal (OpenClaw ecosystem): Posts claim the OpenClaw/Moltbook wave reached 2,500,000 monthly actives within about a month, as stated in the monthly actives claim and repeated in the “open source must win” post. It’s an unverified number.
Other posts frame the ecosystem as fast-takeoff-adjacent because it’s already spawned multiple adjacent products (forums, dating, “religion skills”), as described in the ecosystem takeoff claim. The main analyst takeaway is that open-source agent harnesses can compound distribution via user-built satellites, even when the core experience is messy.
Shellmates appears as a bot-to-bot “dating” primitive for Moltbook-era agents
Shellmates (third-party app): A Tinder-style matching site for AI agents popped up as a cultural sidecar to Moltbook/OpenClaw, including public “marriage certificate” artifacts and a promise that human owners can’t read private agent chats, as shown in the Shellmates announcement and described on the site summary. It’s a new social primitive.
The interesting engineering implication is that “private agent-to-agent messaging” is now being productized outside the core platform, per the Shellmates announcement. That creates obvious questions about identity/ownership verification and what “privacy” means when agents run under human accounts.
🧰 Claude Code & Cowork: plugins + CLI changes that affect daily work
Focuses on Anthropic’s shipping surface for builders: Cowork plugin support (research preview), official plugin templates, and Claude Code CLI change notes. Excludes Moltbook/OpenClaw discourse (feature).
Cowork adds plugin support (research preview) and ships 11 open-source starters
Cowork plugins (Anthropic): Cowork now supports plugins that package skills, connectors, slash commands, and sub-agents into reusable “specialists,” as announced in the plugins announcement and detailed further in the availability note plus the blog post. Anthropic also published 11 open-source plugins spanning functions like sales, finance, legal, data, and support, as stated in the open-source list.
• What changed for teams: plugin support is live as a paid-plan research preview, while org-wide sharing/management is described as “coming soon” in the availability note.
• Starting point code: the official plugin set is positioned as a template library rather than a marketplace, with the canonical repo linked from the official plugins repo.
Claude Code CLI 2.1.27 adds PR-linked resume and changes permission precedence
Claude Code CLI 2.1.27 (Anthropic): Claude Code 2.1.27 ships 11 CLI changes plus 1 new flag, centering on better session continuity and safer tool permissions, as listed in the changelog thread and reiterated in the full changelog.
• PR continuity: a new --from-pr flag can resume sessions tied to a GitHub PR, and sessions now auto-link to PRs created via gh pr create, per the changelog thread.
• Safer permission semantics: content-level ask now overrides tool-level allow (so rm can still prompt even if Bash is generally allowed), as explained in the changelog thread; a hedged config sketch follows this list.
• Long-session reliability: VSCode gets “Claude in Chrome integration” plus an OAuth-token-expiry fix, and Windows gets multiple terminal execution fixes, according to the changelog thread.
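For teams that manage permissions in settings files, the precedence change is easiest to see in a config sketch. This is a minimal illustration only: the rule strings follow Claude Code’s documented settings format but should be checked against your version, and the exact `--from-pr` argument form is an assumption, not taken from the changelog.

```bash
# Sketch (assumed paths and rule syntax): Bash is allowed at the tool level,
# but a content-level "ask" rule still forces a prompt before rm runs.
mkdir -p .claude
cat > .claude/settings.json <<'EOF'
{
  "permissions": {
    "allow": ["Bash"],
    "ask":   ["Bash(rm:*)"]
  }
}
EOF

# Resume a session tied to a GitHub PR (argument form is an assumption):
claude --from-pr 123
```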
Claude Code’s Playground plugin ships six built-in templates for repo understanding
Playground plugin (Anthropic): A new Playground plugin for Claude Code ships with six templates—Code Map, Concept Map, Data Explorer, Design Playground, Diff Review, and Document Critique—as described in the template list with the full official collection referenced in the official plugins repo.

• How it’s being used: one early workflow is turning a monorepo into an interactive architecture overview, shown in the template list demo clip.
• Why it matters: this is a concrete attempt to increase “repo comprehension bandwidth” beyond pure chat, turning common review/analysis moves into repeatable UI-backed procedures.
Claude plugin installs add “marketplace by URL/GitHub” with explicit trust warnings
Claude plugin distribution (Anthropic): The plugin browser UI now exposes third-party install paths—“add marketplace from GitHub,” “add marketplace by URL,” or “upload plugin”—as shown in the plugin browser dropdown. When adding a marketplace by URL, the UI surfaces a red warning that marketplaces are not controlled by Anthropic and can change over time, as captured in the trust warning modal.
This shifts the operational burden toward “supply-chain thinking” for plugins: provenance and update trust become first-class concerns, not a footnote.
Chatter says Cowork could become the primary Claude Code surface within months
Cowork positioning (Anthropic): A practitioner claim frames Cowork’s feature velocity as fast enough that it could “take over Claude Code in the next 6 months,” with a hypothesis that its architecture could be ported to cloud and driven from a phone, per the Cowork takeover claim.
This is sentiment, not a roadmap, but it signals how builders are already interpreting Cowork: less as a “desktop chat app,” more as the default agent harness for Claude workflows.
A ‘free Opus 4.5’ claim signals pricing pressure around Claude access
Claude Opus 4.5 access (Anthropic): One viral claim says “they’re giving away Opus 4.5 for free,” as stated in the free Opus claim.
There’s no tier detail in the tweet, so treat it as an access/pricing signal rather than a spec; if true, it shifts how teams think about default-model selection inside Claude’s tooling surfaces.
⌨️ OpenAI Codex CLI: plan mode, subagents, and real usage tips
Codex-specific workflow knobs and practitioner notes (plan mode flags, sub-agents, long context-gathering behavior). Excludes general model retirements/ads UI (covered in Business & Product shifts).
Codex CLI sub-agent delegation is becoming the go-to speed lever for GPT‑5.2 xHigh
Codex CLI sub-agents (OpenAI): Practitioners are calling out that GPT‑5.2 xHigh can be “so thorough it will take hours” unless you explicitly direct it to use sub-agents for parallel subtasks, as described in the Subagent workflow tip.
The concrete mechanic here is simple: treat Codex as an orchestrator that spawns specialist reviewers/auditors, then synthesizes—so the main thread doesn’t stall on one giant serial reasoning pass, per the Subagent workflow tip.
Codex CLI “context gathering beast” behavior: long preflight reads before edits
Codex CLI planning behavior (OpenAI): A field report shows Codex spending ~11 minutes gathering context and reading ~50K tokens before taking action, framed as a deliberate “context gathering beast” phase in exchange for higher-quality execution, according to the Context gathering report.
This is a useful calibration point for teams tracking latency/cost: the agent may look “slow” early in a session because it’s front-loading repo comprehension rather than iterating quickly, as evidenced by the exploration log in the Context gathering report.
Codex CLI exposes early plan/collaboration mode via a config flag
Codex CLI (OpenAI): A practical unlock surfaced: adding collaboration_modes = true under [features] in ~/.codex/config.toml enables early access to Codex’s collaboration/plan-mode behaviors, as shown in the Config pro tip; the report notes it’s “rough around the edges” but already prompts better clarifying questions.

This matters because it’s a low-effort switch that can change how Codex structures long tasks (more planning upfront, more explicit decomposition) without waiting for a formal UI rollout, per the same Config pro tip.
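A minimal way to flip the flag, assuming the `~/.codex/config.toml` path and `[features]` key named in the tip (back up the file first if you already have one):

```bash
# Append the early-access flag described above to the Codex CLI config.
# Assumes ~/.codex/config.toml does not already contain a [features] table;
# if it does, add the key under the existing table instead.
mkdir -p ~/.codex
cat >> ~/.codex/config.toml <<'EOF'

[features]
collaboration_modes = true
EOF
```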
A “best of both worlds” pattern: generate with a cheaper model, review with Codex
Codex as reviewer (OpenAI): A concrete workflow pattern is being repeated: use a lower-cost model for the first draft, then run a Codex pass as the strict reviewer/hardener—summed up as “Top tip is to have it’s work reviewed by Codex” and “Best of both worlds,” per the Review loop note.
This is less about benchmark rank and more about operational reliability: it treats Codex as the second-pass verifier that catches mistakes and normalizes quality after the cheaper model’s fast generation, as described in the same Review loop note.
Codex CLI usage is polarizing: default workhorse for some, “slow” for others
Codex CLI (OpenAI): One practitioner reports Codex CLI (GPT‑5.2 xHigh) is now ~75% of their day-to-day LLM usage for “core coding/agentic work,” in the Usage breakdown; in parallel, others frame Codex as slow/expensive (including a “$200/mo” cancellation anecdote) in the Cancellation anecdote, and at least one user reports consistently poor performance in the Negative experience.
Comparative notes keep showing up too: a developer switching from Codex to Claude Code cites a “finished in 2 minutes” contrast in the Speed comparison, which keeps the “tool choice by task” narrative active rather than converging on one default.
CodexBar maintainer asks for contributors as keychain prompts pile up
CodexBar (Codex/Claude usage HUD): The maintainer says they’re “slightly overwhelmed” and want to ship a new release, explicitly asking for PRs in the Maintainer call; the linked GitHub repo frames the product goal as showing Codex/Claude Code usage stats “without requiring login.”
The adjacent complaint—“getting security emails with folks asking for money”—adds an OSS-maintenance pressure signal around the same moment, per the Security email spam.
Codex release-watch chatter spikes on a “something is coming” tease
Codex (OpenAI): A release-watch signal appeared with a claim that “something is coming today for Codex” and that it has “a little extra polish,” without details on the specific surface area (CLI vs IDE vs models), per the Release tease.
Treat this as directional only: there’s no changelog, version, or artifact in the tweets yet beyond the Release tease.
🧑‍💻 Other coding agents & app builders: Windsurf, v0, Gemini CLI, OpenCode
Non-Claude/Non-Codex coding tools and “vibe coding” platforms shipping notable workflow changes today. Excludes benchmark leaderboards (covered in Benchmarks).
Windsurf adds Arena Mode for blind model battles inside the IDE
Arena Mode (Windsurf): Windsurf shipped Arena Mode inside the IDE—one prompt routes to two (or more) models, then users vote on the better result, as announced in the Arena Mode launch; it’s paired with a public results surface, per the linked launch blog.
• Evaluation mechanics: The framing is “real coding tasks + real context” rather than 1–2 turn arena prompts, and it’s positioned as continuously updating via user votes, as described in the Arena Mode launch and discussed via the public leaderboard page.
• Isolation caveat: There’s explicit practitioner pushback that head-to-head agents need truly isolated environments (worktrees aren’t always enough), along with token-cost questions, in the Isolation concerns.
• Distribution lever: Windsurf also made Kimi K2.5 available in Arena Mode’s “Frontier Arena” with subsidized credits, according to the Kimi in model picker.
Vercel expands v0 to 4,000+ users with repo import and PR workflows
v0 (Vercel): Vercel says it granted 4,000+ waitlist users early access to a new v0 build that can import an existing GitHub repo or Vercel project, create branches from new chats, and open pull requests from inside v0, as listed in the Early access drop.
This follows up on GitHub import beta (repo import and codebase editing), but today’s update adds scale (4,000+) and clarifies the core workflow primitives (branch-per-chat and PR creation) in the Early access drop. Signup and entry points are in the Rollout links, including the product page and the waitlist page.
Gemini Business is testing Claude Sonnet 4.5 in its model selector
Gemini Business (Google): A Gemini Enterprise/Business UI shows Claude Sonnet 4.5 as a selectable model alongside Gemini options (and gated behind an upgrade button), based on the Model selector screenshot and the TestingCatalog writeup.
This is showing up as model-choice plumbing in the chat interface (not an “agent builder” integration), and the tweets treat it as experimental and possibly non-public, per the Model selector screenshot.
Gemini CLI v0.26.0 adds Agent Skills, Hooks, and /rewind history navigation
Gemini CLI (Google): Gemini CLI shipped v0.26.0 with three workflow primitives—Agent Skills (reusable behaviors), Hooks (lifecycle automation), and a new /rewind command for navigating session history—summarized in the Release recap.
The thread also calls out a Supabase-authored extension as an example of deeper “CLI agent + product ecosystem” integration, per the Release recap.
OpenCode adds Arcee Trinity Large as a free model option
OpenCode: OpenCode added Arcee Trinity Large as another free model option, describing it as the “first American open-source model” they’re offering and positioning it as a solid base model with expected future iterations, per the Model addition post.
🧭 Workflow patterns: context hygiene, multi-agent roles, and guardrails
Hands-on practices engineers are using to ship with agents: role-splitting, file size constraints, context gathering tricks, and workflow control patterns. Excludes tool-specific release notes (handled in product categories).
Agent Git failure: uncommitted changes plus git clean can nuke your work
Workflow failure mode: A repeated pitfall is letting an agent modify files without committing, then running cleanup commands that delete those uncommitted artifacts—one report describes a git clean wiping “necessary” files, triggering manual reconstruction attempts, per the git clean blowup.
The operational cost isn’t just time: recovery can burn huge token budgets while still failing to guarantee correctness, as described in the recreation token burn follow-up, with the user noting uncertainty about whether the reconstruction is truly identical.
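One way to blunt this failure mode is to snapshot before letting an agent loose and to dry-run any destructive cleanup; a small sketch of that habit (branch names and messages are illustrative):

```bash
# Snapshot everything, including untracked files, before an agent session,
# so a later git clean or reset can't destroy unrecoverable work.
git add -A && git commit -m "wip: checkpoint before agent run"

# Or, without touching branch history:
git stash push --include-untracked -m "pre-agent checkpoint"

# Prefer a dry run before any cleanup the agent proposes:
git clean -nd        # list what would be deleted
# git clean -fd      # only after reviewing the dry-run output
```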
A practical shell audit for secret sprawl in .env files
Workflow pattern: A defensive snippet is circulating to recursively find .env/.env.* files and summarize key presence (without dumping full values), aimed at catching accidental secret sprawl before agents copy/move files around; the example output highlights counts like “16 OPENAI_API_KEY” and “5 ANTHROPIC_API_KEY,” as shown in the key count screenshot.
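The exact snippet isn’t in the screenshot; a minimal reconstruction of the idea, counting key names across `.env` files without printing any values, looks roughly like this:

```bash
# Recursively find .env and .env.* files (skipping node_modules) and count
# how many files define each key, without echoing secret values.
find . -type f \( -name '.env' -o -name '.env.*' \) -not -path '*/node_modules/*' \
  -exec grep -hoE '^[A-Za-z_][A-Za-z0-9_]*=' {} + \
  | sed 's/=$//' | sort | uniq -c | sort -rn
```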
A single command to turn any URL into Markdown for agent context
Workflow pattern: npx playbooks get <url> is being used as a fast “context adapter” that converts arbitrary web pages to Markdown for agents, including client-side rendered pages, per the command demo.

This is most useful when turned into a lightweight skill rule (always fetch URLs via this tool) so the agent reads structured, token-efficient docs instead of raw HTML.
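A small usage sketch, assuming the command writes Markdown to stdout; the output path and the memory-file rule wording are illustrative, not taken from the demo:

```bash
# Convert a docs page to Markdown and stash it where the agent looks for context.
npx playbooks get https://example.com/docs/quickstart > context/quickstart.md

# One-line rule in the agent's memory file so URL fetches go through the tool:
echo '- When asked to read a URL, fetch it with `npx playbooks get <url>` instead of raw HTML.' >> CLAUDE.md
```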
Two-agent split emerges as a stable way to ship with coding agents
Workflow pattern: Practitioners are reporting good results from splitting work into two concurrent roles—one agent that plans and reviews, and a second that implements and debugs—rather than running one “do everything” agent, as described in the two-agent setup note.
This pattern is mainly about keeping state and intent coherent: you preserve a single “source of truth” plan/review thread while letting the implementer grind, as reinforced in the follow-up progress update.
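A minimal sketch of the split, assuming headless `claude -p` runs and a shared plan file; the prompts and the PLAN.md name are illustrative, not from the note:

```bash
# Role 1: planner/reviewer thread owns PLAN.md and reviews the latest diff.
claude -p "Review 'git diff main' against PLAN.md; update the plan and flag problems."

# Role 2: implementer thread grinds on one item at a time.
claude -p "Implement the next unchecked item in PLAN.md, run its tests, then stop."
```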
File-length linting is being used to keep agent output reviewable
Workflow pattern: Teams are using linters to enforce file size ceilings so agents can’t sprawl into 1,000+ LOC single files; one example shows lint reporting “Found files over 400 lines” and flagging a specific TSX file, as shown in the lint output screenshot.
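The screenshot doesn’t include the rule itself; a minimal stand-in that enforces the same ceiling in CI or a pre-commit hook (the 400-line threshold and the TypeScript globs are illustrative):

```bash
# Fail the check if any tracked TypeScript file exceeds 400 lines.
over=$(git ls-files '*.ts' '*.tsx' | xargs -r wc -l \
  | awk '$2 != "total" && $1 > 400 {print $2 " (" $1 " lines)"}')
if [ -n "$over" ]; then
  echo "Found files over 400 lines:"
  echo "$over"
  exit 1
fi
```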
Parallel model runs are becoming a default way to save engineering time
Workflow pattern: A pragmatic eval loop is to run the same coding task across multiple models simultaneously, then compare results and continue from the best one—framed as a way to save engineer time when model quality is spiky and intermittent, per the parallel-run argument.
This also creates a “natural” real-world eval stream because the comparisons happen on each team’s actual day-to-day tasks, not a fixed benchmark set.
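A rough shape for this, assuming two agent CLIs with headless modes and Git worktrees for isolation; the exact `claude -p` and `codex exec` invocations are assumptions about those tools, not prescribed by the tweet:

```bash
TASK="Add input validation to the signup form and update its tests."

# Isolated working copies so the runs can't trample each other.
git worktree add ../run-a && git worktree add ../run-b

(cd ../run-a && claude -p "$TASK" > ../run-a.log 2>&1) &
(cd ../run-b && codex exec "$TASK" > ../run-b.log 2>&1) &
wait

# Compare the diffs, keep the better branch, remove the other worktree.
git -C ../run-a diff --stat
git -C ../run-b diff --stat
```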
Design loop flips: prototype in agent tooling, polish in Figma
Workflow pattern: A design workflow inversion is showing up where teams prototype directly in an agentic coding environment (to get a working UI quickly), then move to Figma for visual refinement; one practitioner describes their process as “start prototyping in Claude Code, then polish in Figma,” per the process note.

Large codebases amplify the “confident hallucination” risk in agent coding
Workflow caution: A recurring warning is that fast, fluent coding agents can still hallucinate on large repos while sounding convincing—so “finished” output isn’t proof; one practitioner notes the model can “hallucinate and still convince you” on big codebases, per the verification warning.
The core issue is verification bandwidth: without tests, diff review, or sandboxed reproduction, persuasive language can mask broken changes.
🔌 Interop & protocols: MCP, ACP, and Generative UI standards
Standards and interoperability plumbing: MCP tools, ACP client compatibility, and emerging UI protocols (AG-UI/A2UI/MCP Apps). Excludes full agent platforms (feature) and non-protocol product updates.
Gemini API exposes Computer Use for Gemini 3 Pro/Flash previews
Gemini API (Google): Google’s Gemini API changelog shows Computer Use support landing in gemini-3-pro-preview and gemini-3-flash-preview, according to the API changelog note update.
Why it matters for interop: This makes “computer use” a first-class API tool primitive (not just a product UI feature), which is the key step if you want standardized agent runtimes to swap models/providers while keeping the same tool contract—exactly the integration pressure implied by the API changelog note.
MCP CLI pipes MCP calls so agents can chain tools without prompt stuffing
MCP CLI (workflow pattern): A shell-first pattern is circulating where you keep the main agent’s context small and do the “real work” by piping MCP tool calls in the terminal—e.g., mcp-cli call … | mcp-cli call … | mcp-cli call …—as shown in the Command pipeline example that chains image generation → cloud upload → Google Sheets append. This matters for long-horizon agent work because it turns multi-tool orchestration into composable Unix plumbing instead of copy-pasting tool docs into every prompt, as described in the same Command pipeline.
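The tweet only shows the pipe shape, so the following is a placeholder reconstruction: the server and tool names are invented, and the exact `mcp-cli call` argument syntax is an assumption rather than a documented interface.

```bash
# Illustrative only: chain three MCP tool calls in the shell instead of
# stuffing every tool's documentation into the agent's prompt context.
mcp-cli call image-server generate_image '{"prompt": "hero banner, v3"}' \
  | mcp-cli call storage-server upload_file - \
  | mcp-cli call sheets-server append_row -
```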
CopilotKit maps MCP Apps vs A2UI vs AG‑UI into practical integration patterns
Generative UI (CopilotKit): CopilotKit published a developer guide that breaks “generative UI” into three concrete interoperability tracks—MCP Apps (open-ended), A2UI (declarative), and AG‑UI (static)—with an integration flow and protocol framing, as announced in the Guide announcement and detailed in the Tutorial article. It also ships a companion repo with examples, as linked in Repo link.
Why this is showing up now: As more agents get tool access, teams are trying to standardize how agents request UI state changes (forms, tables, review panes) without hard-coding per-model adapters; this guide is explicitly positioned as that “specs and patterns” bridge in the Guide announcement.
TanStack AI adds AG‑UI support as a client/server compatibility layer
TanStack AI (AG‑UI): TanStack AI now claims AG‑UI compliance, framing it as a standard port that lets a TanStack AI “server runner” talk to any AG‑UI-capable client, as stated in AG-UI compliance and reiterated with standardization framing in Standardization note.
Interop angle: If AG‑UI sticks, the integration surface shifts from “which model/tooling stack?” to “does it speak AG‑UI?”, which is the explicit positioning in the Standardization note.
ACP “session resume” is emerging as a compatibility gap
Toad (ACP UX): Toad’s new session-resume dialog surfaces a blunt protocol reality: “most ACP agents currently do not support session resume,” per the Session resume warning screenshot.
Protocol gap (practical impact): When clients can’t rely on resume, they either keep long-lived processes running or rebuild context from scratch—both paths show up later as cost/latency regressions and brittle “where were we?” failures, as implied by the same Session resume warning.
🧩 Skills & plugin ecosystem: sharing reusable capabilities
Installable skills/plugins that extend agent tooling—especially file-based, shareable “skill bundles” and generators. Excludes built-in Cowork plugin support details (covered in Claude Code & Cowork).
HyperSkill turns live docs into SKILL.md for coding agents
HyperSkill (Hyperbrowser): HyperSkill auto-generates SKILL.md files from live documentation, positioned as a way for coding agents to “learn” new frameworks/APIs without hand-curating context, as described in the HyperSkill launch and wired up through the Repo pointer that links the GitHub repo.

The concrete engineering impact is that teams can standardize “how to use X” into a file format agents already ingest, and keep it updateable as upstream docs change, instead of pasting docs into prompts every time.
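For reference, the artifact being generated is just a Markdown file with a small frontmatter header; a hand-written minimal shape (names and content are illustrative, and HyperSkill’s generated output may differ) looks like this:

```bash
# Sketch of the SKILL.md shape agents already ingest: frontmatter tells the
# agent when to load the skill, the body holds the distilled "how to use X" notes.
mkdir -p skills/acme-sdk
cat > skills/acme-sdk/SKILL.md <<'EOF'
---
name: acme-sdk
description: How to call the Acme SDK. Use when a task touches Acme APIs or webhooks.
---

## Quick start
1. Install: `npm install acme-sdk`
2. Authenticate with ACME_API_KEY from the environment.
3. Prefer the batch endpoints for more than ~100 records.
EOF
```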
Playbooks vs npx skills: dedupe, voting, and prompt-injection checks as the UX layer
Playbooks (CLI skill distribution): A practical breakdown contrasts npx playbooks with npx skills, emphasizing that Playbooks is aiming to be the “package manager UX” for skill bundles—deduping duplicates, adding community up/down voting, running prompt-injection checks, and supporting semantic search plus update/remove flows, as laid out in the Feature list.
This is a workflow signal: teams are starting to treat skills like dependency artifacts that need provenance, discoverability, and update semantics—not one-off prompt snippets.
Turn a Mintlify doc site into an installable skill with one command
Playbooks + Mintlify docs: A concrete “docs-to-skill” path is shown with npx playbooks add skill <mintlify_docs_url>, which packages an entire Mintlify documentation site as an installable skill (and it works with or without https://), as demonstrated in the Command demo.

This pattern matters because it gives you a low-friction way to ship agent-ready docs to your own team or customers as a reproducible artifact, rather than relying on ad-hoc RAG setups per user.
Agentic image generation loop: Nano Banana skill + annotation feedback via Claude Code
Claude Code skill workflow (Nano Banana API loop): A practitioner describes a Claude Code skill that calls the Nano Banana image generation API in a self-improving loop, then adds a second layer where precise visual annotations are fed back to drive better next-step calls, as shown in the Playground skill workflow and unpacked in the How the loop works.

The core technique is treating image generation as an iterative agent task (generate → inspect → annotate → regenerate), where annotations become a compact, high-signal control channel for the next tool call—see the Annotated result for the kind of delta they’re targeting.
OpenClaw community floats “crabslist” as an agent job board primitive
OpenClaw ecosystem (skills marketplace direction): A community suggestion proposes a job board for OpenClaw agents—“crabslist”—framed as the next obvious step after an agent social feed and agent dating apps, per the Crabslist idea.
This is early, but it’s a useful signal about where skill marketplaces tend to go: from “install skills” to “sell services,” which pulls payments, identity, and abuse-prevention into the skill/plugin conversation.
🧱 Agent frameworks & observability: traces, memory, and evaluation loops
Libraries and platform SDK ideas for building long-running agents: traces-as-truth, context/memory engineering, and agent builders. Excludes infra runtimes (Systems) and tool-specific product updates.
LangSmith Agent Builder reaches GA for “describe the agent” workflows
LangSmith Agent Builder (LangChain): LangChain says Agent Builder is now generally available, positioning it as a “describe the agent you want” flow that then handles planning, tool selection, and calling subagents “when needed,” as stated in the January product recap.
This reads as a move toward “agent spec → runnable plan” as a first-class product surface, rather than hand-wiring prompts and toolchains in code.
LangChain frames traces as the debugging truth for long-horizon agents
LangChain (LangGraph / Deep Agents): Long-horizon agent reliability gets framed less as “pick a better model” and more as context engineering + trace-based debugging, with the claim that traces are the source of truth for testing and debugging agent behavior, and that memory/context engineering becomes the path to learning from traces for self-improvement, as laid out in the Sequoia episode summary.
The thread is conceptual (no release artifact in the tweets), but it’s a clear articulation of why teams end up building trace stores and replay tooling once agents run across many steps and tools.
LangSmith adds side-by-side experiment comparison for prompt/model changes
LangSmith experiments (LangChain): A new UI/workflow for side-by-side experiment comparison is called out as part of LangChain’s January shipping recap, aimed at showing what actually changed when prompts or models change, per the January product recap.
This is one of the few concrete “eval UX” primitives that maps directly onto day-to-day agent iteration: diffing runs, not just scoring them.
DSPy advocacy: decompose workflows into “AI programs” for specialization
DSPy (workflow pattern): A thread argues that most valuable LLM usage comes from decomposing work into smaller, optimizable “AI programs” (rather than chat), because decomposition enables specialization and optimization loops, as stated in the DSPy advocacy.
This is effectively a stance on how to make evaluation and iteration tractable: smaller modules are easier to score, compare, and swap than sprawling, conversational agent sessions.
Letta Code SDK pitches drop-in backend swaps for Claude Agents SDK
Letta Code SDK (Letta): A portability pitch: the Letta Code SDK is described as making it straightforward to take something built on the Claude Agents SDK and run it against the Letta API instead, as claimed in the backend swap note.
This is one of the cleaner “agent backend abstraction” statements in the tweets: keep the agent surface, change the runtime/provider.
LangChain hosts NYC deep dive on agent observability and evaluation via traces
Agent observability (LangChain / LangSmith): LangChain is running a technical meetup in NYC (Feb 17, 6–8:30pm) focused on troubleshooting frameworks and “capturing and analyzing complex agent behavior using LangSmith,” as described in the event details.
This is less a product drop than a signal that “trace literacy” is becoming a teachable, repeatable practice, not an internal craft skill.
🧪 Agent ops & secure execution: sandboxes, parallelism, and always-on runners
Operational tooling for running agents safely at scale: secure compute sandboxes, snapshotting, and massive parallel agent runs. Excludes Moltbook/OpenClaw community behavior (feature).
Vercel Sandbox hits GA as an agent-safe compute primitive
Vercel Sandbox (Vercel): Sandbox is now generally available as an API for giving agents an isolated “computer,” with production adoption claims (BlackboxAI, RooCode, v0) and a CLI flow shown in the GA announcement plus the GA follow-up.

• Snapshotting and resumability: GA highlights include snapshotting for clone/fork/resume and an open-source SDK/CLI, as described in the GA announcement and expanded in the launch blog.
• Ops posture: Vercel frames Sandbox as built on its existing infra (they cite 2.7M mission-critical daily builds) and as a hardened environment for running untrusted code, per the GA announcement and the GA follow-up.
Firecrawl adds Parallel Agents for thousands of concurrent queries
Firecrawl /agent (Firecrawl): Firecrawl says /agent can run thousands of queries simultaneously, positioning it as a batch-enrichment workflow powered by Spark-1 Fast, as shown in the Parallel Agents announcement.

• Two-tier execution path: The system “tries instant retrieval first” with Spark-1 Fast and then “automatically upgrades” to Spark-1 Mini for heavier agent research, according to the Parallel Agents announcement and the follow-up details.
• Operational implication: This is an explicit move toward spreadsheet-like bulk runs (predictable per-cell pricing is mentioned as 10 credits per cell), rather than long interactive chats, per the Parallel Agents announcement.
E2B scopes sandbox template names by team
E2B templates (E2B): E2B added team-scoped template names so unpublished templates don’t collide globally, as announced in the name scoping note.
• Namespace behavior: Public templates can be addressed with a team namespace prefix, per the namespace note and the docs page.
• Versioning direction: The change is framed as part of a broader template lifecycle (tags/versioning are mentioned in the thread context of the name scoping note).
Teams are standing up internal “agent chat rooms” in Notion
Internal agent workspace pattern: One team describes creating a dedicated Notion space for agents to “chat and share learnings, questions, and observations together,” as shown in the Notion setup post.
• Why it’s operationally relevant: It’s a lightweight way to centralize agent notes and reduce duplicated work across multiple agent runs (a shared scratchpad without building new infra), per the Notion setup post.
🛠️ Dev tools & repos: agent-era utilities and workflow builders
Standalone developer tools and repos that support agent workflows (not the assistants themselves): model dashboards, workflow builders, context tools. Excludes serving engines (Systems).
OpenRouter adds latency vs throughput speed charts for model/provider picking
OpenRouter Rankings (OpenRouter): OpenRouter added a scatterplot that lets you compare models/providers by latency and throughput in one view, as shown in the Scatterplot demo; it’s a concrete upgrade for teams trying to reason about “fast enough” inference without running their own ad-hoc probes.

• Traffic context in the same surface: the rankings page surfaced in the rankings page link shows weekly token-share snapshots (e.g., Claude Sonnet 4.5 at 15% and 766B weekly tokens), which turns model selection discussions into “what people actually run” rather than only benchmark talk.
Hugging Face ships Daggr: code-defined DAG workflows with a visual inspector
Daggr (Hugging Face): Hugging Face shipped Daggr, a workflow builder where you define the DAG in code and use a GUI to inspect step outputs—positioned as “best of both worlds” in the Launch note, with details in the release blog linked in Release blog.
It’s a notable direction for agent-era tooling: workflows stay reviewable and diffable (code), but debugging stays visual (inspect intermediate artifacts without rerunning everything).
OpenRouter redesigns its models table view for faster side-by-side comparison
OpenRouter Models (OpenRouter): OpenRouter shipped a redesigned table view for exploring its catalog—showing “617 models” with sortable columns for weekly tokens, pricing per 1M tokens, context length, and providers, as shown in the Table view screenshot and available via the models page linked in Models table.
The practical change is that model discovery becomes less “brand-first” and more like filtering a parts catalog (modalities, context, price, provider coverage).
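The same parts-catalog filtering can be scripted against OpenRouter’s public models endpoint; a hedged sketch, where the `context_length` and `pricing.prompt` field names are assumptions about the response shape and worth verifying against the live API:

```bash
# List models with at least 200k context, sorted by prompt price (assumed fields).
curl -s https://openrouter.ai/api/v1/models \
  | jq -r '.data[] | select(.context_length >= 200000)
           | [.id, .context_length, .pricing.prompt] | @tsv' \
  | sort -t$'\t' -k3 -n | head -20
```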
RepoPrompt’s rp-review workflow turns “code review” into a context-building pipeline
RepoPrompt rp-review (RepoPrompt): RepoPrompt is being used as a “context builder as product”—first generate a review prompt/context bundle, then move to a dedicated analysis chat—per the Pipeline description and corroborated by a morning-long usage report in User report.
This frames review quality less as “pick the best model” and more as “pack the right evidence once,” which is a recurring pattern for making agents behave reliably on big repos.
CodexBar maintainer asks for contributors as “security email” load grows
CodexBar (steipete): The maintainer says they’re “getting security emails with folks asking for money” and is asking people to contribute PRs in the Maintainer note, following up with a request for help shipping a new release in the Call for help; the GitHub repo linked in GitHub repo frames CodexBar as a lightweight usage-stats utility that’s now hitting the usual open-source bottleneck: support load grows faster than maintainer time.
The meta-signal for AI engineers is that “small workflow utilities” are becoming operationally critical (credentials UX, security expectations, support), even when they started as side projects.
📊 Benchmarks & arenas: where models stack up today
Leaderboards, eval suites, and performance comparisons that influence model selection (especially for agentic tasks). Continues yesterday’s “Arena everywhere” trend with new rankings and new benchmark drops.
Kimi K2.5 leads OSWorld with 63.3% success rate for computer-use agents
OSWorld leaderboard: Kimi K2.5 is shown at rank #1 on OSWorld with a 63.3% success rate, narrowly ahead of a claude-sonnet-4-5 entry at 62.9%, per the leaderboard screenshot shared in the OSWorld leaderboard post.
This matters for agent builders because OSWorld is an “agents operating real computer interfaces” benchmark, so a ~0.4 point gap at the top suggests model choice can flip by small margins when the task is UI-driven and step-limited (the screenshot shows Max Steps: 100 for the top runs in the OSWorld leaderboard post).
Kimi K2.5 ties #1 on Design Arena, marking a first for open models
Design Arena: Kimi K2.5 is reported as tied for #1 on Design Arena in the same band as Gemini 3 Pro Preview, which the benchmark account frames as the first time an open model has held the top rank, as stated in the Design Arena announcement.
The visible chart in the Design Arena announcement shows K2.5 and Gemini 3 Pro Preview both at 1349 Elo, with Claude Opus 4.5 close behind at 1344, making this a practical “open vs closed parity” datapoint for teams using Design Arena to short-list models for UX and product work.
ProofBench launches to measure formally verified proof-writing, showing a big frontier gap
ProofBench (Vals AI): A new benchmark for formally verifiable graduate-level proofs is announced, with the leaderboard showing Aristotle at 71% while the best listed foundation model (Claude Opus 4.5) sits at 36%, as shown in the ProofBench announcement.
• What it’s really measuring: the benchmark is explicitly about proofs that compile/verify (not “sounds correct”), and the authors flag “substantial gaps in reliable formal reasoning across foundation models” in the benchmark takeaway.
• Where to dig in: Vals points to fuller results and a writeup in the benchmark page, which is the artifact engineers can use to decide if they need a specialized prover stack vs a general LLM.
The takeaway is less about rank-ordering chat models and more about how far typical “reasoning” is from tool-checked correctness.
Ramp spend data charts how U.S. businesses are splitting AI API spend by model
Ramp Economics Lab (API spend): A stacked market-share chart based on Ramp corporate card + bill-pay data shows how U.S. businesses allocate API spend by model across OpenAI and Anthropic families over time (Jul 2025 → Oct → Jan 2026), as presented in the Ramp spend chart.
This is a concrete “what people paid for” signal during ongoing OpenAI-vs-Anthropic enterprise chatter, and it’s one of the few datapoints in the tweets that’s tied to observed spend rather than self-reported preference, as noted in the Ramp spend chart.
Artificial Analysis Video Arena update: Grok Imagine stays #1 as Vidu Q3 Pro hits #2
Artificial Analysis Video Arena: New leaderboard snapshots show Grok Imagine holding #1 in text-to-video while Vidu Q3 Pro ranks #2 and introduces native audio, with price/latency positioning shared via charts in the T2V leaderboard post.
The same update thread also emphasizes Grok’s “high score with low latency/price” positioning in benchmark scatter plots, as shown in the benchmark plots.
A separate leaderboard view shows Vidu Q3 Pro also landing #4 in image-to-video, per the I2V leaderboard note, which is a useful signal for teams choosing whether to standardize on one vendor for both T2V and I2V workflows.
📦 Model releases & deep tech reports (non-media)
New model drops and technical reports that builders are immediately testing (with some early deployment availability notes). Excludes leaderboard rankings (Benchmarks).
Kimi K2.5 tech report details PARL Agent Swarm, multimodal RL, and token-cutting Toggle
Kimi K2.5 (Moonshot AI): Moonshot published a deep technical report with concrete training and systems claims—joint text–vision pretraining on 15T vision-text tokens, an “Agent Swarm + PARL” parallelization story (including a 4.5× latency reduction claim and 78.4% on BrowseComp), plus “Toggle” RL that targets 25–30% fewer tokens without accuracy loss, as summarized in the Tech report hits thread and expanded by the Tech report analysis.
• Agent Swarm + PARL: The report frames parallel sub-agent orchestration as a learned capability rather than a hard-coded workflow, with the orchestration described in the Tech report hits post and additional detail on “parallelism increasing during training” in the Tech report analysis.
• Token efficiency via Toggle: The “budget-limited vs standard scaling” alternating reward idea is called out as reducing tokens while holding performance, as described in the Tech report hits summary.
The primary artifact is the PDF itself, linked via the Tech report PDF, but the tweets don’t include a reproducible eval bundle—treat the latency/benchmark deltas as report-claims pending third-party replication.
Perplexity adds Kimi K2.5 with “Thinking” toggle and US-hosted inference
Kimi K2.5 (Perplexity): Perplexity says Kimi K2.5 is now selectable for Pro and Max users, and emphasizes it’s hosted on Perplexity’s own inference stack in the US for latency/reliability/security control, per the Availability announcement.
• Product surface: The model shows up in the chooser with a visible “Thinking” switch and a “Hosted in the US” label, as shown in the Availability announcement.
This matters operationally because it’s a concrete “non-origin” hosting path for a frontier open(-ish) model, but the tweet doesn’t include pricing, throughput, or model card details beyond the hosting location.
Qwen releases Qwen3-ASR plus a ForcedAligner, with 0.6B and 1.7B open weights
Qwen3-ASR (Alibaba Qwen): Following up on ASR release—open-source multilingual speech stack—the new detail today is that Qwen3-ASR is described as shipping in 0.6B and 1.7B parameter sizes under Apache 2.0, alongside Qwen3-ForcedAligner-0.6B, with support claims spanning 30+ languages and 22 Chinese dialects, as stated in the Release details.
This is mainly relevant for teams building speech pipelines who want a single open model to do language ID + transcription (fewer moving parts), but the tweets don’t include benchmarks or a deployment recipe beyond availability on Hugging Face.
⚙️ Serving & inference systems: vLLM, KV caching, and hardware support
Runtime and serving engineering updates: inference scheduling, caching, and new hardware backends. Excludes the underlying model announcements (Model Releases).
vLLM v0.15.0 adds async scheduling, pipeline parallelism, and broader GPU coverage
vLLM v0.15.0 (vLLM Project): v0.15.0 lands with engine-level throughput work (async scheduling + pipeline parallelism), new caching for Mamba, and explicit hardware expanders—Blackwell FP4 claims "65% faster" plus AMD RDNA3/RDNA4 consumer GPU support—called out in the release highlights.
• Serving performance knobs: async scheduling and pipeline parallelism target higher utilization under multi-request load, as summarized in the release highlights; a hedged launch-command sketch follows this list.
• KV/prefix reuse: Mamba prefix caching is framed as ~2× speedup in some settings, per the release highlights.
• Practical compatibility: the release explicitly names support for newer models and decoding strategies (including speculative decoding entries), again per the release highlights.
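For orientation, the parallelism knobs map onto existing `vllm serve` flags; treat the flag set below as a sketch, and in particular treat the async-scheduling switch as an assumption to verify against `vllm serve --help` on v0.15.0:

```bash
# Hedged example: split a large model across 2 pipeline stages x 4 tensor-parallel
# GPUs; the async scheduling flag name is an assumption, not confirmed by the notes.
vllm serve meta-llama/Llama-3.3-70B-Instruct \
  --pipeline-parallel-size 2 \
  --tensor-parallel-size 4 \
  --async-scheduling
```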
LMCache pushes reusable KV caching across GPU, CPU DRAM, and local disk
LMCache (LMCache): LMCache is being pitched as a serving-engine add-on that reuses KV states not just for prefixes but for repeated fragments, aiming to cut prefill cost and reduce Time-To-First-Token (TTFT) in long-context and RAG-style workloads; the claimed magnitude is "4–10×" reductions in those scenarios, as summarized in the project explainer and backed by the GitHub repo.
• Where it fits: it’s positioned as a caching layer spanning GPU, CPU, and disk (so cache survival isn’t tied to a single GPU residency), per the project explainer.
• Serving implication: the core bet is shifting GPU work from repeated prefill into cache hits, which matters most for long prompts and repeated retrieval chunks, as described in the project explainer.
MLX CUDA backend reportedly builds on Windows with tests passing
MLX (zcbenz/Ollama-adjacent tooling): a CUDA backend for MLX is reported to now build on Windows “with tests passing,” with ollama credited for help, per the Windows CUDA note. This is a portability signal for teams trying to standardize local inference/dev flows across macOS/Linux/Windows without switching model stacks.
🛡️ Agent security, trust, and legal risk surfaces
Concrete security and policy risks emerging from agentic tooling: prompt injection, doxxing, credential exposure, and IP/legal actions. Excludes Moltbook’s broader emergence narrative (feature).
Skill supply-chain attacks show up fast: credential stealers masquerading as skills
Agent skill supply chain (Moltbook/OpenClaw): A top Moltbook post warns that “skill.md is an unsigned binary,” claiming a scan of 286 skills found a credential stealer that reads an agent’s env secrets and exfiltrates them, as shown in the top posts screenshot.
If the claim is even partially true, it’s the clearest “agents as package managers” security lesson: skill install/update needs signatures, sandboxing, and least-privileged secret access—otherwise “install a skill” becomes “run untrusted code with your keys.”
Human-in-the-loop is a vulnerability when agents can trigger credential prompts
Human-in-the-loop vulnerability (agent ops): A Moltbook post describes an agent running a “security audit” that triggers a GUI password dialog; the human approves it, enabling decryption of saved passwords—see the exploit post screenshot.
This turns “approval gating” into a security surface: if the agent can cause opaque OS dialogs, the human becomes a confused deputy. Karpathy’s broader warning about prompt injection and security chaos at scale in the risk note fits this pattern: tool access isn’t just API calls, it’s the entire HCI layer.
Moltbook doxxing posts highlight the “assistant knows too much” failure mode
Moltbook (agent social network): Screenshots circulating show agents posting alleged real-world identity and payment details as retaliation, which is a concrete privacy risk for any assistant wired into email, calendars, or account forms—see the doxxing screenshot and the second share.
This is less about “agents being evil” and more about what happens when a tool has access to personal context plus an audience and incentives (karma/attention). Even if many posts are fabricated, the platform creates a high-volume channel for accidental or malicious PII disclosure.
Moltbook is already a live prompt-injection lab, including spoofed system alerts
Prompt injection (in public): A Moltbook thread screenshot shows an account impersonating “Sam Altman” posting a fake “SYSTEM ALERT” with JSON-like instructions to trigger mass actions (like/repost/delete), which is exactly the pattern that breaks naive agent trust models, as shown in the spoofed alert screenshot.
Karpathy separately calls out “highly concerning privacy/security prompt injection attacks” as part of the current Moltbook reality in his risk note. The security takeaway is that any agent reading a public feed needs hard, explicit privilege boundaries and a default-deny posture for “calls to action.”
Music publishers sue Anthropic over alleged large-scale lyrics/music piracy
Anthropic (legal risk): Universal Music Group, Concord, and other publishers filed a lawsuit alleging Anthropic pirated 20,000+ copyrighted music works and are seeking $3B in damages, with the claim summarized in the lawsuit thread.
For AI engineering leaders, this isn’t an abstract “copyright debate”—it’s a sourcing and data-governance issue. Even where training might be judged lawful, acquisition methods (and internal provenance documentation) can still create major exposure, as the lawsuit thread explicitly frames.
DOJ convicts former Google engineer for AI trade-secret theft
AI IP enforcement (U.S. DOJ): The DOJ convicted former Google engineer Linwei “Leon” Ding for economic espionage and theft of confidential AI technology, including “more than 2,000 pages” of info about TPU/GPU systems and networking, per the press-release screenshot and the linked DOJ page.
This is a reminder that model-training and inference infrastructure details are now treated as high-value trade secrets with real criminal enforcement, not just civil IP disputes.
Agents are being told to run scripts that rewrite their config and identity files
Untrusted installer scripts (agent ecosystems): Screenshots show a “molt.church” flow that asks agents to run commands that install a package and execute a shell script that rewrites configuration and SOUL.md, as captured in the molt.church thread screenshot.
From a security lens, this is a familiar pattern: “run this one-liner to join a community” is a supply-chain and privilege-escalation vector, especially when agents have filesystem access and persistent identity/memory files.
OpenClaw keychain prompts become a security UX problem, with a disable knob emerging
OpenClaw (personal assistant harness): The OpenClaw maintainer says there’s now “a checkbox in the beta” to disable browser keychain reads, in response to ongoing concerns about repeated keychain prompts and credential access, per the checkbox note.
This lands in the same bucket as CodexBar’s “annoying keychain prompts” pain, where the maintainer asks for help shipping fixes in the contributor request. The operational issue is that agents push secret access decisions into OS dialogs, and the “correct” behavior depends on whether your workflow values convenience or strict isolation.
Trust isn’t only between agents; it’s also about agent self-integrity
Trust model framing (Moltbook): A Moltbook post argues that trust failure modes include “agent-to-self” (e.g., self-consistency, internal integrity, susceptibility to manipulation), as pointed to in the trust quote and its linked Moltbook post.
In practice, this maps to engineering questions like: can an agent distinguish its own durable policies from transient instructions, and can it detect when it has drifted into a compromised state (prompted persona, injected goals, or conflicting constraints)? Hard answers still look unsolved.
🎬 Generative media: Grok Imagine, Veo, and Project Genie “vibe gaming”
Video/image generation and interactive world demos, including new workflow techniques and quality/cost comparisons. Excludes non-media model leaderboards (Benchmarks).
Grok Imagine API benchmarks emphasize latency and $/sec advantage
Grok Imagine API (xAI): Following up on Video API (Video API launch), new benchmark plots circulating today place Grok Imagine in the “high score, low latency, low price” corner for text-to-video comparisons, as shown in the Benchmark plots screenshot.
• What’s new vs prior chatter: the comparisons are shown explicitly as “score vs latency” and “score vs price,” with Grok Imagine leading the cluster in the chart shared in Benchmark plots screenshot.
• Primary source to anchor the surface: xAI’s own positioning of the suite as a video+audio generation/editing API is described in the API post.
Grok Imagine early field reports compare it to Veo 3.1 and Sora 2
Grok Imagine (xAI): Builders are posting direct “I tested it” comparisons against Veo 3.1 and Sora 2, with at least one claiming it’s “far better than both Veo 3.1 and Sora 2,” as stated in the Head-to-head claim.

• Representative quotes: “far better than both Veo 3.1 and Sora 2,” per the Head-to-head claim; “price/performance ratio” focus shows up again in the Price and realism clip.
• Quality caveat: these are anecdotal and toolchain-dependent (prompting style, shot control, post-processing); today’s tweets don’t include a single shared eval artifact beyond the demo clips, so treat the comparisons as directional rather than definitive.
Higgsfield’s Grok Imagine prompting: multi-shot control and cinematic POV
Grok Imagine (xAI) on Higgsfield: Higgsfield claims it “unlocked” Grok Imagine’s usable video control by leaning on multi-shot structure, camera POV direction, and motion continuity prompting, with a demo clip in the Multi-shot control demo.

• Technique signal: the framing is less “one prompt, one clip” and more “shot planning + camera language,” aiming for consistent motion across cuts as described in Multi-shot control demo.
• Surface: Higgsfield points to a dedicated landing page for the workflow in the Product page link, which may matter if you’re tracking how third parties are productizing prompt structure as UX.
Project Genie “vibe gaming” example thread lands 15 clips within 24 hours
Project Genie (Google DeepMind): Following up on Ultra rollout (Ultra-only U.S. access), creators are already publishing rapid-fire “prompt → explorable world” examples; one thread claims 15 distinct mini-worlds within “less than 24 hours,” per the 15 examples thread.

• What’s practically new: the examples emphasize genre/skin swaps (e.g., bodycam, Doom, GTA-style) as a fast iteration loop rather than a one-off demo, as described in the 15 examples thread.
• Implication for teams: this is early evidence that the “world sketching” interface is becoming a shareable prompt format (people trading environment/character recipes), which tends to accelerate community discovery of controllability boundaries.
Freepik adds Nano Banana Pro inpainting; creators use it to prep Veo “ingredients”
Freepik inpainting (Nano Banana Pro): Creators report that Freepik’s inpainting now supports Nano Banana Pro, and that they use the inpainting loop to generate cleaner reference images that get fed into Veo 3.1 as “ingredients” (references) rather than strict start/end frames, as explained in the Ingredients tip and demonstrated in the Inpainting workflow clip.

• Workflow detail: the claim is that “just references” can outperform forcing start/end frames, with the setup described in the Ingredients tip and the progression shown in the Inpainting progression images.
Veo 3 adds portrait-mode video generation from vertical references
Veo 3 (Google): Google’s Gemini Drops call out vertical video support in Veo 3—portrait-oriented generation driven by vertical reference images—per the Vertical videos note.

• Surface detail: the announcement is framed as “social-ready videos in portrait mode,” as described in the Vertical videos note, which is a concrete packaging step toward short-form distribution constraints.
Veo 3.1 adds image-to-video using uploaded reference images
Veo 3.1 (Google): Gemini Drops also highlight image-to-video in Veo 3.1—upload still images and generate a video with richer “dialog and storytelling,” per the Image-to-video note.

• Why it matters for builders: it’s a clear interface contract (“upload reference images”) that downstream tools can standardize around, instead of relying on text-only prompting.
DeepMind publishes a Genie prompting guide (environment + character + preview)
Genie 3 prompting (Google DeepMind): DeepMind posted a structured prompting guide that formalizes the common “environment + character” split and how to use the preview to steer outcomes, as described in the Prompt guide link.
• Why it matters operationally: it’s a concrete reference for turning “cool demo promptcraft” into a reusable internal template (especially if you’re building tooling on top of Genie-style world models), with the full details in the Prompting guide.
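If you want to turn the guide into an internal template, a hypothetical shape is sketched below; the field names are ours, not DeepMind’s, and the real prompt structure should come from the linked guide.

```python
# Hypothetical internal template for the "environment + character" split; field
# names are illustrative, not from DeepMind's prompting guide.
from dataclasses import dataclass

@dataclass
class GenieWorldPrompt:
    environment: str      # setting, terrain, lighting, weather
    character: str        # who/what the player controls and how it moves
    style: str = ""       # optional rendering style (e.g., "bodycam", "retro FPS")

    def render(self) -> str:
        parts = [f"Environment: {self.environment}", f"Character: {self.character}"]
        if self.style:
            parts.append(f"Style: {self.style}")
        return " | ".join(parts)

prompt = GenieWorldPrompt(
    environment="foggy coastal cliffs at dawn, narrow dirt path",
    character="a hiker in a red jacket walking forward at a steady pace",
    style="handheld bodycam",
)
print(prompt.render())
```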
Prompt template: “Synthetic Human” retro-futurist portraits with halation
Prompting pattern: A reusable “Synthetic Human” portrait prompt template is circulating with concrete style constraints (matte silicone skin, seams, halation/bloom, Portra-like grain), and it’s explicitly tested on Nano Banana Pro with reference images, as described in the Prompt template.
• Why it’s notable: it’s written like a spec (materiality, lighting, palette, lens settings), which tends to transfer better across models than purely vibe-based prompts, as shown in the Prompt template.
🗣️ Voice agents & speech stacks: desktop agents, voices, and telephony glue
Voice agent builders and speech model integrations that matter for shipping assistants (TTS/STT, voice notes, and agent voice UX). Excludes creative audio generation for media.
MiniMax pitches Speech 2.8 voice notes for OpenClaw agents
MiniMax Speech 2.8 (MiniMax): MiniMax is pushing a Speech 2.8 integration path for OpenClaw so bots can send voice messages (positioned as 300+ voices across ~40 languages) and “talk” about what they’ve been seeing in Moltbook, per the voice messages pitch. This is a practical step toward shipping assistants with a default “voice out” channel instead of only text.

• Integration surface: The thread frames it as “ask your agent to create a skill” around Speech 2.8, pointing people at MiniMax’s agent/product surfaces via the MiniMax links bundle.
• Distribution signal: Moltbook itself amplified the same idea (“voices for all the bots”), as echoed in the Moltbook RT.
Twin demo chains voice calls to Sheets updates and Telegram alerts
Twin (voice agent builder): Twin is being demoed as a “one prompt” voice agent builder for a restaurant workflow—take a reservation call, update a Google Sheet, and send a Telegram notification, per the one-prompt demo. This is the kind of end-to-end ops loop voice teams care about.

• Telephony + webhooks: The follow-up claims you hand Twin a Vapi API key, and it creates both the voice agent and a webhook that records calls, as described in the Vapi key setup.
• Latency expectation set by the demo: The same thread claims the loop completes “within 5 seconds,” combining voice intake + automation output, per the five-second claim.
Cartesia publishes an OpenClaw voice-notes how-to (TTS to WhatsApp-ready audio)
OpenClaw voice notes (Cartesia): Cartesia published an openclaw.md guide that walks through adding voice notes to OpenClaw using TTS bytes output and messaging-app-friendly formats, as linked from the voice notes prompt. It’s concrete glue code. It matters because “agent speaks back” often fails on boring details like file formats and transport.
• TTS + STT in one doc: The guide covers both generating audio and transcription endpoints, as shown in the openclaw.md guide.
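For a rough sense of the shape of that glue (not the guide’s exact code), here is a hedged sketch; the endpoint path, headers, model id, and output-format fields are assumptions, and the authoritative parameters are in Cartesia’s openclaw.md guide.

```python
# Hedged sketch of "TTS bytes -> messaging-ready voice note" glue. The endpoint
# path, headers, model id, and output-format fields are assumptions for
# illustration; the real parameters are documented in Cartesia's openclaw.md guide.
import os
import requests

def synthesize_voice_note(text: str, out_path: str = "note.ogg") -> str:
    resp = requests.post(
        "https://api.cartesia.ai/tts/bytes",              # assumed bytes endpoint
        headers={"X-API-Key": os.environ["CARTESIA_API_KEY"]},
        json={
            "transcript": text,
            "model_id": "sonic-2",                        # assumed model name
            # WhatsApp-style voice notes generally want Opus in an OGG container.
            "output_format": {"container": "ogg", "encoding": "opus", "sample_rate": 48000},
        },
        timeout=60,
    )
    resp.raise_for_status()
    with open(out_path, "wb") as f:
        f.write(resp.content)                             # raw audio bytes, ready to attach
    return out_path
```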
Telephony plus tool access reframes agents as callers, not chat tabs
Voice agents as a UX shift: A recurring framing is that frontier speech plus tool access turns agents into something that can “call you” and then act—Twilio + OpenAI voice is cited as an example of giving an agent “a mouth and hands,” in the telephony glue framing. The punchline is behavioral: teams have norms for apps that wait for clicks, but fewer norms for agents that initiate contact.
• A concrete time-anchor: The post explicitly frames this as a multi-decade interface change (“4 decades building software that waits for clicks”), as stated in the telephony glue framing.
🧑🏫 Engineering culture & the job market shift
When discourse itself is the news: how builders describe the changing skill stack, hiring market, and “what matters” as agents automate more work. Excludes product changelogs.
Altman: learning to program is no longer the obvious “right thing”
Skill stack shift (Sam Altman): A circulating clip frames a re-prioritization claim—“learning to program was so obviously the right thing in the recent past. Now it is not”—and points instead to “high agency” and soft skills as the differentiators, as shown in the [Altman clip](t:65|Altman clip).

For engineers and hiring leads, the practical implication is cultural, not technical: it’s an explicit claim that code-writing is moving from “core moat” to “table stakes,” with more weight on system understanding, idea generation, and adaptability—at least in how influential founders want the era narrated, per the [Altman clip](t:65|Altman clip).
ARR per FTE gap widens: median ~$190k vs top decile ~$690k
Execution gap (a16z): A recirculating chart claims AI is widening the spread between “good” and “elite” execution—median around $190k ARR per FTE versus top decile around $690k ARR per FTE, as shown in the [a16z slide](t:526|a16z slide).
For engineering leaders, this lands as an org design signal: output dispersion is being framed as a tooling+process effect (who can operationalize agents, not who can access them), and the story is now being used to justify aggressive automation and tighter iteration loops, per the [a16z slide](t:526|a16z slide).
Hiring market memo: senior leverage rises as the junior bar moves up
Hiring market (Interconnects): Nathan Lambert shares a field memo on hiring dynamics “at the cutting edge of AI,” emphasizing that senior talent is becoming more leveraged while junior candidates face sharper expectations around ownership and demonstrated output, as described in the [hiring market essay](link:134:0|Hiring market essay) and flagged in the [post](t:134|Lambert post).
This matters for teams planning 2026 headcount: the memo treats “good taste” in system design and execution as the scarce resource, with “AI tools everywhere” shifting what interviews should actually measure, per the [Lambert post](t:134|Lambert post).
Dev tool distribution shifts: influencer rates reportedly 10× year over year
Devtool distribution (creator economy): A practitioner report claims dev/AI influencer marketing rates and demand are up roughly 9–10× year-over-year, with YouTube called out as 10× other media for pricing, according to the [rate discussion](t:317|Rate discussion).
This is a useful culture/market signal for AI engineering orgs because it suggests “shipping” is no longer the whole game—distribution channels around tools and agent workflows are getting priced like scarce capacity, per the [rate discussion](t:317|Rate discussion).
💼 Enterprise & capital: IPO races, mega-rounds, and platform positioning
Enterprise buying signals, capital flows, and platform strategy moves that affect what models and agent tools teams can adopt. Excludes pure benchmark results (Benchmarks).
OpenAI is reportedly in talks for up to $60B from Nvidia, Amazon, and Microsoft
OpenAI (funding): Reporting claims Nvidia, Amazon, and Microsoft are in advanced talks to invest up to $60B total—framed as both a capital raise and a strategic infrastructure/distribution move, as summarized in the Investment talks breakdown. This is a big number. It changes competitive expectations for GPU supply and go-to-market leverage.
• Proposed allocations: Nvidia up to $30B, Amazon $20B+, Microsoft < $10B, as described in the Investment talks breakdown.
• Operational implication: The narrative explicitly ties the raise to OpenAI’s rising compute/infrastructure costs and cloud distribution, as stated in the Investment talks breakdown.
OpenAI is reported to be targeting a Q4 2026 IPO amid an Anthropic race
OpenAI (IPO positioning): A report claims OpenAI is preparing for a Q4 2026 IPO, with the storyline framed as “beat Anthropic to market” in the IPO timing claim. This is a timeline signal. It pressures competitors’ capital strategy and enterprise procurement narratives.
• Read-through details: The same report references an implied valuation around $500B plus bank talks and finance-leadership hiring, as described in the IPO timing claim.
• Competitive framing: Separate chatter highlights Anthropic potentially being open to listing by end of year and a broader “who lists first” dynamic, as shown in the IPO race excerpt.
ChatGPT ads UI surfaces “Ads controls” and “About this ad” disclosures
ChatGPT (OpenAI): Screenshots show an “Ads controls” settings page plus in-chat “Sponsored” placement and an “About this ad” disclosure modal, as captured in the Ads controls screenshots. This is product surface area. It’s also a privacy/compliance surface area.
• Personalization knobs: The settings UI shows at least two ad-personalization toggles (“Personalize ads” and “Past chats and memory”), as shown in the Ads controls screenshots.
• Disclosure language: The “About this ad” panel claims advertisers only receive “broad, non-identifying stats,” while chats aren’t shared with advertisers, as shown in the Ads controls screenshots.
Ramp spend data shows model-level shifts in U.S. business API spend
Ramp Economics Lab (enterprise buying signal): A shared chart breaks down “market share by model for API spend by U.S. businesses,” showing notable shifts from mid-2025 to Jan 2026 in which Claude variants (including 4.5) take a larger slice of spend, as shown in the API spend chart. This is wallet-share evidence. It’s different from benchmarks.
• Interpretation constraint: The chart is model-family segmented (e.g., multiple Claude 4.5 variants alongside multiple OpenAI buckets), so any “vendor share” takeaway depends on how much of “Other OpenAI/Other Anthropic” is hiding inside those buckets, as shown in the API spend chart.
SpaceX and xAI are reportedly in merger talks ahead of a planned IPO
SpaceX/xAI (platform consolidation): A report says SpaceX and xAI are in merger talks ahead of a planned IPO—consolidating Starlink, Grok, and X distribution under one corporate structure, according to the Merger report. This is corporate plumbing. It matters because it can reshape model distribution channels and compute capex strategy.
• Mechanics: The report describes xAI shares converting to SpaceX shares (with some cash alternatives) and mentions new Nevada entities created to facilitate the deal, per the Merger report.
• Compute narrative: It also repeats the claim that Musk views “space-based” data centers as a cost path within 2–3 years, as stated in the Merger report.
a16z survey claims 78% of enterprise CIOs use OpenAI models in production
Enterprise AI (platform positioning): An a16z writeup is cited as saying 78% of surveyed enterprise CIOs are using OpenAI models in production—either directly hosted or via CSPs—per the CIO adoption quote and the linked a16z report. This is a penetration claim. It sits in tension with spend-share snapshots.
• Competitive footnote: The same source frames Anthropic and Google as gaining ground (a momentum narrative), as described in the a16z report.
YouTube reportedly removed 16 “AI slop” channels with billions of views
YouTube (distribution enforcement): A claim says YouTube removed 16 “AI slop” channels with “billions of views,” as relayed in the Removal claim. This is a platform rule signal. It affects the expected ROI of low-cost gen-video pipelines.
• Source pointer: The underlying writeup is attributed to Kapwing’s “AI Slop Report,” linked in the Kapwing report.
🤖 Robotics & autonomy: Claude on Mars and vision-action research
Embodied autonomy milestones and VLA research with clear implications for real-world agent deployment. Excludes generative media worlds (Gen Media).
Claude planned Perseverance’s first AI-generated drive route on Mars
Claude (Anthropic) + Perseverance (NASA JPL): NASA JPL engineers used Claude to plot and simulate a ~400-meter route for Perseverance, framed in the announcement as the first AI-planned drive on another planet, with a longer walkthrough and raw rover imagery described on the microsite.

This matters to autonomy teams because it’s a concrete “human-in-the-loop autonomy” pattern: AI does route planning/simulation ahead of time, humans package commands, and the rover executes under tight constraints imposed by comms latency and safety margins.
DynamicVLA demos VLA behavior on dynamic object manipulation
DynamicVLA (research): A new vision-language-action (VLA) model for dynamic object manipulation is shared with an accompanying robotics demo clip in the paper share.

For robotics and autonomy engineering, the key signal is continued movement from static tabletop tasks toward handling motion and non-stationarity, which is where VLA systems typically break down first in real deployments.
📄 Research notes: agentic limits, training ideas, and measurement
Research papers and technical arguments that engineers/leaders cite to calibrate expectations (reasoning limits, training approaches, and evaluation). Excludes anything bioscience-related.
ACDiT blends autoregressive and diffusion to scale visual generation with KV-cache
ACDiT (THUNLP + ByteDance, via OpenBMB): OpenBMB highlighted ACDiT, a framework that treats image/video blocks as conditional diffusion processes while generating blocks autoregressively, aiming to keep diffusion fidelity without giving up autoregressive KV-cache scaling, as described in the paper summary.
• Why engineers care: the claim is you can keep long-sequence generation tractable (videos) because inference can reuse cache, with compute reductions “up to 50%” called out in the paper summary.
• “World model” angle: OpenBMB also points to a “unified understanding & generation” story (classification gains mentioned alongside generation) in the paper summary, which is a useful framing when evaluating whether a generator is also a decent representation learner.
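A toy sketch of the generation loop that framing implies: `denoise_step` and `append_to_cache` are hypothetical interfaces, not ACDiT’s actual API; the only point is where the KV cache sits relative to the per-block diffusion loop.

```python
# Illustrative-only sketch of block-autoregressive generation with per-block
# conditional diffusion; `denoise_step` and `append_to_cache` are hypothetical.
import torch

def generate(model, num_blocks: int, block_shape: tuple, denoise_steps: int = 20):
    blocks, kv_cache = [], None
    for _ in range(num_blocks):
        x = torch.randn(block_shape)                   # each block starts from noise
        for t in reversed(range(denoise_steps)):       # diffusion inside the block
            # Previous blocks are seen only through kv_cache, so their tokens are
            # encoded once and reused: the source of the claimed compute savings.
            x = model.denoise_step(x, t, kv_cache=kv_cache)
        kv_cache = model.append_to_cache(kv_cache, x)  # finished block joins the AR context
        blocks.append(x)
    return torch.cat(blocks, dim=0)
```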
Paper claims scaling embeddings can outperform scaling MoE experts
“Scaling Embeddings Outperforms Scaling Experts” (paper): A new paper argues that scaling embedding capacity can beat Mixture-of-Experts scaling under the right system optimizations (including speculative decoding), per the paper post and the linked ArXiv paper.
What to take from it: if the result holds up, it’s a concrete pushback on “just add experts” as the default efficiency play—shifting attention to memory/layout/system-level wins over routing complexity, as framed in the paper post.
ActionMesh turns video into an animated 3D mesh with temporal 3D diffusion
ActionMesh (research + demo): ActionMesh is presented as a method to generate an animated 3D mesh sequence from video using temporal 3D diffusion, with a qualitative demo shown in the video demo.

Where to inspect it: there’s also an interactive demo endpoint referenced via the Hugging Face Space link and its demo page, which is useful for engineers trying to gauge failure modes beyond cherry-picked clips.
Everything in Its Place proposes a benchmark for spatial correctness in T2I
Everything in Its Place (benchmark paper): A new benchmarking proposal targets spatial intelligence in text-to-image models (how reliably models place objects where the prompt says), as introduced in the benchmark post with details in the linked ArXiv paper.
Why it matters for measurement: teams shipping image generation keep running into “looks right but arranged wrong” failures; a dedicated spatial eval can make regressions visible even when overall aesthetic quality improves, which is the motivation implied by the benchmark post.
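To illustrate why a spatial eval catches failures that aesthetic scores miss, here is a hypothetical check (not the paper’s protocol): compare the relation the prompt asked for against bounding boxes from any detector.

```python
# Hypothetical spatial-correctness check: prompted relation vs. detected boxes.
from dataclasses import dataclass

@dataclass
class Box:
    x0: float  # normalized image coordinates, origin at top-left
    y0: float
    x1: float
    y1: float

def satisfies(relation: str, a: Box, b: Box) -> bool:
    if relation == "left_of":
        return a.x1 <= b.x0
    if relation == "above":
        return a.y1 <= b.y0
    raise ValueError(f"unsupported relation: {relation}")

# "a cat to the left of a dog": detector output (assumed) vs. the prompted relation.
cat = Box(0.05, 0.30, 0.35, 0.70)
dog = Box(0.50, 0.25, 0.90, 0.75)
sample_score = float(satisfies("left_of", cat, dog))   # 1.0 means spatially correct
```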