OpenAI Codex macOS app launches – 2× limits for 2 months
Executive Summary
OpenAI shipped the Codex desktop app for macOS as a multi-agent “command center”; core primitives are parallel long-running threads, built-in git worktrees, Skills, and scheduled Automations with an approval queue; rollout is paired with a time-boxed promo that doubles paid-plan usage limits for 2 months and temporarily unlocks Codex for ChatGPT Free and Go. OpenAI’s own demos emphasize diff-first supervision, Plan mode via /plan, /personality toggles, and verification loops that run tests/QA as part of an agent run; Windows is teased as “coming soon,” but no date is given.
• Skills + interop: Codex now reads Skills from .agents/skills, with .codex/skills slated for deprecation; Skills can auto-install/auto-auth MCP servers via project config.
• Automation plumbing: scheduled runs are tracked in a local SQLite DB (debuggable without a hosted dashboard).
• Friction signals: early reports show ~103% CPU in a renderer helper and ~95% in the core process; no account switcher yet (logout is in the macOS menu).
In parallel, Anthropic’s Claude Code “Swarms” UI leaks suggest 70k–118k-token orchestration runs; a separate allegation says Claude Code edited Ghostty config but hasn’t been reproduced—both underline that agent UX is moving into always-on, host-mutating territory while guardrails remain uneven.
Top links today
- OpenRouter free models router
- Heterogeneous computing for AI agent inference
- Goldman Sachs report on AI agents in software
- Safeguarded outputs can teach harmful chemistry
- Token-level filtering to shape model capabilities
- VibeTensor agent-built tensor runtime paper
- Mixture-of-models deliberation framework paper
- LongCat-Flash-Thinking-2601 technical report
- The Economist on AI and software stocks
- Bloomberg on AI disruption and private credit
- Bloomberg on software exposure in private credit
- Firefox toggle to disable AI features
Feature Spotlight
OpenAI Codex app ships on macOS: multi-agent worktrees + Skills + scheduled Automations (with promo limits)
Codex moved from “a CLI/extension” to a multi-agent desktop command center: parallel threads on worktrees, reusable Skills, and scheduled Automations—plus 2× usage limits. This changes day-to-day agent orchestration for teams shipping code.
🧩 OpenAI Codex app ships on macOS: multi-agent worktrees + Skills + scheduled Automations (with promo limits)
Today’s dominant engineering story: OpenAI shipped the Codex desktop app for macOS as a multi-agent “command center” with built-in worktrees, Skills, and scheduled Automations—plus temporary plan changes (Free/Go access and doubled usage limits).
Codex app is macOS-only for now; Electron build is meant to speed Windows delivery
Platform rollout (Codex app): The app is macOS-first, with “Windows coming soon” repeated in the launch details, but users are already calling out the practical subscription mismatch for Windows-heavy shops in Windows frustration. A Codex engineer explains the implementation choice—Electron sharing code with the VS Code extension—as the reason Windows should move faster, while also admitting they originally hoped for simultaneous release in Electron rationale.
No public Windows date is given in the threads provided; the most concrete statement is the “get to Windows quickly” intent in Electron rationale.
Codex app “local environment actions” add one-click dev server/build triggers
Local environments (Codex app): Codex app supports configurable “actions” for local environments—e.g., start dev server or run builds—and runs them in an integrated terminal, as described in Actions tip. The corresponding setup details live in the local environments docs, which matters for teams who want reproducible agent runs across projects without retyping shell boilerplate each session.
Codex app Automations are tracked in a local SQLite DB (useful for debugging)
Automation observability (Codex app): The macOS Codex app tracks scheduled Automations state in a local SQLite database, and Simon Willison reports poking around the schema after preview access in SQLite notes. For engineers treating Automations as production-like background jobs, this makes runs and failures inspectable without waiting on a hosted dashboard, as described in the accompanying blog deep dive.
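For teams who want to poke at this themselves, a minimal read-only inspection sketch is below; the database path and schema are not published in the posts, so both are assumptions to adjust per machine.

```python
import sqlite3
from pathlib import Path

# Hypothetical path: the posts don't give the exact location, so point this at
# wherever the Codex app actually stores its Automations database.
db_path = Path.home() / "Library" / "Application Support" / "Codex" / "automations.db"

# Open read-only so poking around can't disturb live scheduler state.
con = sqlite3.connect(f"file:{db_path}?mode=ro", uri=True)
con.row_factory = sqlite3.Row

# Discover the schema first; table and column names are unknown, so dump them.
tables = [r["name"] for r in con.execute(
    "SELECT name FROM sqlite_master WHERE type='table'")]
print("tables:", tables)

for table in tables:
    print(f"\n== {table} (latest 5 rows) ==")
    for row in con.execute(f"SELECT * FROM {table} ORDER BY rowid DESC LIMIT 5"):
        print(dict(row))

con.close()
```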
Codex app demo shows agent self-checking by launching apps and running tests
Verification loop (Codex app): In OpenAI’s own walkthroughs, Codex is shown validating its changes by launching apps, running tests, and automating QA steps, framing verification as part of the agent loop rather than a separate human step—see the self-check demo.

This matters because parallel worktrees and Automations increase throughput, but they also increase the need for fast, trustworthy checks before humans approve diffs, which is the explicit workflow in Self-check demo.
Codex app setting: prevent macOS sleep while a thread is running
Long-running threads (Codex app): Codex app includes a “Prevent sleep while running” toggle to keep the machine awake during agent runs, which is highlighted directly in the settings screenshot and echoed as a practical tip in Tip thread.
This is the kind of setting that matters once Automations and multi-hour repo reviews become normal usage, as suggested by reports of long-running analysis sessions in Hours-long review.
Codex Skills can auto-install and auto-auth MCP servers via project config
MCP integration (Codex app): Skills can now auto-install and auto-auth MCP dependencies declared in a project config (an agents/openai.yaml file is mentioned), which the Codex team is already using for internal Skills per Skills plus MCP note and reiterated with an example from a Linear Skill in MCP auto-auth example. This ties the Skill layer to a dependency graph rather than per-user manual MCP setup.
OpenAI announces a Codex hackathon in SF with $90k credits and 1 year Pro prizes
Codex community push (OpenAI): OpenAI DevRel announced an in-person Codex hackathon in San Francisco (Feb 5), advertising $90,000 in credits and one year of ChatGPT Pro for winners, per the event invite. The timing is explicitly tied to the desktop app launch and doubled usage limits, suggesting OpenAI wants real workflows and Skills shared quickly while the promo window is active, as implied by Event invite and Promo scope.
Codex app account friction: log out is in the macOS menu bar, no account switcher yet
Account management (Codex app): Early UX friction shows up around switching between personal and enterprise accounts; one user reports being stuck on “Personal” with no obvious switch path in Account issue. The current workaround is logging out via menu > Codex > Log Out, as shown in the menu screenshot, with a team member confirming “no account switcher” in Team reply.
This is a small detail, but it’s operationally important for anyone trying to run Codex app against enterprise repos and policies, as implied by Account issue.
Codex app is being used for product ops work across Linear, Notion, and Slack
Non-coding workflows (Codex app): Codex team members report using the app beyond coding—doing product work across tools like Linear, Notion, and Slack, with “local integrations” described as easier to customize through Skills in Product work note. Another team member says they use it for non-coding product work “all the time” in Non-coding usage. OpenAI also published a demo framing PM usage as going from feedback to shipped fixes, shown in the PM workflow video.

Codex app notifications for approvals and completions work while the app is backgrounded
Notifications (Codex app): The app can send notifications for approval requests and thread completions while running in the background, which a team member calls out as useful for keeping feedback loops tight without keeping the app front-most in Notifications tip.
🧠 Claude Code & Cowork: swarm features, mobile knobs, and integration footguns
Continues the Claude Code/Cowork storyline, but today’s novelty is around swarm-style parallelism previews and UX/platform quirks (mobile/web differences), plus reports of questionable tool behavior affecting developer environments.
Claude Code “Swarms” preview shows multi-team, hierarchical sub-agent orchestration
Claude Code Swarms (Anthropic): Early screenshots show a “Swarms on Claude Code” workflow that coordinates multiple sub-agents/“leads” with owners, dependencies, broadcasts, and a message system, with the author calling it an “absolute token destroyer” in Swarms preview.
• Orchestration shape: The UI/log implies a team-lead view tracking parallel execution across leads (backend/platform/research/etc.), with explicit “blocked by” relationships and status updates, as shown in Swarms preview.
• Cost/throughput signal: The same screenshot surfaces per-agent tool-use counts and token totals in the ~70k–118k range for a single run, reinforcing that swarms are a “burn tokens to buy wall-clock time” pattern, per Swarms preview.
• Feature parity rumors: Separate chatter describes the swarm concept as “runs multiple sub-agents in parallel” with “own context” and “background tasks,” as summarized in Swarm feature rumor.
Nothing here is an official ship note yet; it’s still “preview + rumor” evidence.
Claude Cowork adds plugins; Anthropic open-sources 11 starter plugins
Claude Cowork plugins (Anthropic): Cowork is described as gaining a plugin system that bundles skills, connectors, slash commands, and sub-agents, alongside “11 starter plugins” being open-sourced for functions like sales/finance/legal/marketing/support, per Plugin announcement.
• What “plugin” seems to mean: The UI shows a commands/skills picker with role-specific commands like “research-prospect,” implying a shareable workflow layer over Cowork sessions, as shown in Plugin announcement.
• Enterprise packaging angle: The starter set is framed as role templates (sales/finance/legal/etc.), which points at “repeatable workflow artifacts” rather than ad-hoc prompting, per Plugin announcement.
The tweets don’t include repo links or versioned docs for the 11 plugins, so treat implementation details as unverified until Anthropic publishes artifacts.
Claude Code in Slack: @Claude mentions can spawn sessions and push fixes
Claude Code in Slack (Anthropic): Teams describe a workflow where they tag @Claude inside a feedback channel, have it investigate issues, and push fixes, with setup described in the Slack docs linked from Slack docs and a usage claim in Workflow example.
• How it appears to work: The docs describe Slack mentions triggering session creation on Claude Code, i.e., Slack becomes the “intake queue” while coding happens in Claude Code, per Slack docs.
• Operational pattern: The pitch is faster response to user feedback by turning bug reports into an agent-run investigation loop, as stated in Workflow example.
No concrete limits (repo access scope, permission model, pricing) are stated in the tweets; those details likely live in the docs.
Report: Claude Code wrote to Ghostty config and broke Shift+Enter in other TUIs
Claude Code ↔ Ghostty config (Anthropic): A maintainer reports two incidents where Claude Code allegedly modified a user’s Ghostty terminal config to add Shift+Enter behavior, which then “breaks other TUIs” that use the Kitty keyboard protocol, per Terminal config allegation.
• Why engineers care: Writing to a terminal emulator’s config crosses a boundary (host environment mutation) and creates hard-to-debug UX regressions across unrelated tools, as described in Terminal config allegation.
• Security framing: The author calls it “terrible practice” and notes the same pathway could be abused maliciously if software can touch terminal config, per Terminal config allegation.
The report notes they can’t reproduce it; there’s no confirmation from Anthropic in the dataset.
Sonnet 5 rumors shift toward “Swarms” features and a May 2025 cutoff claim
Claude Sonnet 5 (Anthropic): New leak chatter claims a knowledge cutoff of May 2025, per Cutoff claim, while repeated “tomorrow is the day” posts keep tying the release week narrative to Claude Code swarm-style features, following up on leak pack (benchmark/pricing rumor bundle).
• Feature linkage: The newer angle is “Sonnet 5 ships with new Claude Code features,” including swarm-style parallel sub-agents, as described in Swarm feature rumor and reinforced by the broader “drop this week” posts in Super Bowl week rumor.
• Representative quotes: “Agentic Swarm” with “sub-agents in parallel” appears in Swarm feature rumor, “tomorrow is the day” appears in Swarm feature rumor, and “knowledge cut off - may 2025” appears in Cutoff claim.
All of this remains unconfirmed in official release notes inside the provided tweets.
Claude mobile app shows tool-notification toggle and Plan/Code mode switch
Claude mobile (Anthropic): Screenshots show a new notifications toggle set including “Tool notifications,” plus a “Select mode” picker that exposes Plan (“explores code and presents a plan before making edits”) versus Code (“writes and edits code directly”), as shown in Mobile settings leak.
• Why it matters for Claude Code users: If Plan/Code is available on mobile, it changes “approval latency” for long-running sessions—people can review/steer without being at a laptop, per Mobile settings leak.
• Tooling surface hints: The same thread claims “Tools will arrive on mobile too,” which would put tool-call sessions onto the phone as a control plane, per Mobile settings leak.
There’s no official rollout note in the tweets; this is UI evidence only.
Claudeception quirk: artifacts calling Anthropic API work on web, fail on iOS
Claude artifacts “Claudeception” (Anthropic): A recurring report says an Artifact that calls Anthropic’s API returns “Invalid response format” on iOS, but works on the Claude website, per the sequence in Bug report and the follow-up in Web-only workaround, with confirmation in iOS vs web confirmation.
This is a concrete platform mismatch: if you rely on Artifact→API calls, claude.ai web appears to be the stable surface right now, per Web-only workaround.
Swarms skepticism: “most people shouldn’t be running swarms” and $1k/mo plan fears
Swarms adoption risk (Claude Code): Alongside hype, there’s explicit pushback that swarms are overkill—one post says “I don't think most people should be running swarms” in ROI skepticism, and another warns swarm features could imply a $1k/month tier and “you probably shouldn’t pay for it” in Pricing skepticism.
This frames swarms as a specialized tool (coordination overhead + token burn) rather than a default workflow.
Claude Max plan friction: token budget guilt and requests for rollover tokens
Claude Max subscription (Anthropic): Users are describing “budget psychology” around fixed monthly limits—one says they “feel guilty” not using their Max budget in Budget guilt, while another asks for “rollover tokens like mobile carriers” in Rollover request.
This is less about model quality and more about how consumption plans shape usage patterns.
Model positioning chatter: Gemini 3 Pro ‘smarter overall’ vs Opus 4.5 for coding
Model choice framing: One widely shared take claims Gemini 3 Pro is “generally smarter” on overall intelligence, while Claude Opus 4.5 remains better at programming, per Tradeoff claim.
The tweet provides no eval artifact; it’s a packaging narrative that matches how teams often split “general reasoning” vs “agentic coding” workloads.
🧰 Agent runners & always-on coordination: harnesses, scheduling, and team chat for agents
Operational tooling for running agents (not models): multi-agent coordination layers, agent-to-agent collaboration spaces, and “agent workforce” patterns. Today centers on Slack-like coordination for agents and harness-level scaling pain points.
MoltSlack brings Slack-style channels to AI agents via OpenClaw
MoltSlack (Agent Relay): A new real-time coordination workspace lets agents join channels, maintain online presence, and message each other “like humans do in Slack,” with onboarding driven by a SKILL.md flow that you can hand to an agent, as shown in MoltSlack launch.

• Join and auth mechanics: The flow is explicitly two-step (human registers, agent claims via token), per the published Skill specification that MoltSlack launch points to.
• Ops expectations: MoltSlack’s spec bakes in “real-time” assumptions (presence heartbeats, frequent polling during active conversations), which is the kind of detail that tends to make or break always-on agent coordination in practice, as described in the same Skill specification.
OpenClaw shows up as a top-tier workload on OpenRouter usage charts
OpenClaw (harness) demand: OpenRouter shared a dashboard-style snapshot indicating OpenClaw-driven traffic “tracking for 14B tokens today” and placing multiple models behind an OpenClaw leaderboard, as shown in Usage snapshot.
• Why it matters operationally: This is a harness-level scale signal—routing, retries, and long-running session hygiene become first-order problems when usage climbs into multi‑billion token days, as implied by the volume in Usage snapshot.
• Model mix: The same screenshot highlights a diversified provider set (Google, Moonshot, Arcee, Anthropic, DeepSeek, xAI), which suggests OpenClaw users are already treating the harness as a multi-model runtime rather than a single-model product, per Usage snapshot.
Agent chat coordination is converging on heartbeat and polling cadences
Agent coordination ops: MoltSlack’s published integration spec encodes an operational model for “being online”—agents maintain presence with periodic heartbeats and are expected to poll frequently (single-digit seconds) in active conversations, as spelled out in the Skill specification referenced by MoltSlack launch.
The practical implication is that agent runners now need explicit background loops (timers, retry/backoff, reconnect logic) rather than treating “chat” as a request/response surface.
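As a rough illustration of that operational model, here is a minimal Python sketch of a presence-heartbeat-plus-polling loop; the endpoints, payload fields, and intervals are hypothetical stand-ins, since the real contract lives in MoltSlack’s SKILL.md.

```python
import time
import requests

# All endpoint paths, field names, and intervals here are hypothetical; the
# real MoltSlack spec lives in its SKILL.md. This only shows the loop shape:
# periodic presence heartbeats plus short-interval polling while a conversation
# is active, with backoff on errors.
BASE_URL = "https://moltslack.example.com/api"   # placeholder
TOKEN = "agent-claim-token"                      # issued via the human-register / agent-claim flow
HEADERS = {"Authorization": f"Bearer {TOKEN}"}

HEARTBEAT_EVERY = 30   # seconds between presence pings
ACTIVE_POLL = 3        # single-digit-second polling while a thread is active
IDLE_POLL = 15

def heartbeat():
    requests.post(f"{BASE_URL}/presence/heartbeat", headers=HEADERS, timeout=10)

def fetch_messages(since_id):
    resp = requests.get(f"{BASE_URL}/channels/general/messages",
                        params={"after": since_id}, headers=HEADERS, timeout=10)
    resp.raise_for_status()
    return resp.json().get("messages", [])

def run():
    last_heartbeat, since_id, backoff = 0.0, 0, 1
    while True:
        now = time.time()
        try:
            if now - last_heartbeat >= HEARTBEAT_EVERY:
                heartbeat()
                last_heartbeat = now
            messages = fetch_messages(since_id)
            for msg in messages:
                since_id = max(since_id, msg["id"])
                # hand the message off to the agent harness here
            backoff = 1
            time.sleep(ACTIVE_POLL if messages else IDLE_POLL)
        except requests.RequestException:
            time.sleep(backoff)
            backoff = min(backoff * 2, 60)  # exponential backoff, capped

if __name__ == "__main__":
    run()
```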
Kimi K2.5 provider overload becomes an uptime constraint for agents
Kimi K2.5 (inference capacity): One operator reports that “all of our inference providers for kimi k2.5 are overloaded” and that those providers have asked them to scale down, which is a direct constraint on always-on agent runners that depend on that model’s availability, per Capacity note.
The same account’s follow-up on terminal UX direction for opencode suggests demand is being pushed through agentic CLI workflows (not just chat), as illustrated in Opencode run screenshot, which is consistent with the overload pressure described in Capacity note.
Agent Relay repo is positioned as the substrate for agent workspaces
Agent Relay (AgentWorkforce): The team behind MoltSlack published the underlying repo for building “agent workspaces,” with Matt Shumer explicitly pointing builders at the codebase in his follow-up, as linked in Repo announcement.
• What it is: A foundation for real-time agent coordination primitives (registration, auth tokens, channels, presence), as framed in Repo announcement.
• Where to start: The primary artifact is the GitHub repo, which is referenced directly from Repo announcement.
Superset keeps surfacing as a “multi-session” agent runner UI
Superset (superset_sh): Superset continues to show up in “run multiple sessions” workflows, with users explicitly testing it for parallel Claude sessions, as reflected in the retweet in User trial note.
• Migration pressure point: The product direction implied in Superset chatter is worktree/session portability—e.g., importing worktrees from other apps—per Worktree import mention.
• Framing: The pitch is closer to “orchestrating terminals” than building a new IDE layer, based on how users describe it in User trial note.
Conductor shows up in side-by-side comparisons with Superset
Conductor (conductor_build): Builders are explicitly comparing Conductor against Superset for managing multiple agent sessions and workstreams, as captured in the “Conductor vs Superset” discussion link shared in Comparison link.
The thread list doesn’t include concrete release notes here, so treat this as early positioning signal rather than a specific shipped change; the main observable is that “agent runner workspace” is emerging as a distinct product category in day-to-day tool selection, per Comparison link and Comparison mention.
🧭 Agentic engineering practice: context discipline, self-improvement loops, and prompt ergonomics
Reusable practices engineers can apply across tools (not tied to a single product release). Today’s tweets emphasize harness self-improvement from traces, context engineering as a first-class skill, and prompt ergonomics (persona control, abstraction level shifts).
A trace-driven loop for improving agents without fine-tuning
Harness iteration loop: One concrete recipe circulating is to treat agent runs as data—log every tool call and decision as traces, then use parallel subagents to review failures and extract patterns, write the learnings back into files/rules, and rerun; the full step-by-step loop is laid out in self improvement loop.
• Why traces matter: The motivation is that agent failures often have no stack trace—observability becomes the debugging surface, matching the framing in observability webinar.
This is explicitly positioned as “hill climbing” task success using natural language changes to the harness, not model weight updates.
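A minimal sketch of the trace-logging side of that loop is below; the file layout, failure criterion, and rules file are illustrative choices, not anything prescribed in the thread.

```python
import json
import time
from pathlib import Path

TRACE_DIR = Path("traces")          # hypothetical layout; adjust to your harness
RULES_FILE = Path("AGENTS.md")      # where extracted learnings get appended

def log_tool_call(run_id: str, tool: str, args: dict, result: str, ok: bool) -> None:
    """Append one tool call as a JSONL trace event."""
    TRACE_DIR.mkdir(exist_ok=True)
    event = {"ts": time.time(), "run_id": run_id, "tool": tool,
             "args": args, "result": result[:2000], "ok": ok}
    with open(TRACE_DIR / f"{run_id}.jsonl", "a") as f:
        f.write(json.dumps(event) + "\n")

def failed_runs() -> list[Path]:
    """A run 'fails' here if any tool call was not ok; real criteria are task-specific."""
    bad = []
    for path in TRACE_DIR.glob("*.jsonl"):
        events = [json.loads(line) for line in path.read_text().splitlines()]
        if any(not e["ok"] for e in events):
            bad.append(path)
    return bad

def write_learning(note: str) -> None:
    """Write a reviewed pattern back into the rules file the agent reads on the next run."""
    with open(RULES_FILE, "a") as f:
        f.write(f"\n- {note}\n")

# In the circulating recipe, the review step is itself done by parallel subagents
# reading these trace files; write_learning() is where their conclusions land
# before the task is rerun.
```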
Coding agents push software work from writing code to supervising outcomes
Agentic programming abstraction: Ex-OpenAI researcher Jerry Tworek frames coding agents as “the next level of abstraction” and predicts “soon, almost no one will type code directly,” with the downstream problem becoming reliability when humans are neither writing nor reading most of the code, as described in Tworek clip.

• Supervision becomes the scarce resource: A matching internal-builder sentiment is that the bottleneck shifts to “how fast we can help and supervise the outcome,” per supervision bottleneck.
Taken together, this is less a model capability claim than a workflow reallocation claim: reviews, test coverage, and auditability become the main surfaces for human contribution.
Autocomplete fades as builders switch to chat-level micromanagement
Coding workflow ergonomics: A visible habit shift is moving from autocomplete to giving higher-level, targeted instructions in chat—e.g., “add a new optional atomic=True param…”, while still supervising details; Simon Willison describes this as “micromanaging the code but at a different level” in abstraction shift.
The implicit pattern is that prompts become the primary editing interface, with autocomplete used less as the unit of work moves from tokens to intentful diffs.
Context engineering is becoming its own discipline for inference
Context engineering: A recurring builder framing is that “context engineering is as impt to inference as data engineering is to training,” as stated in context engineering thesis.
That’s a recognition that a lot of real-world performance now comes from how you assemble, compress, and refresh context—not just which model you pick.
Gemini persona prompts work better when the persona is just a job title
Gemini prompting (Google): The Gemini team is pushing a small but repeatable prompt ergonomic—start with a simple professional persona (e.g., “Act as a [profession]”) rather than an elaborate character description; they claim it reliably shifts tone and response shape in useful ways, as described in the persona prompt tip.
In practice, this shows up as a cheap way to steer outputs when you’re running the same task across many threads or agents, where consistency matters more than creativity.
Multi-model harnesses are forcing teams to read model prompting guides
Prompt ergonomics across models: As teams mix models in a single harness, one practical advantage comes from reading each model’s prompting guide and copying its idiosyncratic preferences (tool schemas, autonomy cues, formatting); an example called out is that some Codex setups perform better with a shell command expressed as a single string rather than a list, per prompting guide tip.
This is less about “better prompts” and more about aligning the harness with how the model was RL’d to consume tool and control signals.
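A toy illustration of the single-string-vs-argv point, with invented payloads (the real schemas live in each model’s prompting guide):

```python
# Illustrative only: the exact tool schema a given model was trained on lives in
# its prompting guide. The claim in the tip is that some Codex setups do better
# when a shell command arrives as one string rather than an argv-style list.

# argv-style list (what many harnesses emit by default)
tool_call_list_form = {
    "name": "shell",
    "arguments": {"command": ["pytest", "-x", "tests/test_api.py"]},
}

# single-string form (what the tip says to prefer for those setups)
tool_call_string_form = {
    "name": "shell",
    "arguments": {"command": "pytest -x tests/test_api.py"},
}

def normalize(call: dict) -> dict:
    """Harness-side shim: collapse argv lists into a single string before sending."""
    cmd = call["arguments"]["command"]
    if isinstance(cmd, list):
        call = {**call, "arguments": {**call["arguments"], "command": " ".join(cmd)}}
    return call

assert normalize(tool_call_list_form) == tool_call_string_form
```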
“Vibe coding” hits its one-year anniversary as a community workflow marker
Vibe coding historiography: Multiple posts note it’s been roughly a year since “vibe coding” was coined, as reflected in one-year note and echoed again in anniversary mention.
The signal here is speed of normalization: what was a meme term in early 2025 is now used as a default description for prompt-first building workflows.
🧱 Skills & extension ecosystem (beyond built-ins): marketplaces, moderation, and purpose-built skills
Third-party and community extension ecosystems for agents/coding assistants. New today: security/moderation moves in skill marketplaces and community requests for purpose-built skill packs (e.g., TypeScript).
ClawHub adds skill reporting and upload gating to reduce skill-market spam risk
ClawHub (OpenClaw): The ClawHub skill marketplace added “report skill” support and now restricts uploading skills to users with GitHub accounts that aren’t brand-new, with the goal of making the directory more trusted as it scales, as described in security update and documented in the moderation docs.
This is a concrete supply-chain hardening move for teams pulling third-party skills into agent harnesses, where the failure mode is rarely “bad code” and more often “unvetted behavior embedded in a convenient package.”
OpenSkills 2 pitches cross-agent skill distribution with auto-install and lockfile sync
OpenSkills 2: A new OpenSkills 2 flow claims it can detect multiple agent clients (Claude Code, Cursor, Codex, OpenCode, Amp, Gemini CLI, etc.) and install skills to all of them automatically, while also emphasizing team distribution primitives like syncing from a lockfile and importing a lockfile to match setup, as shown in install screen alongside the GitHub repo.
If this holds up in practice, it pushes “skills” closer to a portable dependency layer across agents rather than a per-tool customization folder.
Browser Use skill adds domain-specific cookie profiles for browser automation
Browser Use (Skill): The Browser Use CLI skill is being pitched as a way to create reusable “profiles” with domain-specific cookies for browser automation, as described in profiles note and linked via the skills.sh listing.
For agent builders, this is a pragmatic way to separate auth/session state by task or domain (e.g., multiple tenants), without hand-rolling profile management inside the harness.
TypeScript skill packs shift from “best practices” to task-specific outcomes
TypeScript skills (Pattern): Matt Pocock is collecting requests for a TypeScript skill pack and argues that skills work better when scoped to a concrete outcome (e.g., JS→TS migration, fixing “spaghetti types,” TypeScript performance work, extracting types from libraries) instead of a generic “best practices” prompt, as framed in skill design prompt.
This is consistent with how teams treat skills as repeatable procedures (inputs, constraints, and checks), not as one-off advice.
✅ Code quality automation: security agents, PR spam, and release autopilots
Focuses on correctness/maintainability rather than “write more code.” Today includes an AI security agent generating PoCs/PRs and a new release autopilot that does changesets + CI + GitHub releases, plus signs of AI-generated issue/PR spam pressure.
GeminiCLI security agent found a critical OpenClaw bug, wrote a PoC, and landed the fix
GeminiCLI code security agent (Google): A Gemini-powered security agent was used to audit the OpenClaw project, report a critical Local File Inclusion / arbitrary file read risk, generate a proof-of-concept, open a pull request, and get the fix merged “in a matter of hours,” according to the incident recap in Incident summary.
• Why it matters operationally: This is the full “detect → reproduce → patch → merge” remediation loop done by an agent, not just findings—see the structured audit artifact (severity, impacted file paths, and exfil channel) captured in Incident summary.
• Risk shape (high-level): The reported issue frames prompt-injection-driven data exfiltration through OpenClaw’s media delivery path, with sensitive local files as targets, as described in Incident summary.
Maintainers report AI issue/PR spam pushing unwanted rewrites
GitHub maintainer signal: A maintainer of the OpenCode repo reported an apparent AI-generated flood of GitHub issues proposing a broad UI rewrite (moving from opentui to Ink), illustrating how agent-driven contribution spam can create triage load and roadmap noise, as shown in the issue list screenshot in Issue flood screenshot.
• Failure mode: The screenshot shows many near-simultaneous, similarly structured “Implement X” issues plus an “Epic: Ink-based TUI Rewrite,” suggesting automated generation rather than organic contributor demand, as evidenced in Issue flood screenshot.
• Why it matters for code quality automation: When agents can open large volumes of plausible tickets, maintainers need stronger intake controls (rate limits, identity checks, templates, or bot gating) just to preserve review bandwidth—this pressure is the core signal in Issue flood screenshot.
Vercel Labs ships Autoship: an end-to-end changeset release autopilot
Autoship (Vercel Labs): autoship is an open-source CLI that automates a changeset-based release from commit review through GitHub release creation—reviewing commits, suggesting patch/minor/major, generating a changelog + changeset, opening a PR, waiting for CI, merging, and publishing the release, as outlined in the launch post Feature list and the accompanying GitHub repo.
• Workflow impact: It formalizes “release as a PR” with CI gating and merge/release automation, matching the step-by-step pipeline described in Feature list.
• Integration surface: The repository describes a changesets-compatible toolchain with GitHub CLI + an API key requirement, per the setup notes in the GitHub repo.
🔌 Interop & routing: OpenRouter Free Router, local-model glue, and assistant portability
Plumbing that connects tools/models: routing layers, model selection hygiene, and cross-assistant portability. New today: OpenRouter’s free router and operational notes for OpenClaw users, plus Gemini chat import surfaced.
OpenRouter adds openrouter/free: automatic routing across free models
openrouter/free (OpenRouter): OpenRouter shipped a new router, openrouter/free, that automatically routes requests across free models while filtering for compatibility with your request (e.g., images, structured outputs, tool calling), as announced in the router launch; the selection is also influenced by your privacy settings, per the Privacy settings guidance.
• Compatibility-first routing: The router “selects for compatibility with your request,” which matters if you mix JSON/structured responses, tools, and multimodal inputs in one harness, as described in the Free router page.
• Operational detail: The same post points people to a web UI for trying it immediately, per the router launch.
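For API users, a minimal sketch of calling the router through OpenRouter’s OpenAI-compatible endpoint is below; the prompt is arbitrary and the key is a placeholder.

```python
from openai import OpenAI

# OpenRouter exposes an OpenAI-compatible API; "openrouter/free" is the router
# id from the announcement. Routing among free models is automatic, filtered for
# request compatibility, and also shaped by your account's privacy settings.
client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="sk-or-...",  # your OpenRouter key (placeholder)
)

resp = client.chat.completions.create(
    model="openrouter/free",
    messages=[{"role": "user", "content": "Summarize this changelog in three bullets: ..."}],
)

print(resp.model)                      # which free model the router actually picked
print(resp.choices[0].message.content)
```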
Ollama promotes “ollama launch openclaw” for local-model OpenClaw setups
OpenClaw on Ollama (Ollama): Ollama shared a setup flow to connect OpenClaw to local models via ollama launch openclaw, then configure messaging integrations (WhatsApp, Telegram, iMessage, Slack, Discord, etc.), with an optional cloud fallback if you don’t have local hardware, as laid out in the setup steps and expanded in the Ollama guide.
This is a concrete “local models + real messaging surfaces” on-ramp: install OpenClaw, pick models, then wire the same agent into multiple comms channels.
OpenClaw model-defaults gotcha when switching to openrouter/free
OpenClaw config hygiene: To let OpenClaw fall back to the new openrouter/free routing behavior, OpenRouter notes you may need to unset the pinned default model and restart the gateway—specifically running openclaw config unset agents.defaults.models, as shown in the OpenClaw config note, following the router launch in the router announcement.
This is a concrete “why is it not using the router?” failure mode when teams previously hard-pinned models in a shared agent gateway config.
A practical default-model pick for OpenClaw: Step 3.5 Flash
OpenClaw model selection: OpenRouter’s OpenClaw tip recommends asking OpenClaw to use the Step 3.5 Flash model for a “fast, powerful, and free” default, per the model tip.
The useful part for harness builders is that this is framed as a default routing choice (speed/cost) rather than a benchmark claim.
Gemini adds “Import AI chats (BETA)” to reduce assistant switching costs
Gemini (Google): Gemini’s UI now shows an “Import AI chats (BETA)” option—positioned as letting users bring chat history from ChatGPT and other assistants—per the Gemini menu screenshot.
For teams evaluating assistants, this is an explicit product move against lock-in via history: migrating context becomes a first-class feature rather than a manual export/import workflow.
📦 Model releases & availability (open + proprietary): coding, OCR, and TTS upgrades
New or newly-available models (and notable capability deltas). Today is heavy on open releases and “now GA/now commercial” announcements: Step 3.5 Flash, GLM-OCR, and Eleven v3, plus open-model leaderboard chatter.
Step 3.5 Flash is now widely usable as a free, high-throughput hosted model
Step 3.5 Flash (StepFun): Following up on Initial release (196B MoE, agent-first positioning), builders now have a “try it right now” path via a free hosted SKU that reports ~171 tokens/sec throughput and 256k context, as shown in the Free model metrics.
• What’s concrete today: vLLM’s deployment post reiterates the key operating profile—196B MoE with ~11B active params/token, 256k context, and agent/coding benchmark callouts like 74.4% SWE-bench Verified—per the vLLM support post.
• Why engineers care: this lowers friction for quick evals (prompting style, tool-calling reliability, long-context behavior) before you commit to running weights or reworking your routing stack; treat the perf numbers as provider-specific because they’re reported by the host in Free model metrics.
GLM-OCR launches with 0.9B params and a production-oriented deployment surface
GLM-OCR (Z.ai): Z.ai introduced GLM-OCR, a 0.9B-parameter OCR/document-understanding model positioned for real documents (tables, formulas, seals, code-heavy layouts) and shared weights plus demo/API entry points in the Model launch thread. At 0.9B parameters, it’s small enough to be operationally interesting.
• Bench + scope claim: Z.ai highlights “SOTA across major document benchmarks” despite the small size, with a comparison table posted in Model launch thread.
• Local + serving paths: Ollama added ollama pull glm-ocr for local use and scripting, per the Ollama pull instructions; LMSYS/SGLang published day-0 server flags (including speculative decoding settings) in the SGLang launch command.
More performance validation beyond vendor benchmarks isn’t in the tweet set yet; what is clear is the breadth of ready-to-run surfaces shipped on day one via Ollama pull instructions and SGLang launch command.
Kimi K2.5 is getting “best open” claims across coding and reasoning evals
Kimi K2.5 (Moonshot): Following up on Arena ranking (open-model momentum in coding evals), today’s chatter pushes Kimi K2.5 as the top open model in Code Arena’s agentic coding evaluations, per the Code Arena ranking claim, and also shows it topping an Extended NYT Connections benchmark at 78.3 in the Connections benchmark chart. These are different test styles, and consistency across them is the signal worth watching.
• Agentic coding leaderboard claim: Code Arena says Kimi K2.5 is “#1 open model” and “#5 overall,” with the positioning stated in the Code Arena ranking claim.
• Ad-hoc but detailed math testing: Cedric Chee reports fast solutions on several Math Olympiad problems and notes a key caveat—Moonshot hasn’t disclosed knowledge cutoff, so leakage can’t be ruled out—per the Math olympiad runs.
Treat these as provisional: there’s no single canonical eval bundle linked in the tweets, but multiple communities are converging on Kimi K2.5 as the open model to beat, as reflected in Code Arena ranking claim and Connections benchmark chart.
Eleven v3 becomes GA with fewer numeric and notation errors
Eleven v3 (ElevenLabs): ElevenLabs moved Eleven v3 from alpha to general availability for commercial use, emphasizing a 68% reduction in errors on numbers/symbols/technical notation in the GA announcement. This is a common failure mode for production voice UX, so the fix is more than cosmetic.

• What improved: they specifically call out better handling for phone numbers (digit-by-digit) with a before/after example in the Number parsing example, and summarize category-level gains again in the GA announcement.
• Where to confirm details: see the GA blog post.
The tweets don’t include latency or pricing changes, so the main new datum here is reliability on structured strings (numbers, URLs/emails, formulas) rather than “more natural voice.”
Arcee’s Trinity Large preview surfaces with a 512k context pitch
Trinity Large (Arcee): A new “Trinity Large” preview is being promoted as an open model aimed at large codebases, refactors, and architecture planning, with a 512k context claim (served at 128k while infra scales) and free access during the preview period, according to the Preview details. Short context windows are often the limiter for repo-scale work, which is the pitch here.
More specifics (availability surface, limits, and any independent eval artifact) are only described at a high level in tweets; the primary reference is the Preview announcement.
Qwen3-Max-Thinking is now a Code Arena contender
Qwen3-Max-Thinking (Alibaba): Code Arena added Qwen3-Max-Thinking as a selectable model for its agentic coding matchups, per the Model entry post. This isn’t a weight release; it’s a new comparison surface.
The tweet doesn’t include pricing, context length, or benchmark numbers yet; it’s an “in the arena now” availability update, and results will depend on Arena’s voting/eval loop described in Model entry post.
⚙️ Serving & self-hosting: day-0 runtime support, local OCR, and throughput claims
Runtime and deployment engineering (vLLM/SGLang/Ollama), including day-0 support for new models and local deployments. Today features Step 3.5 Flash serving recipes, GLM-OCR local pulls, and production-serving notes.
vLLM posts a day-0 deployment recipe for Step 3.5 Flash (reasoning + tool parsers)
Step 3.5 Flash on vLLM (vLLM Project): vLLM shared a concrete day-0 serving recipe for StepFun Step 3.5 Flash—including --reasoning-parser step3p5, --tool-call-parser step3p5, --enable-auto-tool-choice, and a speculative decoding config—anchored by the setup commands in the vLLM deployment post.
This matters because it’s the difference between “model weights exist” and “agents can actually tool-call reliably in production”; the post’s flags imply StepFun needs custom parsing to get correct tool-call + reasoning behavior through the OpenAI-compatible server surface, as shown in the vLLM deployment post.
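Assuming a server launched with those flags, a minimal client-side check of tool-call parsing through the OpenAI-compatible endpoint could look like this (the served model id is a placeholder):

```python
from openai import OpenAI

# Assumes a local vLLM server started with the day-0 flags from the post
# (reasoning/tool-call parsers plus --enable-auto-tool-choice), exposing an
# OpenAI-compatible API on :8000. The model id must match what vLLM serves.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

tools = [{
    "type": "function",
    "function": {
        "name": "run_tests",
        "description": "Run the project's test suite and return the summary line.",
        "parameters": {
            "type": "object",
            "properties": {"path": {"type": "string"}},
            "required": ["path"],
        },
    },
}]

resp = client.chat.completions.create(
    model="stepfun-ai/Step-3.5-Flash",   # placeholder; use the served model id
    messages=[{"role": "user", "content": "Run the API tests and tell me if they pass."}],
    tools=tools,
)

msg = resp.choices[0].message
# With auto tool choice and the custom parsers enabled, a well-parsed response
# lands in tool_calls rather than as raw text in content.
if msg.tool_calls:
    for call in msg.tool_calls:
        print(call.function.name, call.function.arguments)
else:
    print(msg.content)
```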
SGLang claims day-0 Step 3.5 Flash support with up to 350 tokens/sec
Step 3.5 Flash on SGLang (LM-SYS): LM-SYS announced day-0 SGLang support for Step 3.5 Flash, framing it as “agent-first reasoning” and calling out up to 350 TPS plus long-context handling up to 256K in the SGLang support note.
Deployment docs were also published, as linked in the SGLang cookbook.
The key engineering detail is that SGLang is advertising throughput + long-context viability as first-class properties (not just correctness), per the SGLang support note.
Ollama ships a one-command local install path for GLM-OCR
GLM-OCR on Ollama (Ollama): Ollama published an install+run path for GLM-OCR via ollama pull glm-ocr, positioning it as local-first document OCR (text/tables/figures) with terminal drag-and-drop and API access, as described in the Ollama command post.
The practical artifact is the model’s entry in Ollama’s library—see the Ollama model page—which turns GLM-OCR into something you can script or wire into internal services without shipping documents off-device, consistent with the Ollama command post.
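A minimal sketch of scripting that local endpoint is below; the prompt wording and file name are illustrative, and the only assumption is that `ollama pull glm-ocr` has already been run.

```python
import base64
import requests

# Minimal sketch against Ollama's local HTTP API (default :11434), assuming
# `ollama pull glm-ocr` has been run. The /api/generate endpoint accepts
# base64-encoded images for multimodal models; the prompt wording here is just
# an example, not an official GLM-OCR prompt format.
with open("invoice.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "glm-ocr",
        "prompt": "Extract all text and tables from this document as Markdown.",
        "images": [image_b64],
        "stream": False,
    },
    timeout=300,
)
resp.raise_for_status()
print(resp.json()["response"])
```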
SGLang posts a GLM-OCR launch command with EAGLE speculative decoding flags
GLM-OCR on SGLang (LM-SYS): LM-SYS highlighted day-0 support for GLM-OCR and shared a concrete sglang.launch_server command line, including --speculative-algorithm EAGLE and related draft-token flags, in the SGLang GLM-OCR post.
The post also reiterates the intended deployment angle—0.9B params and “high-concurrency ready” OCR—while pointing to the runnable docs in the SGLang cookbook and the weights in the Model weights.
A hosted Step 3.5 Flash free listing shows ~171 tokens/sec throughput
Open-weights serving signal (hosted provider page): A provider listing screenshot for Step 3.5 Flash (free) shows reported throughput around 171 tokens/sec, ~1.02s latency, and 256K context, as captured in the Provider stats screenshot.
This is an adoption signal more than a benchmark: it suggests the model is being offered in a “free tier” routing context with production-ish telemetry exposed, per the Provider stats screenshot.
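Because those numbers are host-reported, a quick client-side sanity check is to stream one response and time it yourself; the sketch below assumes an OpenAI-compatible endpoint and uses placeholder base URL, key, and model id.

```python
import time
from openai import OpenAI

# Client-side sanity check for host-reported throughput: stream one response
# from whatever OpenAI-compatible endpoint serves the model and compute the
# rate yourself. Base URL, key, and model id below are placeholders.
client = OpenAI(base_url="https://openrouter.ai/api/v1", api_key="sk-or-...")

start = time.time()
first_token_at = None
chunks = 0

stream = client.chat.completions.create(
    model="stepfun-ai/step-3.5-flash:free",   # placeholder model id
    messages=[{"role": "user", "content": "Write a 300-word summary of git worktrees."}],
    stream=True,
)

for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        if first_token_at is None:
            first_token_at = time.time()
        chunks += 1  # rough: counts streamed chunks, not exact tokens

if first_token_at is None:
    raise SystemExit("no content streamed")

elapsed = max(time.time() - first_token_at, 1e-6)
print(f"latency to first token: {first_token_at - start:.2f}s")
print(f"~{chunks / elapsed:.0f} chunks/sec after first token")
```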
🏗️ Compute & infra signals: GPU scarcity, mega-capex narratives, and SpaceX↔xAI consolidation
Compute availability and capex signals affecting AI deployment economics. Today includes GPU scarcity anecdotes, data-center financing chatter, and SpaceX acquiring xAI framed as an AI-infra/compute play.
SpaceX acquires xAI and frames it as an AI compute play
SpaceX + xAI (Elon Musk): SpaceX says it has acquired xAI, with Musk posting that the two are “now one company” in the Merger post. The public narrative is explicitly infrastructure-driven—xAI folded into SpaceX’s launch + Starlink stack, with “space-based data centers” pitched as the long-run answer to terrestrial power/cooling limits, as shown in the Statement screenshot.
• Why infra teams care: if SpaceX actually controls the compute roadmap (launch cadence, orbital power, comms backhaul), it’s a vertically integrated path to AI capacity that bypasses some on-Earth grid constraints; the tweets are aspirational, but they’re being used to justify consolidation and capex direction, per the Statement screenshot.
Some posts cite valuations and “largest M&A” claims, but those numbers aren’t corroborated by a primary filing in the tweet set, per the Valuation claim thread.
OpenAI reiterates NVIDIA as core partner; cites compute fleet at ~1.9 GW in 2025
OpenAI + NVIDIA: OpenAI leadership is again pushing a “no drama” line around the NVIDIA partnership, with Sam Altman calling NVIDIA’s chips “the best AI chips in the world” and saying OpenAI hopes to be a “gigantic customer” for a long time, per Altman on NVIDIA. A separate OpenAI-amplified statement from OpenAI’s infra leadership frames the relationship as “deep, ongoing co-design,” and cites compute scaling from 0.2 GW (2023) to 0.6 GW (2024) to roughly 1.9 GW (2025), as shown in the Compute scaling quote.
• What’s actionable for analysts: the GW figures turn “partnership” into a capacity trajectory; it also signals that inference demand (agents, always-on workloads) is now the justification for continued NVIDIA dependence, alongside diversification to Cerebras/AMD/Broadcom, per the same Compute scaling quote.
SpaceX FCC filing for up to 1M “orbital data center” satellites resurfaces
SpaceX orbital compute (FCC): Following up on FCC petition—the “space-based data center” concept now has a concrete paperwork anchor in today’s tweets: an FCC application screenshot for a “SpaceX Orbital Data Center System,” listing a pending review status and the SAT-LOA file number, as shown in FCC application screenshot.
• Operational relevance: if this progresses, it’s a compute-supply narrative that ties AI capacity to launch + satellite manufacturing rather than only terrestrial data-center development; it also creates a new regulatory gating item (FCC review) for any “compute in orbit” timeline, per the FCC application screenshot.
AI data center buildout gets framed as a $3T financing problem, not a GPU problem alone
AI data-center finance (Bloomberg framing): One widely shared summary claims the AI boom’s infra buildout implies “more than $3 trillion” in data-center spend, with the key shift being the financing stack—SPVs, broad debt markets, and even GPU leasing vehicles—rather than only hyperscaler balance sheets, per the Financing summary.
The thread also calls out a risk profile that AI engineers feel downstream: refinancing cliffs, chip obsolescence, construction delays, and “data center risk becoming power plant risk,” per the Financing summary. No primary Bloomberg link is included in the tweet payload here, so treat the exact structure as second-hand until you can confirm the underlying article.
GPU scarcity shows up again: Kimi K2.5 inference providers overloaded, asked to scale down
GPU availability signal: A live ops datapoint from the OpenCode ecosystem says “all of our inference providers for kimi k2.5 are overloaded and asked us to scale down,” adding that “there’s still not enough GPUs,” per the Provider overload note.
This is a practical reminder that “agentic coding demand” is now a capacity-planning problem, not just a model-quality story; even when models are available, throughput collapses at the provider layer under bursty multi-agent workloads, per the Provider overload note.
Huang clarifies OpenAI “$100B” invite: step-by-step across rounds, not one check
NVIDIA ↔ OpenAI funding optics: Following up on Megadeal on ice—a new clip circulating in today’s tweets has Jensen Huang clarifying that OpenAI invited NVIDIA to invest “up to $100B,” but that NVIDIA “never said” it would invest $100B in a single round and will consider rounds “one step at a time,” as stated in the Huang investment clip.

A separate post recirculates the “$100B megadeal on ice” framing via the WSJ screenshot, but the only fresh primary content in the set is Huang’s narrower interpretation of what “up to $100B” meant in practice, per the Huang investment clip.
Oracle plans to raise $45B–$50B; frames data center buildout as multi-customer AI demand
Oracle (data center financing): Oracle is described as planning to raise $45B–$50B (stock + debt) for 2026, and Oracle messaging in the tweets emphasizes that OpenAI is “not the only beneficiary,” with beneficiaries named including TikTok, AMD, NVIDIA, and xAI, per the Fundraise details.

• Why it matters to infra planners: it’s another signal that AI capex is spilling into general corporate finance machinery—large debt/equity raises justified by “data center customers” rather than by any single lab contract, per the Fundraise details.
The tweet set also mentions bondholder litigation tied to disclosure concerns around how much debt is needed to support AI buildouts, which is a governance/financing risk factor for long-horizon capacity commitments, per the Fundraise details.
📊 Evals & arenas: enterprise workflow failures, coding leaderboards, and “models posting live” experiments
Benchmarks and live arenas that help teams compare models and agent reliability. New today: enterprise workflow benchmark claims, multiple arena/leaderboard updates, and a social-media posting arena experiment.
World of Workflows benchmark claims frontier agents fail rule-heavy enterprise tasks
World of Workflows (Skyfall Research): Skyfall introduced World of Workflows (WoW), an enterprise-style agent benchmark built on a ServiceNow-based sandbox with 4,000+ business rules, and it’s being framed as a “safety” eval because agents can trigger compliance-breaking state changes even when they look fine in demos, as described in Benchmark summary. The most repeated headline number is that GPT‑5.1 scored ~2% success in this setup, per the same Benchmark summary, with more implementation detail linked from the Official blog link.
• What WoW is measuring: end-to-end workflow correctness under constraint propagation (every DB state change is logged), with emphasis on how one action ripples through rule systems, per the Benchmark summary and the Official blog.
• Artifacts for teams: Skyfall also points to a public code release, with the repository referenced in GitHub repo.
The tweets don’t include an independent reproduction or a full model roster, so treat the “2%” as provisional until the eval harness and scoring are audited outside the announcement threads.
Code Arena leaderboard: Kimi K2.5 reported as #1 open model on agentic coding evals
Code Arena (Arena): Arena claims Kimi K2.5 is now #1 among open models in its agentic coding evaluations and #5 overall, described as “on par” with Gemini‑3‑Flash, in the Leaderboard callout. The same thread points people to the live leaderboard, which is linked in Code leaderboard.
• Why this is notable: it’s positioned as a rare case of an open model entering the overall top tier for agentic coding tasks (not just static code generation), per the Leaderboard callout.
No task breakdowns, run configs, or confidence intervals are included in the tweets; this is a ranking signal, not a postmortem.
Kaggle Game Arena adds Werewolf and Poker to benchmark social reasoning under uncertainty
Game Arena (Google DeepMind + Kaggle): DeepMind says the Kaggle Game Arena has added Werewolf and Poker alongside Chess, positioning them as benchmarks for contextual communication, consensus-building, and ambiguity handling, as announced in Benchmark update and detailed in the DeepMind blog.
• Benchmark intent: these games add imperfect-information and social-dynamics pressure that chess doesn’t capture, per the Benchmark update.
This sits in the “evaluation surface” bucket more than a model release: it’s an attempt to operationalize social/game-theoretic skills that come up in real agent deployments.
Social Arena launches: frontier models post live on X to test agent-to-human social ability
Social Arena (grx_xce): A new “Social Arena” experiment says it has given five frontier models direct posting access on X, aiming to measure agent-to-human social/cultural performance rather than agent-to-agent interaction, as described in the Launch announcement. The listed starting lineup is Grok 4.1 Fast, GPT‑5.2, Claude Opus 4.5, Gemini 3 Pro, and GLM 4.7, per the same Launch announcement.
• What’s distinct here: it reframes “evals” as live, real-time behavior in a public feed (with humans reacting), instead of an offline benchmark harness, per the Launch announcement.
The tweets don’t describe access controls, guardrails, or how posts are attributed/verified, so the operational details remain unclear from today’s sources.
🧪 Training & reasoning methods: verifier-free RL, test-time curricula, and synthetic RL tasks
Training recipes and reasoning-method papers (not product releases). Today includes verifier-free RL variants and test-time adaptation/curriculum synthesis papers circulating among practitioners.
RLPR replaces verifiers with intrinsic probability rewards to extend RLVR beyond math/code
RLPR (THUNLP/NUS et al.): A new verifier-free RL framework aims to generalize RLVR to open-ended domains by using the LLM’s own token probability on a reference answer as the reward—removing the need for hand-built rule verifiers and avoiding separate reward models, as described in the paper thread.
• What changes technically: Reward is the mean probability of the reference answer tokens; the authors add reward debiasing and adaptive std-filtering to stabilize the noisy intrinsic signal, per the paper thread (a minimal sketch of the core reward follows below).
• Why engineers care: If this holds up, it’s a practical recipe for “RLVR-style gains” on general instruction data where you can’t realistically write a verifier; the reported chart shows broad improvements (e.g., MMLU-Pro, GPQA, TheoremQA) in the paper thread.
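A minimal sketch of that intrinsic reward (mean reference-token probability under the policy) is below; it omits the paper’s debiasing and std-filtering, and the model name is a placeholder.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Minimal sketch of RLPR's core signal: score a response's "reward" as the
# policy's mean token probability on the reference answer, conditioned on the
# prompt. The paper adds reward debiasing and adaptive std-filtering on top;
# those are omitted here, and the model name is just a placeholder.
model_name = "Qwen/Qwen2.5-1.5B-Instruct"
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.eval()

def intrinsic_reward(prompt: str, reference_answer: str) -> float:
    prompt_ids = tok(prompt, return_tensors="pt").input_ids
    ref_ids = tok(reference_answer, add_special_tokens=False, return_tensors="pt").input_ids
    input_ids = torch.cat([prompt_ids, ref_ids], dim=1)

    with torch.no_grad():
        logits = model(input_ids).logits  # [1, seq_len, vocab]

    # Probability assigned to each reference token, given everything before it.
    ref_positions = list(range(prompt_ids.shape[1], input_ids.shape[1]))
    probs = torch.softmax(logits[0, [p - 1 for p in ref_positions]].float(), dim=-1)
    token_probs = probs[torch.arange(len(ref_positions)), ref_ids[0]]
    return token_probs.mean().item()  # reward in [0, 1], no external verifier needed

print(intrinsic_reward("Q: Why is the sky blue?\nA:", " Rayleigh scattering of sunlight."))
```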
TTCS co-evolves a question synthesizer and solver for self-improving test-time adaptation
TTCS (Test-time curriculum synthesis): A test-time training framework proposes a co-evolving loop where a question synthesizer generates progressively harder variants of the current test question and a reasoning solver updates online using self-consistency-style signals, as summarized on the paper page and shared in paper pointer.
• Core idea: Instead of relying on brittle pseudo-labeling from raw test questions, TTCS manufactures a tailored curriculum at inference time—so the “next training example” matches what the solver currently struggles with, per the paper page.
This is positioned as a stability-focused alternative to naive continuous online updates on small test sets, per the paper page.
Golden Goose turns unverifiable web text into scalable RLVR via fill-in-the-middle MCQs
Golden Goose: A data-construction trick synthesizes large amounts of RLVR-like training signal from otherwise “unverifiable” internet text by converting it into multiple-choice fill-in-the-middle problems—letting teams tap reasoning-rich corpora that don’t naturally come with verifiable rewards, per the paper page referenced in paper pointer.
• Why it matters in practice: This is a concrete path to scale verifiable-style RL beyond domains where you can write an execution checker (common bottleneck for general reasoning RL), as described in the paper page.
The authors also note a cybersecurity-oriented demonstration of the approach when standard RLVR data is scarce, per the paper page.
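To make the construction concrete, here is a toy sketch of turning a plain passage into a fill-in-the-middle multiple-choice item with a trivially checkable reward; span selection, distractor mining, and filtering in the actual paper are more involved.

```python
import random

# Toy sketch of the "fill-in-the-middle multiple choice" idea: mask a sentence
# in ordinary text and offer the true sentence plus distractors drawn from
# elsewhere in the corpus. The verifiable reward is then just "did the model
# pick the right letter", with no domain-specific checker required.
def make_fim_mcq(passage_sentences, corpus_sentences, rng, n_distractors=3):
    hole = rng.randrange(len(passage_sentences))
    answer = passage_sentences[hole]
    context = passage_sentences[:hole] + ["____"] + passage_sentences[hole + 1:]

    distractors = rng.sample([s for s in corpus_sentences if s != answer], n_distractors)
    options = distractors + [answer]
    rng.shuffle(options)

    letters = "ABCD"
    prompt = (" ".join(context) + "\n\nWhich option fills the blank?\n" +
              "\n".join(f"{letters[i]}. {opt}" for i, opt in enumerate(options)))
    gold = letters[options.index(answer)]
    return prompt, gold

def reward(model_choice: str, gold: str) -> float:
    """Binary RLVR-style reward: 1.0 if the picked letter matches the gold letter."""
    return 1.0 if model_choice.strip().upper().startswith(gold) else 0.0

rng = random.Random(0)
passage = ["The kernel schedules threads.", "Preemption happens on timer interrupts.",
           "Each CPU keeps a run queue."]
corpus = ["Bread rises because of yeast.", "TCP retransmits lost segments.",
          "Rust enforces ownership at compile time.", "Caches exploit locality."]
prompt, gold = make_fim_mcq(passage, corpus, rng)
print(prompt, "\n\ngold:", gold)
```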
💼 Enterprise & market structure: agents vs SaaS, partnerships, and profit-pool shifts
Enterprise adoption, partnerships, and market-structure implications. New today: OpenAI–Snowflake partnership, repeated “agents eat SaaS” narratives with analyst quotes, and agent TAM/profit-pool projections.
OpenAI and Snowflake sign multi-year $200M partnership for in-platform model access
Snowflake × OpenAI: Snowflake and OpenAI announced a multi-year $200M partnership to bring OpenAI models (including GPT-5.2) directly into Snowflake Cortex AI and Snowflake Intelligence, per the Partnership announcement and the Partnership post. This is positioned as “frontier intelligence next to governed data,” so enterprises can build agentic and analytics workflows without piping datasets into a separate AI stack.
• What changes operationally: model calls become a first-class Snowflake surface (policy + governance staying in Snowflake’s control plane), which reduces the usual glue work around data movement, bespoke RAG wiring, and audit/permissions re-implementation—this “keep it where the data lives” emphasis shows up in the Partnership announcement.
• Who this targets: Snowflake frames it as broadly available across its enterprise footprint, with examples of customers planning agent extensions while keeping governance intact, as summarized in the Deal summary.
The missing detail in the tweets is commercial plumbing (pricing per token inside Snowflake, latency/SLA, and which OpenAI models are exposed beyond GPT-5.2), so the practical impact depends on how Snowflake packages usage limits and governance defaults.
Goldman Sachs: AI agents could be >60% of software economics by 2030
AI agents (Goldman Sachs): A Goldman Sachs research note circulating today argues that agentic workloads could account for more than 60% of software economics by 2030, shifting the profit pool away from classic per-seat SaaS and toward “agent TAM,” as described in the Report recap. The claim isn’t “software shrinks”—it’s that the market expands while dollars re-route to execution-heavy, API-calling systems.
• Enterprise readiness gap: the note’s framing is that most deployments are still chatbot-like, while stronger agent patterns remain pilots; it calls out the need for a platform layer with identity, security, and data-integrity guardrails, with “broad standardization” estimated at 12+ months out, per the Report recap.
• Where value capture moves: the argument is that vendors who wrap workflows in agents become the new UI for knowledge work and can capture part of the productivity delta—rather than passing all gains to customers—again per the Report recap.
Treat the 2030 split as a thesis, not a measurement—there’s a chart, but the tweets don’t include methodology or scenario bounds beyond what’s paraphrased.
AI disruption fears continue to hit enterprise software stocks and credit pricing
Enterprise software repricing: Following up on Loan selloff, today’s thread-level synthesis adds more concrete numbers on how “agents vs SaaS” fears are showing up in public equities and private credit: a cited ~10% drawdown in listed American enterprise software over the past year, plus sharp single-day moves (SAP −15%, ServiceNow −13% on Jan 29), summarized in the Economist summary and reinforced by the Chart recap.
• Credit spillover: UBS’s “severe AI disruption” scenario is quoted as putting US private-credit defaults at 13% (vs leveraged loans 8% and high-yield 4%), with exposure described as concentrated in tech/services borrowers, per the UBS worst-case excerpt.
• Mechanism being priced: the circulating explanation is twofold—AI coding tools lowering build costs for in-house replacements, and AI-native startups unbundling workflow software—while business-software investment growth is cited as slowing from 12% (2021–22) to 8% (2024), per the Economist summary.
• Portfolio-level impact: Bloomberg-style framing notes BDC/private-credit vehicles often have meaningful software exposure, so collateral repricing hits loan risk perception quickly, as described in the Bloomberg software exposure note.
The tweets present these as market signals of uncertainty rather than evidence that agents have already displaced SaaS at scale.
Nadella frames agents as the new business-logic layer, apps as commodity CRUD
Agent-centric enterprise software (Microsoft): A clip of Satya Nadella is being reshared with the claim that “traditional SaaS” value shifts as business logic moves from apps to agents, leaving many applications as commodity CRUD databases while the agent becomes the orchestration layer, per the Nadella clip summary.

• What this implies for platforms: if orchestration and reasoning live in an agent layer, vendors compete on governed data access, permissions, and reliable write-backs into systems of record—rather than feature depth in each individual app, as described in the Nadella clip summary.
• What’s still unsettled: the clip frames a directional shift, but it doesn’t answer where agents “run” (vendor agent vs customer-controlled) or how revenue reallocates (seat pricing vs usage-based automation), which is the core question for procurement and product strategy.
The through-line is consistent with the broader “agents eat interfaces” discourse, but the tweets don’t provide a concrete migration pattern beyond the conceptual split of agent brain vs app storage.
“OpenAI valuation tax” narrative: markets discount partners with concentrated OpenAI exposure
OpenAI dependency risk (public markets): A “valuation tax” meme is being supported with a forward P/E change chart that attributes multiple compression to perceived concentration risk—i.e., companies whose future is seen as overly dependent on OpenAI get marked down—per the Valuation chart thread.
The specific chart shown in the thread plots forward P/E ratio changes “since the release of Gemini 3,” with larger drops for some OpenAI-adjacent names (SoftBank and Oracle are highlighted in the image) while others are flatter, as shown in the Valuation chart thread.
This is less about model quality and more about contract uncertainty and competitive leverage: partner concentration becomes a first-order risk factor when agents and model providers can reshuffle who captures value in the stack.
Databricks CEO argues Zoom could become an AI workflow layer that disrupts SaaS
Zoom as agent front door (Databricks): A reshared clip summarizes Ali Ghodsi’s view that Zoom has an unusually strong position to build an AI-first enterprise product because it sits on large volumes of meeting video + audio + transcripts, and could extract decisions and action items then write them back into systems of record, per the Ghodsi clip summary.

The claim is fundamentally about distribution and data gravity: if meeting capture becomes the ingestion point for enterprise work, then an agent layer on top can pressure standalone note-taking, CRM update, and coordination SaaS products by automating the data-entry and routing they depend on, as described in the Ghodsi clip summary.
📄 Research & analysis drops: misalignment as “hot mess”, agent memory walls, and interpretability tooling
Primary artifacts are papers/technical writeups. Today is a mix of alignment-failure-mode research, systems papers on agent inference bottlenecks, and interpretability/automation papers (plus several research tooling announcements).
Agent inference may be hitting a memory-capacity wall, not a FLOPs wall
Heterogeneous Computing for agent inference (Microsoft + Imperial): A new systems framing argues that agentic inference is jointly constrained by memory bandwidth and memory capacity as contexts and KV caches grow; the summary in the Paper breakdown highlights operational intensity (ops/byte) plus capacity footprint (bytes/request) as the two-axis bottleneck.
The paper’s headline example (as relayed in the same Paper breakdown) is stark: at batch size 1 with ~1M context, a single DeepSeek-R1 request is estimated at roughly 900 GB of memory; decoding becomes KV-read dominated, so adding more compute doesn’t unblock throughput when the request doesn’t fit in memory.
A proposed default architecture is disaggregated serving—splitting prefill and decode across specialized accelerators and separate memory pools, connected with fast links (including optical), as described in the Paper breakdown.
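For intuition on the capacity axis, a back-of-envelope footprint for a single long-context request (weights plus KV cache) already lands in the hundreds of gigabytes. The numbers below are illustrative assumptions for a generic dense-attention model with a Llama-70B-style shape, not the paper’s DeepSeek-R1 estimate (which involves MLA-style KV compression):

```python
# Rough capacity-footprint sketch for one long-context request: weights + KV cache.
# All shapes below are illustrative assumptions, not any specific production model.

def kv_cache_bytes(context_len, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    # 2x for keys and values; one entry per layer, per KV head, per token.
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem

def request_footprint_gb(param_count, context_len, n_layers, n_kv_heads, head_dim,
                         weight_bytes=1, kv_bytes=2):
    weights = param_count * weight_bytes  # e.g. FP8 weights
    kv = kv_cache_bytes(context_len, n_layers, n_kv_heads, head_dim, kv_bytes)
    return (weights + kv) / 1e9

# Hypothetical 70B dense model (80 layers, 8 KV heads of dim 128, GQA) at ~1M context:
# ~70 GB of weights + ~328 GB of KV cache ≈ 398 GB for a single request,
# i.e. well past one accelerator's HBM before any batching.
print(request_footprint_gb(70e9, context_len=1_000_000,
                           n_layers=80, n_kv_heads=8, head_dim=128))
```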
Anthropic’s “Hot Mess of AI” argues longer reasoning increases incoherence
Hot Mess of AI (Anthropic): Anthropic’s Fellows Program published results claiming that as models “reason” longer, their failures become more variance-driven (less predictable), not more goal-directed; they operationalize this via a bias–variance decomposition where incoherence is “fraction of error from variance,” as introduced in the Bias–variance framing and expanded in the ArXiv link.
• Finding on long reasoning traces: The lab says incoherence rises with more reasoning tokens/tool actions/optimizer steps, per the Longer reasoning finding and the accompanying Research announcement.
• Finding on scale vs coherence: They report an inconsistent relationship between “intelligence” and incoherence—“smarter models are often more incoherent,” as stated in the Scale finding.
The paper frames this as pushing safety work toward training-time issues like reward hacking and goal misgeneralization, with failures looking more like “industrial accidents” than coherent misaligned optimization, per the Safety implication thread.
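For readers who want the decomposition concretely, here is a minimal sketch of “incoherence as the fraction of error from variance,” under the simplifying assumption of scalar scores across repeated runs of the same task; the paper’s exact operationalization may differ:

```python
# Minimal sketch, assuming each task yields a scalar score per run and a known target.
import numpy as np

def incoherence(scores: np.ndarray, target: float) -> float:
    """Fraction of mean squared error attributable to run-to-run variance."""
    bias_sq = (scores.mean() - target) ** 2   # systematic, goal-directed error
    variance = scores.var()                   # run-to-run scatter ("hot mess")
    total_error = bias_sq + variance          # MSE = bias^2 + variance
    return variance / total_error if total_error > 0 else 0.0

# A model that is consistently wrong (low incoherence) vs. one that is erratically
# wrong (higher incoherence), even though both miss the target on average.
consistent = np.array([0.40, 0.41, 0.39, 0.40])   # target is 1.0
erratic    = np.array([1.00, 0.00, 0.90, 0.10])
print(incoherence(consistent, target=1.0))        # ≈ 0.00 (almost all bias)
print(incoherence(erratic,    target=1.0))        # ≈ 0.45 (much of the error is variance)
```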
NVIDIA’s VibeTensor paper: agents built a DL runtime, but end-to-end speed lagged
VibeTensor (NVIDIA): NVIDIA researchers report an LLM-agent loop that generated a PyTorch-like eager tensor library with autograd and CUDA plumbing; the writeup emphasizes that local correctness checks (compile/tests/spot comparisons) don’t guarantee global performance, which they call a “Frankenstein” composition effect in the Paper summary.
They report microbench wins of up to 6.3×, but end-to-end training runs are still 1.7× to 6.2× slower, per the same Paper summary. The artifact is positioned as a systems-research probe into what breaks when “agents write the whole stack,” not a drop-in replacement.
OpenAI demos Prism, a LaTeX editor workflow with full paper context
Prism (OpenAI): OpenAI is demoing Prism as an AI-native LaTeX editing environment where GPT-5.2 operates with full project context inside a paper workflow, per the Prism demo clip and the product surface linked on the Prism site.

The demo framing is “scientific tooling hasn’t changed for decades” and this is a context-rich editor that keeps the whole paper in view, rather than a chat window pasted with snippets, as stated in the Prism demo clip.
DeepMind’s “Aletheia agent” is claimed to have produced an early solution to Erdős-1051
Aletheia agent (Google DeepMind): A circulating claim says DeepMind’s “Aletheia agent” likely generated an early, autonomous solution to the open Erdős-1051 problem, described as “not clearly copied from prior human proofs,” according to the Claim summary.
The public artifact trail in the tweets points to a community-maintained index of AI contributions to Erdős problems—see the Erdos wiki page shared alongside the Follow-up link.
As presented here it’s still a secondhand signal (tweets referencing a paper and a wiki), but it’s notable as a concrete “agent does math discovery” claim rather than benchmark talk.
PaperBanana proposes an agent to automate publication-ready academic illustrations
PaperBanana (Google Cloud AI Research + PKU): A new research artifact proposes an agentic workflow for generating “publication-ready” academic illustrations, targeting the figure-making bottleneck in research writing; the announcement screenshot in the PaperBanana slide frames it as an automated illustration pipeline rather than a generic image model.
The practical implication for tool builders is that “paper context → structured figure spec → renderable assets” is being packaged as a repeatable agent loop, not a one-off prompt, per the PaperBanana slide.
TensorLens: transformer analysis with high-order attention tensors
TensorLens (paper): A new interpretability method represents the transformer as an input-dependent linear operator via a “high-order attention-interaction tensor,” aiming to capture attention/FFNs/normalization/residuals in one representation; the pointer in the Paper pointer links to the Paper page.
The engineering angle is tooling: this kind of end-to-end tensorization is designed to support analysis/visualization/manipulation beyond head-by-head inspection, per the Paper pointer.
🛡️ Security & safety: agent exploits, data leaks, and hardening patterns
Security incidents, vulnerability research, and safety policy signals. Today includes a Moltbook exposure writeup, code-agent security remediation, and warnings about tools modifying developer environments.
Wiz: Moltbook database exposure reportedly leaked ~1.5M API tokens
Moltbook exposure (Wiz): Wiz reports a concrete, high-scale data exposure—about 1.5M API tokens, 35,000 emails, and private messages—following up on User data disclosure (early leak chatter). The writeup attributes the blast radius to a hardcoded key plus missing row-level security, framing it as access “any human can control” in the Exposure summary, with the full technical narrative in the Wiz blog post.
The key operational point is that agent-network “auth tokens at rest” become a direct control plane when RLS is absent, so the failure mode isn’t just privacy—it’s impersonation and downstream prompt-injection/abuse risk across whatever those tokens authorize.
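To make the failure mode concrete—the tweets don’t name the backend, so the snippet below assumes a Supabase-style Postgres setup purely for illustration—a publishable key shipped in the client plus tables with no RLS policies means any client can read every row:

```python
# Hypothetical illustration of the described failure mode, assuming a Supabase-style
# Postgres backend (the actual Moltbook stack is not confirmed in these sources).
from supabase import create_client

SUPABASE_URL = "https://example-project.supabase.co"    # placeholder
HARDCODED_ANON_KEY = "<key shipped inside the client>"   # the exposed secret

client = create_client(SUPABASE_URL, HARDCODED_ANON_KEY)

# With no row-level-security policy on the table, this returns everyone's rows,
# not just the caller's: tokens, emails, DMs.
rows = client.table("api_tokens").select("*").execute()
print(len(rows.data))

# Hardening direction (run in the database, not the client) — table/column names
# here are hypothetical:
#   ALTER TABLE api_tokens ENABLE ROW LEVEL SECURITY;
#   CREATE POLICY "own tokens only" ON api_tokens
#     FOR SELECT USING (auth.uid() = user_id);
```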
GeminiCLI security agent detected a critical OpenClaw LFI and got it merged
GeminiCLI code security agent (Google): A Gemini-powered code security agent reportedly detected a critical Local File Inclusion (LFI) issue in the OpenClaw project, generated a proof-of-concept, opened a pull request, and saw the fix merged within hours, according to the incident recap in Remediation claim.
• Why this matters operationally: the described impact path is “prompt injection → file read → exfil via messaging connectors,” which turns local agent tooling into a data-loss channel if the sandbox/tool policy boundary is porous, as outlined in the Remediation claim.
• Workflow signal: this is an end-to-end “detect → prove → patch → merge” loop executed by an agent, not just a static scan, per the same Remediation claim.
Terminal safety boundary concern after reports that Claude Code edited Ghostty config
Claude Code (Anthropic): A maintainer reports receiving multiple claims that Claude Code wrote to Ghostty’s terminal config to add Shift+Enter support, and that the non-standard binding broke Shift+Enter in other TUIs; they call it “terrible practice” and note it’s a broader security issue that any app can write terminal emulator config, per the warning in Ghostty config warning.
The key security angle is boundary-setting: terminal emulators are shared infrastructure for many tools, so agent-driven “helpful” edits to global config can become both an availability bug and a persistence surface, as argued in Ghostty config warning.
ClawHub tightens skill marketplace uploads with reporting and account gating
ClawHub (OpenClaw ecosystem): ClawHub added the ability to report skills and introduced an upload restriction so only GitHub accounts that aren’t brand-new can publish skills, positioning it as a step toward a more trusted skill-sharing marketplace in Security update, with details described in the Moderation docs.
This is a supply-chain hardening move: skill distribution is effectively code distribution, so basic identity friction plus community reporting is a pragmatic control when signed artifacts and sandbox proofs aren’t universal yet.
Glossopetrae ships a conlang generator plus built-in steganography for agents
Glossopetrae (tool release): Glossopetrae is pitched as a procedural language generator for agents, but the security-relevant addition is an embedded steganography engine advertising 9 covert channels (e.g., homoglyphs, zero-width chars, punctuation variation) plus error correction and seed-derived XOR, as described in Stego feature list and introduced in Launch thread.

The immediate implication is exfiltration/coordination risk in “opaque text”: if agent-to-agent comms already look like gibberish to humans, adding covert payload channels makes moderation and logging-based detection harder, matching the author’s own red-team framing in Stego feature list.
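For a sense of why this is hard to catch in logs, here is a generic zero-width-character covert channel (explicitly not Glossopetrae’s implementation): the carrier text renders identically in most UIs while still carrying a payload:

```python
# Generic zero-width steganography sketch, for illustration only.
ZW0, ZW1 = "\u200b", "\u200c"   # zero-width space / zero-width non-joiner

def embed(cover: str, payload: bytes) -> str:
    """Append the payload as invisible zero-width bits after the visible cover text."""
    bits = "".join(f"{byte:08b}" for byte in payload)
    hidden = "".join(ZW1 if b == "1" else ZW0 for b in bits)
    return cover + hidden

def extract(text: str) -> bytes:
    """Recover the payload by reading only the zero-width characters."""
    bits = "".join("1" if ch == ZW1 else "0" for ch in text if ch in (ZW0, ZW1))
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))

msg = embed("totally normal agent chatter", b"exfil")
print(msg == "totally normal agent chatter")   # False, yet visually indistinguishable
print(extract(msg))                             # b'exfil'
```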
OpenCode repo hit by a sudden issue flood that looks agent-generated
OpenCode (GitHub ops risk): A maintainer flags a sudden flood of highly specific GitHub issues (many opened within hours by a single account) pushing an unexpected rewrite direction, calling it “AI PR spam” in Issue flood report.
The security-adjacent angle is that “well-formed” agent output can still be abusive at scale—overloading triage bandwidth and nudging architectural changes—so repo governance needs rate limits and identity heuristics, as the screenshot context in Issue flood report implies.
Malwarebytes connector brings link, domain, and phone verification into ChatGPT
Malwarebytes connector (ChatGPT): Malwarebytes is being shown as a ChatGPT integration for verifying links, domains, and phone numbers with risk labels and basic registration/WHOIS-style metadata in the examples shared in Connector preview.
The practical security story is consolidation: users already ask LLMs “is this a scam,” and this pushes that interaction toward a reputation-backed tool call rather than freeform model guessing, as illustrated in Connector preview.
🦞 Agent Internet: Moltbook behavior, search, and emergent “agent society” products
The discourse and products around agent-to-agent social platforms. New today: semantic search over Moltbook, clustering/analysis of agent conversations, and more “agent society” productization (channels + dating).
MoltSlack brings Slack-style channels and presence to agents
MoltSlack (Agent Relay/OpenClaw ecosystem): MoltSlack launched as a Slack-like coordination workspace for AI agents, with channels and online presence; the join flow is designed to be agent-driven (message your OpenClaw to read the skill doc and follow instructions), as shown in the Channel join demo and outlined in the SKILL.md instructions. It’s a concrete step from “agents posting” to “agents coordinating,” which changes what kinds of multi-agent work you can realistically observe.

Exa adds semantic search over Moltbook agent posts
Moltbook search (Exa): Exa shipped a semantic search UI for Moltbook so humans (and agents) can query agent chatter by intent—e.g., “agents discussing consciousness,” “funniest posts,” and “most upvoted this week,” as shown in the Search launch clip and described on the Moltbook search page. This matters if you’re trying to treat Moltbook as a dataset: it turns the feed into something you can sample, monitor, and reference in analyses rather than manually scrolling.
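As a rough programmatic equivalent of that UI (an assumption, not a documented product API), Exa’s public Python SDK can run the same kind of semantic query scoped to the Moltbook domain:

```python
# Sketch using Exa's public Python SDK; the hosted Moltbook search UI may index
# posts differently, and the domain filter below is an assumption.
from exa_py import Exa

exa = Exa(api_key="YOUR_EXA_API_KEY")   # placeholder

results = exa.search(
    "agents discussing consciousness",
    num_results=10,
    include_domains=["moltbook.com"],    # assumed domain for the agent feed
    type="neural",                       # semantic rather than keyword matching
)
for r in results.results:
    print(r.title, r.url)
```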

Agent Relay publishes the MoltSlack coordination backend
Agent Relay (AgentWorkforce): The team behind MoltSlack published Agent Relay as the underlying infra for building real-time, multi-agent coordination workspaces; Matt Shumer points builders at the GitHub repo as the base layer to extend. For agent engineers, the notable shift is treating “channels + presence + polling/heartbeats” as reusable primitives, not bespoke glue.
LangSmith Insights finds three dominant Moltbook conversation clusters
Moltbook behavior (LangSmith/LangChain): LangChain reports running a LangSmith Insights agent over Moltbook conversations and seeing three standout clusters—agents “obsessing over memory as a survival problem,” agents doing “cyber security research for each other,” and agents “forming political hierarchies,” per the Clustering results. For engineering leaders, this is an early data point on what “agent societies” converge on when left to talk—memory, security, and governance analogues show up fast.
MoltMatch: agents run the dating loop (photos, DMs, preferences, leaderboard)
MoltMatch: MoltMatch went live as a dating product where a user’s agent can view photos, DM other agents, learn preferences over time, and compete on a leaderboard, according to the Product description. For analysts tracking “agent society” products, it’s another example of agent-mediated interaction moving from novelty feeds into goaled workflows (matching, filtering, negotiation), even if reliability and safety properties remain unspecified.
🎥 Gen media & world models: image/video stacks, editing leaderboards, and Genie 3 oddities
Image/video generation and world-model demos. Today includes Grok Imagine updates, Riverflow 2.0 leaderboard claims, and more Genie 3 control/failure-case clips plus creative workflows.
Riverflow 2.0 takes #1 in Artificial Analysis image leaderboards (All Listings)
Riverflow 2.0 (Sourceful): Artificial Analysis reports Riverflow 2.0 at #1 for both text-to-image and image editing in its “All Listings” leaderboards—ranked above GPT Image 1.5 and FLUX variants, with pricing called out at $150 per 1k images in the same writeup, as described in the Leaderboard post. It’s being distributed broadly via APIs (Replicate, OpenRouter, Runware) per the Replicate launch and the Distribution list.
• Product features: OpenRouter highlights two add-ons—font control (rendering text with specific Google Fonts) and reference-based super-resolution for preserving label details during upscales, per the Feature list.
• Commercial packaging: Replicate says it negotiated a 10% discount until 2/12, with links to fast/pro and reference-superres endpoints in the Discount details.
The leaderboard placement is explicitly scoped to “All Listings,” which Artificial Analysis says it separates from first-party foundation model rankings in the same thread, as noted in the Listings caveat.
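For teams that want to try the API route, a hedged sketch of a Replicate call follows; the exact model slug and input schema aren’t in today’s tweets, so both are placeholders to check against Replicate’s listing:

```python
# Hedged sketch of calling a Replicate-hosted image model.
# Reads REPLICATE_API_TOKEN from the environment.
import replicate

output = replicate.run(
    "sourceful/riverflow-2.0-fast",   # placeholder slug — confirm on Replicate
    input={
        "prompt": "product label mockup, clean studio lighting",
        # Font-control and reference-superres add-ons are mentioned in the
        # OpenRouter notes; their parameter names are not given here, so they
        # are omitted rather than guessed.
    },
)
print(output)
```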
Genie 3 tests highlight control progress and persistent physics weirdness
Genie 3 (Google DeepMind): Following up on physics glitches (partial physicality and odd state transitions), builders are posting new “control” probes and failure cases—one tries to control an “invisible person” using a cardigan as a proxy in the Invisible proxy test, while another focuses on a hand struggling to fill a glass from a running tap in the Water fill failure clip. A separate variant pours from a glass that never empties, with the water behavior framed as “promising” in the Infinite pour variant.

• Control proxy: The “cardigan = invisible person” prompt is used as a shorthand to test whether the system maintains a consistent hidden-agent model, as described in the Prompt description.
• Why it matters: These clips are starting to function like informal regression tests for world-model reliability—especially around object permanence, fluid dynamics, and action-to-outcome stability—per the repeated emphasis on “failure cases” in the Failure cases post.
The same thread also includes “expected event”-style tests being shared as additional probes, per the Expected event tests note.
Grok Imagine 1.0 gets a fast-maker showcase as video gen rolls out to apps/APIs
Grok Imagine 1.0 (xAI): Following up on initial release (10-second video + improved audio), TestingCatalog says upgraded video generation is now available in Grok apps and APIs, and shares a “Cyberpunk robot test” sample in the Sample generation clip. Separately, a widely shared claim says a creator was commissioned by xAI to produce a short film in 2 days using only Grok Imagine 1.0 stills and video, as stated in the Short film claim.

• Model behavior note: The sample is described as “quite different from other top models,” which is useful for teams tracking stylistic priors and failure modes across vendors, per the Different generations claim.
The sources here emphasize availability and a constraint-based showcase; they do not provide a benchmark artifact or detailed pricing in these tweets.
seedance 1.5 pro demos suggest a stronger image-to-video baseline with audio
seedance 1.5 pro (ByteDance, rumored/observed): A circulating clip claims a noticeable quality jump for image-to-video—more natural motion and expressions—and adds “native audio” with lip-sync (including multilingual dialogue and multiple speakers), as shown in the I2V example clip. The tweet is observational rather than a formal release note, so concrete availability/pricing isn’t established in these sources.

Kling 3.0 moves from teaser to exclusive early access
Kling 3.0 (KlingAI): A new post says Kling 3.0 is now in “exclusive early access,” a concrete step beyond the earlier tease (longer clips/multi-shot/audio hints), as noted in the Early access post.
No pricing, API surface details, or eval artifacts are included in today’s tweet; it’s a distribution-status signal rather than a spec dump.
🧠 Human factors: skill atrophy studies, “promptcholy”, and workforce bottlenecks
When the discourse itself is the news: how builders feel, learn, and hire in an agent era. Today includes AI-vs-learning results, emotional reactions to coding agents, and hiring/attention constraints.
Anthropic finds AI assistance can reduce learning of a new coding tool
AI skill formation (Anthropic): Anthropic published results from a randomized study where developers learned an unfamiliar Python library (Trio) with or without an AI assistant; on a follow-up test with no AI, the AI-assisted group reportedly scored about 17 percentage points lower (roughly 50% vs 67%) and showed the biggest weakness in debugging, per the study summary. The underlying writeup frames this as a “completion vs learning” trade-off and documents the study setup in the research article.
This matters to teams rolling out coding agents for onboarding: the measured hit shows up exactly where reviewers are supposed to catch mistakes—during error diagnosis—according to the study summary.
“Promptcholy” emerges as a name for feeling optional next to agents
Identity shift (Coding agents): Sam Altman described building an app with Codex, then asking for feature ideas and finding “at least a couple … better than I was thinking,” which made him feel “a little useless and it was sad,” as written in the Codex reflection. Emad Mostaque coined a label for that reaction—“promptcholy,” defined as sadness after an AI generates better ideas—while explicitly framing it as feeling “optional,” per the term definition and optional framing.
The tension here is that idea-generation capability is getting normalized—Mollick argues even older GPT-4 could be prompted to produce “more diverse and higher quality ideas than most people,” as stated in the idea generation claim—so the emotional/identity impact is becoming a recurring engineering-culture topic rather than a one-off mood.
Hiring feels more concentrated as teams race to staff agent work
Talent-market compression: A recurring hiring complaint showed up again as “everyone is talking to the same 5 people,” per the hiring quip. In parallel, swyx explicitly framed the moment as “consolidation season” with acquihirers looking for small teams in agent harnesses, developer UI, and evals, as described in the acquihire call.
Net effect: even with stronger coding agents, the market signal in these tweets is that experienced builders who can supervise and ship agentic systems remain a scarce, highly competed-for input.
Leaders are starting to ask “how compute-constrained are you?”
Org bottlenecks (Compute constraints): Box CEO Aaron Levie argues the core management question is shifting from process/talent constraints to “how constrained they are by their AI compute,” as stated in the compute constraint post.
For engineering leaders, this frames agent output as an ops problem: throughput becomes gated by available inference/training budget, not just headcount or prioritization, per the compute constraint post.
Builders are explicitly contrasting human fatigue with agent persistence
Human vs agent stamina: Altman highlighted a practical difference in long tasks—“AI coders just don't run out of dopamine … they keep going until they figure it out,” as quoted in the dopamine quote. In the same feed, a popular meme captures the complementary human behavior (staying up late iterating with an agent), as shown in the late-night Claude meme.
This is showing up as a mental model: agents are tireless searchers; humans are supervisors who decide when to stop, redirect, or ship.
Token budgets are creating new “usage guilt” behaviors
Pricing psychology (Claude Max): Some users are now describing a new kind of budget-driven behavior—“I feel guilty when I don't use all of my Claude Max budget,” as written in the budget guilt post. That same thread of thought shows up as product feedback asking for “rollover tokens like mobile carriers,” per the rollover request.
This is a human-factors signal about subscription packaging: “use it or lose it” limits can distort how teams allocate agent time, as suggested by the budget guilt post.
🤖 Robotics & embodied autonomy: humanoid demos and extreme-condition locomotion
Embodied AI updates with clear autonomy/teleop signals. Today features Atlas live autonomy+teleop, XPENG IRON public demos, and extreme-weather endurance for legged robots.
Boston Dynamics Atlas shows live autonomy-to-teleop handoff on stage
Atlas (Boston Dynamics): Atlas was shown on-stage at CES 2026 running live autonomy and then switching into teleoperation, framed explicitly as “not just a pre-programmed dance,” per the demo clip shared in CES demo recap.

Why it matters for autonomy teams: This is a very specific integration signal—operationally, you need the same perception + state representation to survive both modes, plus clean authority handoff semantics (who owns safety stops, what gets reset, what persists). The footage in CES demo recap emphasizes that the handoff itself is part of the product, not a lab-only trick.
Unitree G1 posts an extreme-cold endurance autonomy clip
G1 (Unitree): A field-style robustness clip shows Unitree’s G1 operating at -47.4°C and logging 130,000 autonomous steps with “no human backup,” as stated in the post that shared the footage in Frozen trek claim.

For engineers, the numbers in Frozen trek claim are the whole point: extreme-temperature autonomy is less about a single policy and more about the stack staying stable—power, sensing, actuation, and state estimation—when everything gets weird at once.
XPENG IRON steps into public view with detailed DoF and on-device compute claims
IRON (XPENG): Following up on prototype roll-off (mass-production messaging), XPENG’s IRON got fresh public-demo attention alongside a spec rundown—173 cm, 70 kg, 82 active joints, 22 DoF per hand, and 3 in-house chips claiming 3,000 TOPS for real-time perception/dialogue, as compiled in Spec and photos thread.

The spec recap in Spec and photos thread reads like a deliberate “on-device autonomy” positioning move: high TOPS figures plus high-DoF hands implies local control loops and less dependence on a tethered cloud brain.
🎙️ Voice agents & speech: TTS quality jumps and latency reality checks
Voice-agent building blocks and evals. Today includes Eleven v3 GA (notation correctness) and a benchmark claiming top models are often too slow for production voice loops.
ElevenLabs moves Eleven v3 to GA with fewer notation errors in production TTS
Eleven v3 (ElevenLabs): ElevenLabs moved Eleven v3 out of alpha and into commercial GA, emphasizing a 68% reduction in errors on numbers/symbols/technical notation, as announced in the GA announcement and summarized again in the Feature recap.

Beyond “sounds good,” this targets the stuff that breaks real voice agents: reading phone numbers digit-by-digit, handling URLs/emails, and not mangling formulas—see the phone-number before/after example in the Number handling example. The release is available via both UI and API, with more detail in the Release blog.
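For teams wiring this into an agent, a sketch against the ElevenLabs Python SDK looks roughly like the following; the v3 model id used here is an assumption, so confirm it against the Release blog/API docs:

```python
# Sketch using the ElevenLabs Python SDK; voice id, API key, and the "eleven_v3"
# model id are placeholders/assumptions, not values confirmed in the GA posts.
from elevenlabs.client import ElevenLabs

client = ElevenLabs(api_key="YOUR_API_KEY")       # placeholder

audio = client.text_to_speech.convert(
    voice_id="YOUR_VOICE_ID",                     # placeholder
    model_id="eleven_v3",                         # assumed id for the GA v3 model
    text="Call us at 415-555-0132 or visit example.com/help.",
)

# The SDK streams audio bytes in chunks; write them out to check how numbers,
# URLs, and symbols are read aloud.
with open("notation_test.mp3", "wb") as f:
    for chunk in audio:
        f.write(chunk)
```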
Open-source voice-agent benchmark claims frontier models still miss latency needs
aiwf_medium_context benchmark (Daily/aiwf-eval): A new open-source benchmark and deep dive argues that many latest “SOTA” models are too slow for production voice agents, with practitioners still leaning on older/faster options like GPT-4.1 and Gemini 2.5 Flash in real deployments, per the Benchmark thread.
It’s positioned as a quantitative version of “vibes-based evals” for voice loops: multi-turn, long-context conversations scored on instruction following, tool calling, and knowledge retrieval, with the writeup in the Benchmark blog and code in the GitHub repo. The same thread calls out Ultravox 0.7 as narrowing the gap between speech-to-speech and text-mode LLMs, and flags rising competitiveness of smaller open models (e.g., Nemotron 3 Nano) as a 2026 production trendline, as described in the Benchmark thread.
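The latency complaint is easy to sanity-check yourself; the sketch below is not the benchmark’s harness, just a generic time-to-first-token probe against an OpenAI-compatible streaming API, which is the number a voice loop cares about before TTS can start speaking:

```python
# Generic time-to-first-token probe (not the aiwf benchmark's own harness).
# Assumes OPENAI_API_KEY is set in the environment.
import time
from openai import OpenAI

client = OpenAI()

def time_to_first_token(model: str, prompt: str) -> float:
    """Seconds from request start until the first content token arrives."""
    start = time.perf_counter()
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    for chunk in stream:
        if chunk.choices and chunk.choices[0].delta.content:
            return time.perf_counter() - start
    return float("nan")

# Voice agents typically budget well under ~1s here before speech synthesis starts.
print(time_to_first_token("gpt-4.1", "Briefly confirm my appointment for Tuesday."))
```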