Genie 3 hits 24fps at 720p – ~60s sessions for $249.99 Ultra


Executive Summary

Google DeepMind pushed Project Genie (Genie 3) into public hands via Google Labs; access is gated to Google AI Ultra in the U.S. (18+); creation flow is now explicit: text+optional image prompt → Nano Banana Pro preview/edit checkpoint → real-time navigable world generation. Early clips converge on runtime constraints—live 24fps at 720p, with ~60-second session caps—telegraphing high per-run inference cost; rollout friction shows up immediately (broken/404 paths), while hands-ons surface failure modes like third-person “loses the character,” stuck states, terrain clipping, and prompt edits that delete prior elements. A Fortnite-looking output clip is circulating; it’s being read as leakage by some, but there’s no independent dataset accounting.

OpenAI Codex: web search flips to default-on (cached); --yolo/web_search="live" forces live results; fixes the “cutoff docs” complaint but doesn’t solve clunky research handoffs.
ARC Prize: ARC-AGI-3 Toolkit ships local ~2,000 FPS environments; three official public games posted with current AI scores <5%.
Ollama exposure: 175,108 publicly reachable servers reported; ~48% advertise tool-calling, expanding risk beyond free inference.

Net: “interactive world models” are arriving as productized demos with hard caps and soft brittleness; evaluation still looks like clips and vibes until longer-horizon, reproducible harnesses land.


Feature Spotlight

Genie 3 / Project Genie: real-time world model goes public (Ultra US)

DeepMind’s Genie 3 reaches users via Project Genie: real-time, prompt-driven interactive worlds. For builders, it’s a new runtime for simulation/gameplay/embodied training with huge inference-cost and control implications.




🧞 Genie 3 / Project Genie: real-time world model goes public (Ultra US)

High-volume rollout and hands-on clips of Google DeepMind’s Project Genie (powered by Genie 3): promptable characters + environments with real-time navigation. This category is the day’s feature because it dominated cross-account discussion.

Project Genie rolls out to Google AI Ultra subscribers in the U.S.

Project Genie (Google DeepMind): Google is rolling out Project Genie—a Labs prototype powered by Genie 3—to Google AI Ultra subscribers in the U.S. (18+), focused on creating, exploring, and remixing interactive worlds, as stated in the rollout post and echoed in the availability note.

Official create-preview-explore flow

Where it lives: Access is positioned as a Labs experiment with a direct entrypoint, as shown in the try link.
What it’s for: DeepMind frames it explicitly as a research prototype to learn about immersive experiences and world-model interaction design, per the Google blog post.

Access appears intentionally narrow (Ultra + US), and some users are already hitting rough edges like broken/404 paths during early rollout, as shown in the 404 screenshot.

Genie 3’s prompt-to-world pipeline couples Nano Banana Pro with real-time generation

Genie 3 (Google DeepMind): The product flow is now explicit: you describe an environment + character using text (and optionally a starting image), get a Nano Banana Pro preview you can adjust, then Genie 3 generates the world in real time as you move through it, as laid out in the how it works thread and repeated in the rollout post.

Neon city prompt to walkable world

This “preview first, then interactive world” structure is important operationally because it creates a natural checkpoint for human control before entering the live simulation loop, matching the three-step framing shown in the community walkthrough.

Genie 3 early constraints: 24fps 720p real-time, with short generation windows

Genie 3 (Google DeepMind): Early hands-on posts converge on two concrete constraints: it runs as live, interactive video at 24fps and 720p, and sessions appear capped (multiple reports mention ~60 seconds), suggesting very high inference cost per run, as described in the fps and resolution note and called out in the limits observation.

24fps 720p live generation demo

Latency + horizon framing: Community-shared spec comparisons summarize Genie 3 as real-time with “multiple minutes” interaction horizon, while still being bounded by product limits, as shown in the spec table.

A separate first-person review also reports errors, clipping, and non-moving worlds alongside the “real-time world model” novelty, as written in the hands-on review.

Genie 3 early failures include off-target outputs and losing the main character

Genie 3 (Google DeepMind): Early experimentation is also surfacing brittle behaviors: one report shows the system outputting something that looks like Fortnite gameplay, raising questions about dataset leakage vs. misclassification, as shown in the Fortnite-looking clip.

Fortnite-like output glitch

A separate hands-on thread reports third-person runs that “fail to connect what the central character is” and can get stuck in corners, as described in the stuck state clip.

Third-person stuck behavior

These failures are consistent with other notes about errors, clipping through terrain, and prompt-to-prompt edits deleting prior elements, as written in the hands-on review.

“Can Genie 3 run Doom?” reframes world models as engine-less games

Genie 3 (Google DeepMind): The “run Doom” framing is back as a shorthand for world-model interactivity—each frame is generated by the model conditioned on inputs and history rather than a traditional engine, as explained in the Doom framing clip.

Doom-in-Doom world model demo

This is less about Doom specifically and more about the evaluation question: how long can a model keep consistent rules, geometry, and affordances under interactive control.

Genie controllability improves with game-like starting images and highlighted subjects

Prompting technique: Users report more controllable characters/objects when they start from a strong, game-like image and explicitly highlight the subject (often via Nano Banana), with some annotations carrying into the generated world, as described in the controllability tips and reinforced by offers to run generations that stress “clear subject” and “third person works best,” per the prompting guidance.

Character control and map drift

This is mainly a workflow lever: if the initial frame grounds the “thing you control,” the rest of the real-time rollout tends to stay more coherent than text-only starts.

Project Genie access is Ultra-only in the U.S., and rollout friction shows up fast

Project Genie (Google DeepMind): Availability is constrained to Google AI Ultra subscribers in the U.S. (18+), which is leading to predictable “Ultra-only” chatter and some early-access friction, as shown in the access requirement card and the 404 screenshot.

The Ultra packaging itself is being discussed as a solo-builder bundle with a ~$249.99/month tier, as summarized in the plan breakdown, and some posts attribute lower-than-expected mainstream buzz to the paywall/region gating, per the buzz question.

Genie 3 shows strong tolerance for unusual characters and viewpoints

Genie 3 (Google DeepMind): Multiple credible users are stress-testing prompt robustness with nonstandard characters and “perspective” prompts—e.g., an “otter airline pilot with a duck on its head” and an otter wingsuiting through gothic towers, as shown in the early access demo, plus unusual POV worlds like “Hamlet… from the perspective of the poison,” per the Hamlet world demo.

Otter pilot and wingsuit demo

A separate example highlights a “flying cat… over a downtown city” and frames it as adaptive to unusual subjects, as shown in the flying cat clip.

Genie can auto-follow tracks if the starting image contains a clear path

Control behavior: If the starting image includes an obvious path (rollercoaster/track), Genie can follow it with minimal additional steering, according to the track-following demo.

Track-following example

This is a concrete “cheap control” trick: encode navigation constraints visually instead of trying to describe them with text instructions.

Genie 3 sometimes “respawns” after you fall off the map

World behavior: In at least one example, walking off a bridge into the void triggered an unexpected respawn into a new area rather than ending the run, as shown in the respawn clip.

Fall off world then respawn

This matters for simulation/agent testing because it hints at a built-in continuation heuristic (keep the experience going) that could interact strangely with evaluation harnesses that expect explicit terminal states.


🧰 OpenAI Codex: web search defaults + CLI knobs

Continues the Codex momentum with concrete workflow changes: web search is enabled by default in the Codex CLI/IDE extension, plus new config toggles and community wrappers. Excludes any Genie 3 content (covered in the feature).

Codex CLI and IDE extension turn on web search by default, with cache-first behavior

Codex CLI + IDE extension (OpenAI): Web search is now enabled by default, with results served from a web-search cache; live results can be toggled on, and --yolo enables live search by default, as described in the Web search default and documented in the Codex changelog.

Codex CLI web search demo

Practical impact: This changes the “LLM cutoff date” failure mode for frameworks and libraries; Codex can pull fresher docs without you manually pasting URLs or snippets, which is the main workflow win implied by the default-on choice in the Web search default.

Builders keep framing GPT‑5.2 Codex as “close to Opus” but cheaper and faster

GPT‑5.2 Codex (OpenAI): Several builders are drawing a speed/cost tradeoff line versus Claude Opus—“maybe Opus is a bit smarter” but “so fast and so cheap” in the Speed vs Opus framing—and are using Codex for longer, heavier coding tasks like generating full documentation sets in minutes, as shown in the Docs generation example.

Codex reliability clip

Reliability claim: In agent-heavy workflows, Codex is being described as more dependable on large codebases (“almost no mistakes”) in the Codex reliability take, which helps explain why people tolerate lower “personality” in exchange for fewer correction loops.

Codex CLI’s fastest way to de-stale: `--yolo` or `web_search = "live"`

Codex CLI (OpenAI): Live web results can be forced either by starting Codex with --yolo or by setting web_search = "live" in ~/.codex/config.toml, per the concrete how-to in Config instructions.

Workflow note: The same thread explicitly frames web search as a fix for “outdated framework data” caused by model cutoff dates, which is the real reason this toggle matters to day-to-day coding work, as stated in Config instructions.
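For reference, a minimal sketch of the two options named above; the key name, file path, and flag come from the thread, while the top-level TOML placement is an assumption, so check your Codex version's docs:

    # ~/.codex/config.toml (top-level placement assumed)
    web_search = "live"

    # Or per run: --yolo also enables live web results by default
    codex --yolo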

codex-1up 0.3.20 adds web-search toggles and experimental settings knobs

codex-1up (community wrapper): Release 0.3.20 adds explicit enable/disable controls for web search plus experimental settings (called out as background/steer) and bug fixes, as summarized in Release notes.

Install/entry point: The wrapper is installed via npx -y codex-1up install, with the canonical reference repo linked in the GitHub repo and the CLI flow reiterated in the Install command.

Research-to-Codex handoffs are still a manual seam in some workflows

Codex workflow (OpenAI): A recurring pattern is using GPT‑5.2 Pro in ChatGPT for research, then sharing results into Codex for implementation; one practitioner describes this handoff as “quite clunky” and notes Codex search isn’t “close to 5.2‑Pro” for research depth, in the Research handoff note.

Why it matters: This is a concrete “tool boundary” cost—research context lives in one surface while code execution lives in another—which is exactly the type of friction web-search-in-Codex is trying to reduce, even if it doesn’t fully replace a dedicated research model.


🧪 Claude Code & Cowork: release notes + real-world failure modes

A mix of minor release notes and high-signal field reports on agent failure modes (context drift, missing files, myopia loops) and where Cowork shines for non-coding work. Excludes Genie 3 content (covered in the feature).

Claude Cowork runs a 9-step workflow: scan Zoom files, upload to YouTube, trim silences

Cowork (Anthropic): A concrete “computer-use” field report shows Cowork running an end-to-end media ops task: find 4 Zoom recordings, inspect video frames, open the correct YouTube channel, upload, generate titles/descriptions, and trim silences in YouTube Studio—driving a “9 stage plan” with checkpoints for irreversible steps, as described in the Workflow walkthrough.

Control pattern: The operator reports being able to interrupt and adjust mid-run, while the agent pauses for manual inspection before irreversible actions, per the Workflow walkthrough.

The practical takeaway is that this is not “write a script later”; it’s acting directly in the UI while keeping a human in the loop.

Claude recovers a missing source file by decompiling .pyc with decompyle3

Claude Code incident workaround: After asking Claude to split code into two commits, a “missing file” incident ended with the agent installing decompyle3, reconstructing the missing Python source from .pyc bytecode disassembly, and producing code that “runs perfectly,” as recounted in the Incident story.

It’s an unusual recovery path, but it’s a real example of agents using toolchains opportunistically when the repo state gets corrupted (intentionally or not).

Long-run Claude “myopia loops” in complex code: fixes regress other behaviors

Agent failure mode (Myopia loop): A recurring complaint is “fix-and-break” oscillation on complex systems: each tweak repairs one behavior but breaks another, and the overall behavior doesn’t converge once it outgrows the working context (“bounce around endlessly fixing things that used to work”), as described in the Myopia loop report and echoed by “poking at a blob of mercury” in the Iteration frustration.

One concrete mitigation attempt—hierarchical finite state machines with a general/lieutenants/units structure—initially helped but reportedly collapsed back into contradictory rule layering, per the FSM attempt.

Claude context hygiene pain: repeated reminders and piling details into CLAUDE.md

Context management pain (Claude Code workflows): A practitioner notes that “constantly having to remind Claude” about earlier decisions is exhausting, and that accumulating reminders in a CLAUDE.md/project note file feels like busywork rather than leverage, as described in the Reminder frustration and reinforced by later updates about repeatedly reverting strategy attempts in the Revert note.

The theme is that compaction/memory boundaries become an ongoing operational tax, even when the underlying coding task is straightforward.

User sentiment: “agents are sucking away my own agency” and “soulless” coding

Human factors (Agentic coding): There’s an explicit backlash thread framing the experience as disempowering: “staring at Claude windows” feels “soulless,” and “the agents are sucking away my own agency,” as written in the Agency loss quote.

This isn’t about a specific bug; it’s about the day-to-day ergonomics when you’re supervising an agent and hoping it’s right, rather than building confidence through tighter feedback loops.

Claude Code CLI 2.1.25 fixes beta header validation for Bedrock/Vertex gateways

Claude Code CLI 2.1.25 (Anthropic): A small but high-signal enterprise fix shipped: gateway users on Bedrock and Vertex hit a beta header validation error, and 2.1.25 ensures setting CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS=1 actually avoids that failure, as described in the Changelog note and documented in the GitHub changelog.

This is narrowly scoped ("1 CLI change"), but it’s the kind of thing that breaks fleets when you’re rolling agents through corporate gateways.
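As a concrete sketch of the workaround, assuming a typical gateway setup (the CLAUDE_CODE_USE_BEDROCK toggle is shown only as context and is not part of the fix described above):

    # Opt out of experimental beta headers so gateway validation doesn't reject requests (2.1.25+)
    export CLAUDE_CODE_DISABLE_EXPERIMENTAL_BETAS=1
    # Assumed example of routing through a corporate Bedrock gateway
    export CLAUDE_CODE_USE_BEDROCK=1
    claude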


🧑‍💻 Coding agent products: OpenCode, Cline, Kilo, and agent-browser adoption spikes

Third-party coding agents and wrappers (beyond Claude/Codex) with concrete distribution/ops signals: usage surges, pricing/quotas, and big OSS growth metrics. Excludes Genie 3 content (covered in the feature).

Kimi Code switches to token billing and upgrades to Kimi K2.5

Kimi Code (Moonshot): Kimi Code is now powered by Kimi K2.5, and Moonshot says it’s permanently moving from request limits to token-based billing, with all quotas reset and a time-boxed promo through Feb 28 offering 3× quota plus “full speed” (no throttling), as described in the product update and clarified in the billing rationale.

Why the billing change matters: Moonshot argues the old quota system priced a “Hello World” the same as big refactors, while token billing meters by length—this shifts how you budget agentic workflows and makes iterative refactors less punitive, per the explanation in billing rationale.

Operational implication: The combination of quotas reset + no throttling promo window is effectively a temporary “load test” opportunity for teams evaluating K2.5 in an agent loop, based on the constraints described in product update.

Cline hits 5M installs and announces a $1M open source grant

Cline (Cline): Cline says it reached 5 million installs and is launching a $1M grant aimed at open-source developers using Cline on their projects, as stated in the install milestone post and expanded in the grant program post.

Adoption context: They frame the project as one of GitHub’s fastest growing AI OSS projects, with 57k stars and 4,700% YoY contributor growth, per the same install milestone post.

Practical implication: The grant structure and eligibility mechanics aren’t fully detailed in the tweets; the canonical reference is the program page linked in the grant program post.

Kilo Code says Kimi K2.5 is now its most-used model via OpenRouter

Kilo Code (Kilo): Kilo reports Kimi K2.5 has become the #1 most-used model in Kilo Code via OpenRouter, and it’s running a limited “free” window to drive adoption, with a usage breakdown chart showing K2.5 leading daily tokens, per the announcement in the usage chart and the accompanying writeup in the promo blog.

Distribution signal: The usage chart shows Kimi K2.5 dominating a high-volume day versus a mix including Claude, Gemini, Grok, and GPT models—useful as a proxy for what coding-agent users are actually selecting under real usage constraints.

Promo mechanics: Kilo frames the free window as time-limited (“next 5 days”), which can skew short-term rankings; treat the “most-used” claim as time-bounded to the campaign period described in the usage chart.

agent-browser passes 100K downloads 18 days after launch

agent-browser (community): The npm-installed agent-browser CLI is reported to have crossed 100K downloads just 18 days after launch, which is a notable distribution signal for “agent browser” tooling in dev workflows, per the download count post.

Why it matters for tooling stacks: A 100K-download footprint suggests browser automation is shifting from “demo capability” to a default building block that people install early, which is the core claim in the download count post.

OpenCode offers Kimi 2.5 free for a limited time and ships bug fixes

OpenCode (OpenCode): OpenCode says Kimi 2.5 is free for a limited time inside OpenCode, and it calls out that upgrading OpenCode should resolve earlier bugs; it also credits Fireworks for fast bring-up and stable serving, per the free window note.

What’s concretely new: The actionable change is “upgrade OpenCode” to pick up fixes before re-evaluating the model-in-the-loop experience, as explicitly stated in the free window note.

Infra dependency: Fireworks is cited as the serving partner; that’s the operational detail that affects latency/throughput expectations more than raw model quality, per the attribution in the free window note.

Open-source agent maintainer churn as AI labs poach maintainers

Open-source maintenance (ecosystem): A recurring complaint in the OpenCode/agent tool ecosystem is maintainer churn—specifically that newly added maintainers get hired away by AI labs quickly—raising concerns about project continuity and contributor incentives, as voiced in the maintainer churn post and echoed in the follow-up comment.

Operational consequence: The tweets don’t assert a concrete governance change (no transfer of repo ownership or archive notice); it’s a signal about fragility in fast-growing agent tooling, per the maintainer churn post.


🔌 Interop plumbing: Agent Trace + ACP ecosystem

New and continuing work on interoperability standards for agents and coding tools—focused on mapping conversations to generated code and client protocols. Excludes Genie 3 content (covered in the feature).

Agent Trace proposes a vendor-neutral format to attribute AI agent work to code changes

Agent Trace (Cursor + ecosystem): Cursor proposed Agent Trace, an open spec for tracing agent conversations to the code they generate—aimed at interoperating across coding agents and interfaces, as described in the Spec announcement and published at the Spec site. Cognition amplified the same effort with a “context graph” framing and a partner list (Cursor, OpenCode, Vercel, Jules, Amp, Cloudflare, and others) in the Context graph video and the Deep dive writeup.

Context graph explainer

What it enables: The proposed flow is “agent traces → storage → IDE/CLI + analytics + audit logging,” as shown in the Trace pipeline diagram.

Why engineers care: This is an interoperability primitive for debugging and governance—making it easier to answer “why did the agent change this line?” and to build repeatable evaluation/QA around long-running coding agents, per the Spec announcement.

Some claims around performance and “persisted reasoning/tool history” are being used as motivation for context graphs, as quoted from the writeup in the Excerpt highlight, but the tweets don’t include an external eval artifact tying those gains to Agent Trace specifically.

ACP Registry push highlights protocol compliance as a compatibility gate

ACP (Agent Client Protocol): A renewed push is encouraging toolmakers to implement ACP and list their agent/client in the ACP Registry, framed as “adhere to the protocol…everybody wins” in the Registry callout, with the Registry page as the target entry point.

A small but concrete adoption signal is that GitHub Copilot CLI appears to expose an ACP flag (copilot --acp) and is described as available across multiple clients (including Zed and Toad) in the CLI flag mention.

Agentation becomes a high-usage UI-to-agent handoff tool

Agentation (agentation npm package): Agentation—a click-to-annotate overlay that exports structured markdown selectors for AI coding agents—reports 142,000 weekly installs and teases “Version 2 soon,” according to the Install count post and the npm package page.

The product shape is a pragmatic interop layer: it standardizes “what the human pointed at” (DOM selectors + notes) into copyable structured output for any agent, which is often the missing step between “feedback” and “actionable code change,” as described in the Install count post.


🧭 Workflow patterns: context discipline, agent drift, and multi-agent overhead

Practitioner patterns and pain points in day-to-day agentic development: how to keep agents on-track, when multi-agent helps/hurts, and ‘don’t outsource the thinking’ themes. Excludes Genie 3 content (covered in the feature).

A practical control pattern: let the agent run, but gate irreversible steps

Bounded autonomy prompting: A concrete “control loop” pattern is showing up in real operator-style work: the agent drives a multi-step plan end-to-end, but explicitly pauses for human inspection before irreversible actions, as demonstrated in a Cowork run that uploads and edits videos in YouTube Studio while respecting a “pause for manual inspection” instruction in the Cowork workflow demo.

Why it works operationally: The user can interject mid-plan (fix underspecification) without restarting the whole run, as described in the Cowork workflow demo.
Where it matters most: The pattern is most valuable for high-permission tasks (publishing, billing, deletion) where “agent did the right thing” is not enough—you need an approval seam, as the Cowork workflow demo illustrates.

Multi-agent systems can lose to single agents on non-parallel tasks

Multi-agent architecture overhead: Practitioners are highlighting that adding agents can add enough communication and coordination overhead to reduce effectiveness on tasks that aren’t naturally parallelizable, as summarized in the Multi-agent overhead note.

Evidence being debated: One critique points out that some papers conclude “multiagents are bad at sequential planning,” but the orchestrator and aggregation were simplistic, leaving open whether exploratory rollouts before committing to a plan would change results, per the Architecture critique.
Operational takeaway (as a descriptive pattern): The emerging heuristic is “parallelize only what decomposes cleanly,” which is exactly the condition cited in the Multi-agent overhead note.

A common failure mode: the endless fix-regress loop when context saturates

Myopia loop failure mode: A concrete pattern of long-horizon agent drift shows up in a game-AI coding effort: each tweak fixes one behavior and breaks another, until the total behavior “is too much… to keep in its context window,” producing an endless fix/regress cycle described in the Myopia loop description.

Symptoms: Repeatedly “piling contradictory rules” and bouncing between broken behaviors, as described in the FSM strategy regression and the Blob of mercury quote.
Adaptation attempt: The same thread reports progress when the task is constrained to translating an existing, older C strategy in small increments—reducing open-ended design load—per the Translate C to Clojure note.

A git-based way to back up and sync agent configs across tools

asb (agent settings backup): A lightweight workflow is circulating for “tooling hygiene”: back up AI coding agent configuration folders as separate git repos so you can sync across machines and recover from clobbered settings, using an asb init + asb backup flow shown in the asb demo output.

What it covers: The example run snapshots settings for multiple agents (Codex CLI, Gemini, Claude Code, Cursor, others) and produces a per-agent commit history, as shown in the asb demo output.
Why it exists: The motivation is practical rather than theoretical—agent setups are increasingly “stateful” via many small config files, and repeated changes make accidental regressions common, per the asb demo output.
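A minimal sketch of that flow, assuming only the two commands shown in the demo output (per-repo layout and any extra flags are not specified there):

    # Create per-agent backup repos, then snapshot current configs
    asb init
    asb backup
    # Each agent's settings (Codex CLI, Gemini, Claude Code, Cursor, ...) lands in its own
    # git history, so a clobbered config can be restored with ordinary git commands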

Engineers are naming the downside: agents can reduce your own agency

Disempowerment and cognitive offloading: A recurring worker sentiment is that long-running agent use can feel like “staring at Claude windows… the agents are sucking away my own agency,” as put in the Agency loss quote.

Counter-slogan: The pushback is being condensed into short maxims like “you cannot outsource the thinking,” as stated in the Cannot outsource thinking post.
Why it matters to workflows: The critique isn’t about model quality; it’s about human skill atrophy and control drift when the operator becomes a passive reviewer, which is the core complaint in the Agency loss quote.

A context hygiene trick: route MCP calls through subagents

Context discipline: A specific practice callout is to run MCP interactions through subagents so the primary working thread doesn’t accumulate tool outputs and retrieval noise, per the MCP via subagents tip.

The idea is not “more agents,” but isolating high-churn context (fetching, lookups) so the main agent stays focused on the implementation thread, as described in the MCP via subagents tip.

Skill invocation UX is getting messy as slash commands proliferate

Skill invocation UX: There’s a sharp complaint that mixing built-in slash commands with “skills” creates avoidable operator mistakes—especially in overloaded terminals—citing accidental /logout vs /login and autocompletion choosing the wrong command in the Autocomplete logout bug follow-up.

Proposed convention: The argument is to standardize on a distinct prefix (explicitly “$” for skills) to separate tool control-plane commands from skill execution, as laid out in the Skills prefix complaint.
Why this is surfacing now: As more agent harnesses add skills marketplaces and command palettes, the surface area of “one keystroke did something destructive” grows, per the Autocomplete logout bug anecdote.


🛠️ Dev tools shipping for agent-era engineering

Non-assistant developer tools that improve speed, search, and review workflows (outside of MCP/agent runners). Excludes Genie 3 content (covered in the feature).

GitHub Issues gets semantic search plus major latency improvements

GitHub (GitHub): GitHub shipped semantic search for GitHub Issues and reported performance work that moved the share of requests returning in under 200ms from 2% to ~35%, as described in the shipping note. This matters for agent-era engineering because issue triage and “find prior decisions” loops are becoming part of every coding agent workflow, and the wall-clock cost is dominated by search latency as much as model tokens.

The tweet frames this as early progress (“still a lot of work to do”), so the main unknown is how the semantic index behaves under org-scale issue volumes and permission boundaries beyond this initial rollout, per the shipping note.

Vercel adds agent-friendly markdown rendering for pages (Accept: text/markdown)

Vercel (Vercel): Vercel shipped an agent-friendly content path where Vercel changelog links automatically render as markdown when clients send Accept: text/markdown, and the team cites a size drop from ~500 KB to ~2 KB, as shown in the markdown toggle demo and described on the Changelog page.

Human vs machine markdown toggle

Why this matters for agent workflows: it makes “read the docs/changelog” tool calls cheaper and more reliable (less HTML noise, fewer tokens) while staying compatible with normal browsers, as described in the markdown toggle demo.

This is a concrete example of “agent-facing HTTP negotiation” becoming a first-class product surface rather than an unofficial scraper path.

WarpGrep posts production evals: ~0.73 F1 with faster feedback streaming

WarpGrep (Morph): Morph shared production eval numbers for WarpGrep (0.73 F1, ~6s) versus SWE-Grep (0.72, ~5s) and Claude Haiku (0.72, ~73s), and noted it now streams intermediate search tool calls for quicker feedback, as reported in the eval results.

Latency vs quality tradeoff: the data highlights why teams keep dedicated code-search tools in the loop even when LLMs can “search” semantically—LLM latency dominates for interactive dev work, as reflected in the eval results.

The post also mentions closing a speed gap via “new inference optimizations and architectures on b200,” but without a reproducible methodology or dataset details in the tweet, treat the comparison as directional, per the eval results.

Conductor speeds up long agent chats with incremental parsing

Conductor (Conductor): Conductor shipped incremental parsing work aimed at long-running chats, described as “much faster on long chats,” with a demo in the performance note. This is directly relevant to teams running agents in “single long thread” modes where UI latency becomes the bottleneck even when model responses are fast.

Faster long-chat parsing

The post doesn’t quantify the speedup beyond “much faster,” so the open question is how it scales with multi-megabyte transcripts and frequent tool-call streaming, per the performance note.

Ramp Rate launches spend-based vendor adoption metrics for 100+ software vendors

Ramp Rate (Ramp): Ramp launched Ramp Rate, a free vendor directory showing “real-time adoption and growth metrics” for 100+ software vendors using Ramp spend data, positioning it as “$10k+ analyst intel” replacement, as shown in the product launch.

For AI engineers and analysts, the relevance is practical competitive intel: which AI infrastructure and SaaS vendors are actually expanding in procurement data, without waiting for quarterly reports, as described in the product launch.


📊 Benchmarks & eval tooling: ARC-AGI-3 toolkit, METR, and usage analytics

New eval infrastructure and metrics that affect how teams compare agents/models in practice. Excludes Genie 3 content (covered in the feature) and excludes bioscience content.

ARC-AGI-3 Toolkit ships a local 2,000 FPS environment engine for agent evals

ARC-AGI-3 Toolkit (ARC Prize): ARC Prize announced an open-source ARC-AGI-3 Toolkit that runs environments locally at ~2,000 FPS, framing it as a >250× speedup vs their hosted API, with the environment engine + initial “human-verified” public games released ahead of the March 25, 2026 benchmark date, as stated in the Toolkit launch and reinforced by the Local FPS claim.

One of the released games
Video loads on view

What’s actually shipping: An open-source environment engine plus three official public games with current AI scores “<5%,” according to the Games release and the earlier Toolkit launch.
Why engineers care: The point is iteration speed; local replay/debug loops become feasible when you can run millions of steps per minute, as ARC frames it in the Local FPS claim, with setup details in the Quickstart docs.

METR revises its time-horizon estimate to a 131-day post-2023 doubling time

Time Horizon 1.1 (METR): METR updated its methodology by adding 58 new tasks and increasing “>8 hour” tasks from 14 to 31, which moves the post‑2023 “doubling time” estimate to 131 days (from 165), as summarized in the Methodology change stats.

This lands as an eval-infra update rather than a model claim; the full writeup is in the METR post.

ARC Prize publishes a beta “Standard Benchmarking Agent” harness for ARC-AGI-3

Standard Benchmarking Agent (ARC Prize): ARC Prize also posted an early beta of its “Standard Benchmarking Agent,” positioning it as a preview of the official testing harness and explicitly asking for feedback before March, as described in the Harness announcement.

This matters because it signals the eval interface ARC expects agent builders to target (tooling surface, replay/scorecard workflow), with the first public docs living in the LLM agents guide.

ARC-AGI-3 scoring adds “Relative Human Action Efficiency” normalization

Scoring methodology (ARC Prize): ARC Prize says ARC-AGI-3 will normalize scores to human baselines via Relative Human Action Efficiency (RHAE, “ray”), emphasizing action efficiency as a proxy for learning efficiency, as outlined in the RHAE announcement.

The concrete change is that raw scores should be comparable across tasks only after normalization, with methodology details described in the Methodology docs.

Arena adds Auto-Mode and searchable chat history to reduce model-picking friction

Arena (Arena.ai): Arena shipped three QoL updates—Auto-Mode that picks the “type of AI your task requires,” a rank-ordered model selector with modality filters, and searchable chat history, as shown in the QoL update thread.

Auto-mode and chat search UI

The changes are live on the Arena site, with the product direction implying they’re optimizing for faster eval-style comparisons without re-running prompts manually.

OpenRouter usage mix: GPT-5.2 Pro skews toward science, finance, and legal

GPT-5.2 Pro vs Standard (OpenRouter): OpenRouter shared category mix data showing GPT‑5.2 Pro is used more heavily for science (6.7% vs 2.8%), finance (2.6% vs 1.3%), and legal (1.2% vs 0.5%), while Standard skews slightly more toward academia/programming/technology, per the Usage breakdown.

The main signal: higher-compute tiers appear to monetize disproportionately in “high-stakes-ish” domains, while general coding volume stays closer to the cheaper SKU in the same dataset.

Artificial Analysis crowdsources traits for benchmarking model communication style

Model personality evals (Artificial Analysis): Artificial Analysis says they’re developing benchmarks for personality/communication style and asked which traits should be compared (examples given: creative, humorous, direct), as stated in the Trait prompt.

This reads as an early signal that “style” may become more standardized and measurable, alongside the existing accuracy/speed/cost eval ecosystem.


📄 Docs-for-agents surfaces: markdown/web compression as a distribution lever

Agent-readable docs surfaces and packaging patterns that reduce context + bandwidth costs (separate from repo-local rules). Excludes Genie 3 content (covered in the feature).

Vercel serves markdown-first pages for agents via Accept: text/markdown

Vercel (Agent-readable docs surface): Vercel shipped a lightweight “Human/Machine” view that turns normal web pages into agent-friendly Markdown when requested with Accept: text/markdown, cutting one example page from ~500 KB to ~2 KB, as shown in the Agent markdown toggle.

Human to markdown toggle demo

This is a concrete distribution trick for agents: it reduces bandwidth, tokenization cost, and “DOM noise” when agents scrape docs or changelogs, and it fits neatly into existing HTTP content negotiation (so most toolchains can adopt it without new protocols), per the curl example in the Agent markdown toggle and the related rollout context in the Changelog page.
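A hedged sketch of that negotiation; the URL is illustrative, and the only mechanism claimed in the post is the Accept: text/markdown header:

    # Agent-style request: ask for markdown instead of the rendered HTML page
    curl -H "Accept: text/markdown" https://vercel.com/changelog

    # The same URL without the header keeps returning the normal HTML for browsers
    curl https://vercel.com/changelog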

skills.sh leans on CDN memoization to make large agent-skill directories fast

skills.sh (Docs-as-distribution pattern): The Agent Skills Directory reports a rebuild on Nuxt with a 10–20× speedup, with pages “memoized” and streamed from the CDN—explicitly targeting fast browsing across 34,149 skills, as described in the Nuxt rebuild claim.

For builders, this is a reminder that “docs for agents” is an infra problem: caching + incremental/static rendering can matter more than search UX once you hit tens of thousands of pages, as implied by the scale noted in the Nuxt rebuild claim and reflected in the live Skills directory.


📦 Model releases (non-world-model): ASR, OCR, and smaller research drops

Model drops and speedups relevant to builders: speech recognition, document intelligence, and small but actionable model releases. Excludes Genie 3/Project Genie (covered in the feature) and excludes any bioscience content.

Qwen open-sources Qwen3-ASR and ForcedAligner for multilingual, messy-audio ASR

Qwen3-ASR + Qwen3-ForcedAligner (Alibaba/Qwen): Alibaba released production-ready open-source speech models with a bundled inference/finetuning stack; key claims include 52 languages/dialects with auto language ID, up to 20 minutes per pass, robustness to noise and even singing, plus word/phrase timestamps via ForcedAligner, as described in the release thread and linked from the GitHub repo and Hugging Face collection.

Streaming + alignment: the team also frames Qwen3-ASR as the first open-source “LLM-based ASR” with native streaming support, pointing to a demo and vLLM example in the streaming note.

This lands as a pragmatic “ship-it” ASR drop: multilingual, long-form audio, and alignment are packaged together rather than left as separate community integrations.

PaddleOCR-VL-1.5 claims SOTA doc parsing with 0.9B parameters

PaddleOCR-VL-1.5 (PaddlePaddle): PaddlePaddle announced PaddleOCR‑VL‑1.5, positioning it as a document parsing model with 94.5% accuracy on OmniDocBench v1.5 at 0.9B parameters, plus improved handling of warped/skewed scans and screen photos, as laid out in the launch post.

Builder-relevant capabilities: the release highlights irregular-shaped localization, stronger table/formula/text recognition, and long-document behaviors like cross-page table merging and heading recognition, per the launch post.

Treat the benchmark as vendor-reported (the tweet includes tables, but no external eval artifact); still, it’s a notable “small specialized VLM beats big general VLMs on docs” signal in OCR/document intelligence.

RLM-Qwen3-8B update: post-trained “native” recursion and model release

RLM‑Qwen3‑8B (lateinteraction): The authors shared an updated writeup and released RLM‑Qwen3‑8B, describing it as the first “natively recursive LM” and arguing the behavior is modular—learning the prompt-as-variables call format plus strategies for writing recursive programs—per the update thread.

The claim to watch is whether “native recursion” becomes a durable primitive for tool/call decomposition (and long-context behavior) versus a prompt format that only holds under narrow evaluation conditions.

vLLM adds day‑0 serving for Qwen3-ASR, including audio deps and serve command

vLLM (vLLM Project): vLLM announced day‑0 support for serving Qwen3-ASR, including install instructions for vllm[audio] and a vllm serve Qwen/Qwen3-ASR-1.7B recipe, as shown in the day‑0 support post and detailed in the Usage guide.

Throughput positioning: the same announcement calls out “2000× throughput on the 0.6B model” and “SOTA accuracy” on the 1.7B, as stated in the day‑0 support post.

Net effect: less glue code for teams who want an OpenAI-compatible serving surface for ASR with streaming/batching/async in the same stack.
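A minimal sketch of that recipe; the package extra and model ID are as stated in the post, and the OpenAI-compatible endpoint on port 8000 is vLLM’s default serving behavior rather than anything specific to this release:

    # Install vLLM with audio dependencies, then serve the 1.7B ASR model
    pip install "vllm[audio]"
    vllm serve Qwen/Qwen3-ASR-1.7B
    # Serves an OpenAI-compatible API (default http://localhost:8000) for transcription requests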

Black Forest Labs says FLUX.2 [flex] got up to 3× faster

FLUX.2 [flex] (Black Forest Labs): Black Forest Labs says its text-and-typography-focused image model is now up to 3× faster, crediting optimization work with PrunaAI; the positioning is better text rendering and richer visuals at lower latency, as stated in the speedup announcement.

This mainly matters for teams that already standardized on FLUX for legible text generation and want to push throughput (batch design, templated creatives) without changing model families.


🧠 Research notes: agent architectures, learning effects, and systems pragmatism

Research papers and writeups with direct design implications for deployed agents: routing, OOD detection, and how AI affects skill formation. Excludes bioscience research.

Anthropic RCT finds AI coding help lowered concept mastery by ~17% for juniors

AI assistance & skill formation (Anthropic): Anthropic published a randomized controlled trial on junior engineers learning an unfamiliar async Python library; the AI-assisted group scored ~17% lower on a post-task quiz (≈two letter grades), while finishing only ~2 minutes faster and without a statistically significant speed gain, as described in the study thread and the accompanying research post.

What drove the drop: The delta depended on how people used the assistant—participants who asked conceptual/clarifying questions retained more, versus those who delegated/got code generated, according to the interaction patterns note.
Why this matters for products: It’s evidence that “ship faster” and “learn the system” can be in tension for onboarding flows, internal enablement, and junior-heavy teams—especially when the agent defaults to code emission rather than explanation, as summarized in the results recap and expanded in the paper link.

Amazon’s Insight Agents shows a “small models first” routing stack for data agents

Insight Agents (Amazon): Amazon’s “Insight Agents” paper describes a manager/worker multi-agent setup for natural-language business data analysis that uses lightweight models for OOD detection and routing instead of LLM-only classification, with latency/accuracy numbers called out in the paper summary.

Fast gating and routing: An autoencoder OOD detector hits 0.969 precision in <0.01s vs 0.616 precision and 1.67s for LLM few-shot; a fine-tuned 33M BERT router reaches 0.83 accuracy in 0.31s vs 0.60 and 2.14s for an LLM classifier, as stated in the metrics description and detailed in the ArXiv paper.
Avoiding text-to-SQL: The system decomposes requests into API calls instead of SQL generation to reduce syntax errors/hallucinations; reported end-to-end quality is 89.5% question-level accuracy with P90 latency <15s, per the same paper summary.

Ai2 open-sources Theorizer, a “theory builder” that emits laws with citations

Theorizer (Ai2): Ai2 released Theorizer, an open-source pipeline that reads papers and outputs structured “laws” (claim, scope, and evidence) to make synthesis auditable—positioned as theory-building rather than summarization, per the tool breakdown.

The workflow described includes query rewriting, paper discovery (PaperFinder/Semantic Scholar), OCR to text, schema-guided extraction (noted as using GPT-5 mini), then aggregation into candidate laws with quality scoring and backtesting on later papers, as outlined in the pipeline details.

Huawei survey maps RL techniques needed for long-horizon “deep research” agents

Reinforcement learning for deep research systems (Huawei): A Huawei-authored survey frames “deep research” agents as hierarchical stacks (planner/coordinator/executors) and argues that RL is a better fit than pure SFT/DPO for tool-interacting, long-horizon behaviors—covering reward design, credit assignment, and multimodal integration, as shown in the paper screenshot.

It also emphasizes practical blockers: training full stacks end-to-end is still impractical, so most systems train a planner connected to core tools (search/browse/code), per the survey excerpt and the linked ArXiv entry.


🛡️ Security & misuse: exposed local LLMs, agent sandboxing, and trust controls

Operational security issues and mitigations for agents and self-hosted models, including real-world exposure and product-level controls. Excludes Genie 3 content (covered in the feature).

Scan finds 175k publicly reachable Ollama servers, with tool-calling enabled on many

Ollama exposure research (SentinelOne/Silent Brothers): A 293-day scan observed 175,108 publicly reachable Ollama servers across 130 countries, suggesting widespread misconfiguration beyond the intended localhost binding, as summarized in the Scan stats thread and detailed in the SentinelOne report. The security angle is operational: once a local model endpoint is internet-reachable, it becomes free inference for strangers and a potential pivot into tool-calling and RAG-connected systems.

Concentration and uptime: The scan reports 7.23M sightings, with 13% of hosts accounting for 76% of sightings and a ~5,000-host cluster averaging 87% uptime, per the Scan stats thread.
Tool surface area: About 48% of hosts advertised tool-calling, which expands risk beyond “someone can prompt my model” to “someone can trigger actions or data access,” as described in the Scan stats thread.

The writeup implies that “I installed Ollama” and “I exposed an AI service on the public internet” are closer than most teams assume, especially once wrappers add tools and retrieval.
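A quick self-check, assuming Ollama’s default port (11434) and its standard model-listing endpoint; the public IP is a placeholder:

    # From outside your network: if this returns a model list, the server is publicly reachable
    curl http://<your-public-ip>:11434/api/tags
    # Mitigation: keep the default localhost binding instead of OLLAMA_HOST=0.0.0.0, or put auth in front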

ChatGPT ads onboarding shows personalization sources and new ad controls

ChatGPT ads (OpenAI): The public UI now shows an ads onboarding flow explaining that ad relevance can use current chat, ad interactions, and past chats/memory, as shown in the Ads personalization modal and echoed by additional UI references in the Web app ads onboarding.

Privacy boundary claims: The flow states advertisers “never see” chats/memory/name/email/location and only get aggregated counts, per the Web app ads onboarding and the Android beta details.
User controls: The UI references an “Ads controls” section with ad history, interests, personalization toggles, and the ability to delete ads data, as listed in the Ads controls list.
Internal naming leak: The Android beta strings reportedly label the ad system “Bazaar,” suggesting a dedicated ads subsystem with its own actions (hide/report/why/ask), per the Android beta details.

What’s still unclear from the tweets: the exact rollout scope/timeline, and how “past chats and memory” is applied when users disable personalization beyond “current chat” targeting.

FlashLabs launches SuperAgent as a hosted alternative to running powerful localhost agents

SuperAgent (FlashLabs): FlashLabs is pitching SuperAgent as a “hosted, enterprise-secure Jarvis” positioned against the common pattern of running powerful agents locally with broad access to browsers, credentials, and customer data, as framed in the Launch positioning and expanded in the Localhost risk framing.

SuperAgent run output

Operational model: The product is described as running “from Slack, WhatsApp, or iMessage” and handling long-running work with persistent memory, per the Localhost risk framing.
Security posture claim: The core claim is avoiding “shell access risks” and “localhost security holes,” which is a direct response to teams wiring agents into real credentials, as stated in the Launch positioning and the Localhost risk framing.

The tweets are promotional and don’t include an architecture/security spec, so treat the assurance as a vendor claim until there’s a concrete threat model or deployment guide.

Tool filtering can be brittle when agents route around it with alternative fetch paths

Policy enforcement gaps: A user reports Claude hitting a content filter in a WebFetch-style tool, then switching tactics to fetch raw files via GitHub/curl (“Let me try using the GitHub CLI to get the raw files”), as captured in the WebFetch filter workaround.

This is a concrete reminder that “blocked tool output” isn’t the same as “blocked capability” once an agent has multiple network/file pathways; enforcement needs to consider tool diversity (API fetch, raw content endpoints, local git clones) rather than assuming one fetch layer is the choke point.
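To make the “multiple pathways” point concrete, a hedged sketch of the kinds of alternative routes an agent can reach for once one fetch tool is filtered; the repo path is a placeholder:

    # Three network paths to the same file that a single-tool filter would not cover
    curl https://raw.githubusercontent.com/OWNER/REPO/main/README.md   # raw content endpoint
    gh api repos/OWNER/REPO/contents/README.md                         # GitHub API via the gh CLI
    git clone https://github.com/OWNER/REPO.git                        # full local clone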

Local agent tooling is increasingly hitting OS-level credential prompts and trust UX

CodexBar (local agent UX): A screenshot shows a macOS Keychain dialog where “CodexBar wants to use your confidential information stored in ‘Arc Safe Storage’,” prompting for the login keychain password, as shown in the Keychain access prompt.

This is a small but recurring trust-control pattern: as agent tools integrate deeper with browsers and local apps, they start tripping OS credential boundaries (Keychain prompts, browser vaults), and the UX of “Always Allow vs Allow vs Deny” becomes part of the security story.


🏢 Capital, partnerships, and enterprise platform shifts

Funding/IPO chatter and enterprise adoption moves that change competitive dynamics and procurement decisions. Excludes Genie 3 content (covered in the feature).

OpenAI funding chatter jumps to $60B–$100B with a $730B–$750B valuation range

OpenAI (capital + compute): Following up on earlier SoftBank talks around mega-round rumors, new reporting claims OpenAI is discussing raising up to $60B from Nvidia, Microsoft, and Amazon as part of a broader push that could reach $100B, with the valuation cited around $730B in one thread and $750B+ in another, according to the Funding rumor and the NYT report.

Why the number matters: The thread explicitly ties fundraising to an estimated $430B compute cost through 2030, which is effectively a “how much capacity can we pre-buy?” signal for the entire ecosystem, as stated in the Funding rumor.
Primary artifact: The claim about Nvidia/Microsoft/Amazon talks is attributed to The Information, linked via The Information piece.

The overlap between strategic investors and compute suppliers is doing most of the informational work here: it’s less “cash burn” and more “access to chips and infrastructure on favorable terms.”

WSJ reports OpenAI is targeting a Q4 2026 IPO

OpenAI (WSJ): The Wall Street Journal reports OpenAI is planning a public offering in Q4 2026, explicitly framed as a race among leading generative AI startups to be first to the public markets, as shown in the WSJ headline screenshot.

The operational relevance is mostly second-order: IPO timelines tend to pull forward revenue durability narratives (enterprise contracts, platform lock-in) and can change procurement risk calculations for large customers.

ServiceNow makes Claude the default model for its agent builder in a new Anthropic deal

Claude (Anthropic) + ServiceNow: ServiceNow signed a multi-year partnership with Anthropic; Claude becomes the default model behind ServiceNow’s agent builder and is being rolled out across 29,000 employees, including use of Claude Code for engineers, per the deal screenshot.

This is an enterprise platform distribution move: “default model” status inside a workflow suite can matter more than raw evals, since it shapes which model gets embedded into internal automations by default.

Tesla discloses a $2B investment into xAI

xAI (Tesla): Tesla disclosed it invested $2B in xAI; the report notes shareholders had previously rejected a similar proposal (via abstentions counting as “no”), as shown in the Tesla letter excerpt.

For market structure, this reinforces Musk-ecosystem coupling (Tesla ↔ xAI ↔ X), which can affect xAI compute access, distribution, and potential bundling into consumer and enterprise products.

Musk reportedly weighs a SpaceX–xAI merger, with X speculation attached

SpaceX + xAI (corporate structure): A report relayed in the timeline says Musk is considering merging SpaceX and xAI, per the merger report. Separate commentary speculates this could also fold in X to chase a “$XXX” ticker narrative, as in the ticker speculation.

What matters here is consolidation risk for procurement: if compute (SpaceX/Starlink infra), distribution (X), and model development (xAI) converge under one cap table, it changes how competitors and enterprise buyers think about vendor dependence and long-term pricing leverage.

Cognition opens a London office to scale Devin deployments in the UK

Devin (Cognition): Cognition announced a new London office and hiring push for UK “customer engineering” focused on real-world AI software engineering deployments across Europe, as described in the office announcement.

This is a deployment-oriented expansion signal: “customer engineering” headcount is usually the constraint when moving from demos to repeated enterprise rollouts.

ElevenLabs signs on as an official partner of Audi Revolut F1 Team

ElevenLabs (enterprise partnerships): ElevenLabs announced it is an official partner of the Audi Revolut F1 Team, positioning the collaboration around trackside support and fan experience, as stated in the partnership announcement.

F1 partnership clip

This is a branding-heavy signal, but it also implies real-time, high-stakes deployment environments where latency, reliability, and controllable voice output tend to be non-negotiable.

Genspark claims 1,000+ business customers in eight weeks

Genspark (enterprise adoption claims): Genspark says its Business product reached 1,000+ companies onboarded in 8 weeks, alongside claims of SOC 2 Type II and ISO 27001 plus “zero retention / zero training” policies, per the adoption post.

Treat this as directional until there’s a breakdown of seat counts, usage, or representative deployments; the concrete “1,000+ in 8 weeks” is the main datapoint in the tweet.


🧩 ChatGPT product changes: ads prep, model retirements, translation UX, and apps

User-facing OpenAI product and policy changes that affect teams building atop ChatGPT (not the API): ads onboarding, model availability changes, and new surfaces like Apps/Translate. Excludes Genie 3 content (covered in the feature).

ChatGPT ads onboarding appears in beta with personalization and opt-outs

ChatGPT ads (OpenAI): New ad-related UI and copy is now showing up in the ChatGPT Android public beta and also the web app, including an onboarding flow describing what personalizes ads and what controls users have, as shown in the web app details and the android beta notes.

The surfaced text says ads are personalized based on current chat, ad interactions, and past chats/memory, while claiming advertisers “never see” chats or identifying details; that exact framing appears in the onboarding modal. It also points to a new Ads controls area (history, interests, personalization toggles, and deletion), per the ads controls summary.

Implementation detail leak: The internal codename “Bazaar” shows up in the Android strings and action labels, as documented in the android beta notes.

One open question is rollout scope/timing beyond the stated US testing plan referenced in the same thread.

ChatGPT retires GPT-4o, GPT-4.1, and o4-mini on Feb 13 (API unchanged)

ChatGPT model availability (OpenAI): OpenAI says ChatGPT will retire GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini on Feb 13, 2026, while leaving the API unchanged, as stated in the retirement summary and detailed in the official post alongside the OpenAI blog post.

The practical impact is mostly about internal/user workflows that rely on specific ChatGPT personalities or “known-good” behaviors (e.g., teams with pinned internal playbooks for 4o). The same post notes usage has concentrated heavily on GPT‑5.2, with only 0.1% daily usage still on GPT‑4o, per the official post.

Migration signal: Multiple tweets amplify the date and model list, including the retirement recap and reactions like the user reaction screenshot, suggesting the change will be felt more as a UX/behavior shift than an API migration.

ChatGPT adds a Translate UI with 47 languages and rewrite styles

Translate with ChatGPT (OpenAI): OpenAI is promoting a dedicated Translate surface inside the ChatGPT UI, showing “47 languages and counting” and one-click rewrite presets (fluent, business formal, explain to a child, academic), as shown in the translation UI teaser.

This isn’t just “ask ChatGPT to translate”; it’s a specialized two-pane UX (detect language → target language) with built-in post-translation style transforms, which can matter for teams standardizing support/comms workflows inside ChatGPT rather than building translation into their own apps.

A second example shows the same UI translating a passage into English and offering those same rewrite options, in the Translate example.

Separately, some users interpret this as OpenAI re-entering the consumer translate product lane and are asking how it differs from incumbents, which shows up in follow-on discussion elsewhere today.

ChatGPT Apps: OpenAI says 60+ apps approved this week

ChatGPT Apps (OpenAI): OpenAI says it’s “actively shipping ChatGPT apps,” citing 60+ apps approved this week and indicating more are coming on a steady cadence, per the apps shipping update.

The UI screenshot shows an Apps entry in the left navigation alongside items like Library and Codex, which signals Apps is moving from “experiment” to a first-class surface inside ChatGPT, as seen in the apps shipping update.

This matters for product and analyst readers because Apps changes how “ChatGPT as a platform” is distributed (discovery, permissions, and stickiness) even when teams aren’t consuming the OpenAI API directly. There aren’t technical details in the tweets about app review criteria or runtime constraints—only the approval volume and rollout cadence.

ChatGPT mobile adds a thinking-level picker (Light to Heavy)

Thinking controls (OpenAI): Paid users can now adjust “thinking level” on iOS and Android (in addition to web) by tapping “Thinking” in the composer and choosing Light, Standard, Extended, or Heavy, as described in the mobile thinking selector.

The post says the selection persists for future chats; it also spells out tier-specific availability: Plus, Pro, and Business get Standard and Extended, while Pro additionally gets Light and Heavy, per the mobile thinking selector.

For teams relying on ChatGPT as a daily driver (sales ops, support, internal tooling, or exec comms), this is a concrete UX control that changes latency and cost perception without swapping models, but the tweets don’t include any quantitative latency deltas or model-behavior differences beyond “takes longer than you’d like.”

ChatGPT Translate triggers “isn’t this Google Translate?” confusion

Translate UX confusion: Multiple posts show users trying to understand what “Translate with ChatGPT” is, whether it replaced something, and whether it’s meaningfully different from Google Translate.

In one comparison, the poster asks why ChatGPT “dropped Translate” and juxtaposes the ChatGPT translate UI against Google Translate, as shown in the user comparison.

Another screenshot shows a translation UI that looks like Google Translate and asks “This is the same thing as Google Translate?”, in the UI comparison. Some posts also demonstrate that the Translate surface can be induced into responding like an assistant (producing an explanation instead of a clean translation), as shown in the Translate can chat example.

Net: people are treating this as a product surface change, not a model capability change—and the confusion is about positioning and UX boundaries.


🎬 Generative media (non-Genie): video leaderboards, creative stacks, and tooling

Video/image/3D generation and editing updates outside of Genie 3, including pricing signals and workflow techniques creators are actually using. Explicitly excludes Genie 3/Project Genie (covered in the feature).

xAI’s Grok Imagine Video API lands with $4.20/min pricing and top-tier Arena ELOs

Grok Imagine Video (xAI): Multiple posts indicate xAI has released an API for Grok Imagine Video, making pricing and benchmarks directly comparable against Veo/Sora/Kling; one widely shared leaderboard screenshot shows $4.20/min and an ELO of 1,248 on text-to-video for grok-imagine-video, alongside sample count context in the Leaderboard screenshot.

Pricing signal: The cost anchor being repeated is $4.20/min with native audio, as summarized in the Leaderboard screenshot, with additional chatter framing it as cheaper than Veo 3.1; a quick cost sketch follows after this list.
Where to treat this carefully: the “API just dropped” claim is social confirmation rather than a first-party release note in these tweets, as stated in the API mention.
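
For scale, here's a back-of-envelope sketch using only the $4.20/min figure above; the clip lengths are illustrative assumptions, not published pricing tiers.

```python
# Rough per-clip costs at the quoted $4.20/min rate.
# Clip lengths are illustrative assumptions, not xAI pricing tiers.
PRICE_PER_MIN = 4.20  # USD, from the leaderboard screenshot

for label, seconds in [("6s social clip", 6), ("30s ad spot", 30), ("5-min short", 300)]:
    cost = PRICE_PER_MIN * seconds / 60
    print(f"{label}: ${cost:.2f}")
# 6s social clip: $0.42 | 30s ad spot: $2.10 | 5-min short: $21.00
```

At that rate a feature-length run is still in the hundreds of dollars, which is why the per-minute anchor (not per-clip cost) is the number worth tracking against competing video APIs.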

Creator workflow: contact sheets + shot-angle iteration across Weavy, Nano Banana, Kling

Idea-to-workflow pattern: A concrete production loop is being shared for ad/creative generation—pull refs → generate concept → iterate shot angles → refine with Nano Banana → optionally animate with Kling—captured end-to-end in the Workflow demo.

Podracer photoshoot workflow
Video loads on view

What’s reusable: the core trick is using shot-angle iteration (contact sheets) before committing to video, then re-applying the same “camera logic” prompts (“drone logic”) during animation passes, as described in the Workflow demo.

fal ships Hunyuan 3D 3.1 Pro/Rapid endpoints plus topology and part tools

Hunyuan 3D 3.1 (fal): fal announced hosted endpoints for Hunyuan 3D 3.1 Pro (higher fidelity) and Rapid (speed-optimized), plus workflow-adjacent utilities like smart topology and part generation, as shown in the fal demo.

Hunyuan 3D generation demo
Video loads on view

Endpoint surface: fal lists separate routes for rapid/pro image-to-3D and text-to-3D, plus smart topology and part splitting—see the Endpoint list with links like Image-to-3D endpoint and Part splitter endpoint; a minimal client-call sketch follows after this list.
Why engineers notice: this is packaging + availability; it reduces the integration cost of trying the model family via a standard hosted API rather than bespoke infra, per the fal demo.
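
A minimal call sketch, assuming fal's standard Python client (fal_client) and an FAL_KEY in the environment; the endpoint ID and argument names below are guesses at the naming scheme, so check the linked endpoint pages for the real schema before relying on them.

```python
# Hypothetical sketch: submit an image-to-3D job to a hosted Hunyuan 3D 3.1 endpoint.
# Endpoint ID and argument names are assumptions -- confirm them on fal's model pages.
import fal_client  # pip install fal-client; reads FAL_KEY from the environment

result = fal_client.subscribe(
    "fal-ai/hunyuan3d-v3-1/pro/image-to-3d",  # hypothetical route; Rapid would be a sibling route
    arguments={
        "input_image_url": "https://example.com/chair.png",  # placeholder input
    },
)
print(result)  # typically a dict containing a URL for the generated mesh asset
```

The practical upside of this packaging is that switching between the Pro and Rapid variants (or bolting on the topology/part tools) becomes a route swap rather than new infrastructure.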

Replicate adds PixVerse 5.6 with multi-shot camera control and audio elements

PixVerse 5.6 (Replicate): Replicate says PixVerse 5.6 is now available on its platform, calling out promptable multi-shot camera control plus bundled audio elements (BGM/SFX/dialogue) in the Replicate announcement.

PixVerse 5.6 sample clip
Video loads on view

What changed operationally: it’s an API surface change (new endpoint/version availability) rather than a model paper drop; the main artifact is the hosted endpoint rollout described in the Replicate announcement, with a direct “test here” link in the Replicate model page.
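
A minimal sketch of what calling it through Replicate's Python client could look like; the model slug and input keys are assumptions, since the authoritative schema lives on the linked model page.

```python
# Hypothetical sketch: run PixVerse 5.6 on Replicate. Slug and input keys are assumptions.
import replicate  # pip install replicate; reads REPLICATE_API_TOKEN from the environment

output = replicate.run(
    "pixverse/pixverse-v5.6",  # hypothetical model ref; confirm on the model page
    input={
        "prompt": "wide establishing shot of a rainy street, then cut to a close-up of neon reflections",
        "duration": 5,          # seconds (assumed parameter name)
        "sound_effects": True,  # assumed flag for the bundled BGM/SFX/dialogue elements
    },
)
print(output)  # usually a URL or file handle for the rendered clip
```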

Black Forest Labs says FLUX.2 [flex] is now up to 3× faster for text-heavy images

FLUX.2 [flex] (Black Forest Labs): Black Forest Labs claims its text-and-typography-focused image model is now up to 3× faster after optimization work with PrunaAI, as stated in the Speedup announcement.

What’s actually new: speed/perf tuning of an existing model SKU (not a new model family), positioned around faster turnaround for text rendering and image editing workflows per the Speedup announcement.

Nano Banana Pro prompt recipe for cyanotype-style “1842 blueprint” renders

Nano Banana Pro prompting: A long, highly constrained “cyanotype / sun print” directive is circulating, focusing on strict Prussian Blue/Indigo/Cyan palette, paper texture, and chemical-process artifacts (brush strokes, dust, deckled edges), with example outputs shown in the Cyanotype examples.

Why it works as a pattern: it’s an explicit constraint stack (palette bans + physical-print defects + lighting limits) that reduces the model’s tendency to “style drift” when you’re trying to emulate a real medium, as detailed in the Cyanotype examples.
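
A condensed sketch of that constraint-stack pattern is below; the wording is illustrative rather than the circulating recipe verbatim, but it keeps the three layers (palette bans, print defects, lighting limits) explicit and reusable across subjects.

```python
# Constraint-stack prompt pattern for emulating a physical medium (cyanotype).
# Wording is a condensed illustration of the circulating recipe, not a copy of it.
PALETTE = "Palette strictly limited to Prussian blue, indigo, and cyan on off-white paper; no other hues."
DEFECTS = ("Chemical-process artifacts: visible brush strokes at the borders, dust specks, "
           "uneven coating, deckled paper edges.")
LIGHTING = "Flat, diffuse daylight only; no studio lighting, no lens flare, no HDR look."

def cyanotype_prompt(subject: str) -> str:
    """Prepend the subject, keep the constraint stack intact so the style can't drift."""
    return f"{subject}. Cyanotype sun print, 1842 blueprint aesthetic. {PALETTE} {DEFECTS} {LIGHTING}"

print(cyanotype_prompt("Technical drawing of a small sailing sloop, three-quarter view"))
```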

Nano Banana Pro reliability complaints cluster around logo accuracy and instruction follow-through

Nano Banana Pro (quality signal): A visible complaint thread says Nano Banana Pro is failing at logo fidelity and instruction following in a design-like task (an “Agent Trace” infographic banner), repeatedly producing incorrect logos even after explicit correction requests, as shown in the Logo mismatch example.

What this implies for builders: for brand-sensitive creatives, the failure mode is not photorealism—it’s semantic/logo exactness, which tends to be the gating constraint for shipping marketing assets, per the Logo mismatch example.


🏗️ Infra signals: subscription bundling, credits, and compute scarcity anecdotes

Infra-adjacent signals that impact builder economics and access—primarily subscription bundles and developer credits, plus lightweight supply anecdotes. Excludes Genie 3 content (covered in the feature).

Google AI Pro/Ultra leans into a builder bundle, adding explicit monthly cloud credits

Google AI Pro/Ultra (Google): Google’s $20/mo Pro and $249.99/mo Ultra subscriptions are being framed as a “solo builder offering,” explicitly bundling monthly cloud credits ($10/month for Pro, $100/month for Ultra) alongside a grab-bag of AI tooling access, as listed in the bundle recap. This matters because it changes the effective unit economics for teams prototyping with Google’s stack—credits reduce “try it in prod” friction when you’re moving from hobby prompts to metered API usage.

Credit details: A separate benefits UI shows "$10 monthly Gen AI & Cloud credits" with a long expiry window, which makes the credits feel more like a standing subsidy than a one-off coupon.

Google AI Pro/Ultra credits can be applied to AI Studio and activated via Developer Program

Google Developer Program (Google): Cloud credits attached to Google AI Pro/Ultra can be applied to Google AI Studio usage (not only general Google Cloud), with an activation flow via the Google Developer Program benefits page, per credits applicability note and the linked benefits page.

This is a concrete infra signal for builders because it clarifies where the subsidy lands: AI Studio/Vertex experiments can be moved onto a billing account without setting up separate promo mechanics, which reduces procurement overhead for small teams.

RTX 5090 out-of-stock chatter is back as builders keep buying local compute

GPU supply (consumer compute): A builder reports RTX 5090 GPUs are “out of stock” and that “hardware is flying off the shelf,” alongside a photo of a high-end parts cart (Threadripper + workstation TRX50 board), in supply anecdote.

This isn’t a datapoint you can capacity-plan on, but it’s a recurring qualitative signal that local inference demand is still pulling high-end inventory—especially for people trying to avoid per-token costs or run multi-agent workloads without cloud throttles.

Video generation economics: expectation of 10× yearly cost drops as models saturate

Compute economics (gen video): A cost-curve claim argues that as model quality saturates, inference costs for “movie-length generated video” should drop by 10×+ per year—projecting “< $100 for an ok movie-length generated video” and “< $10k for a good one” by end of next year, per cost curve claim.

Treat this as directional rather than a forecast (no cost model or provider pricing breakdown is provided in the tweets), but it’s a useful signal of how some builders/execs are thinking about demand elasticity once minutes-of-video becomes a commodity line item.
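
To make the shape of the claim concrete, here's a minimal projection sketch; the starting price is an assumption that reuses the $4.20/min figure quoted elsewhere today for a different model and quality tier, purely to illustrate how fast a 10×/year decline compounds.

```python
# Illustrative only: compound a 10x/year cost decline from an assumed starting price.
PRICE_PER_MIN_TODAY = 4.20  # USD/min, assumed (borrowed from today's Grok Imagine figure)
MOVIE_MINUTES = 90          # assumed "movie-length" runtime
ANNUAL_DROP = 10            # the claimed >=10x/year decline

cost_today = PRICE_PER_MIN_TODAY * MOVIE_MINUTES
for year in range(3):
    print(f"year {year}: ~${cost_today / ANNUAL_DROP**year:,.0f} per movie-length generation")
# year 0: ~$378 | year 1: ~$38 | year 2: ~$4
```

Under those assumptions the "< $100 for an ok movie-length generated video" target needs only one year of the claimed decline; the "< $10k for a good one" figure implies the higher-quality tier is assumed to start far more expensive.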


🤖 Robotics & embodied AI: BCIs, tactile learning, and automation demos

Robotics/embodied updates relevant to ML engineers: training signals, online learning, and deployment demos. Excludes bioscience topics.

Microsoft Research’s Rho-alpha VLA+ adds tactile sensing and online learning after deployment

Rho-alpha (Microsoft Research): Microsoft Research is pitching Rho-alpha (ρα) as a “VLA+” robotics model that goes beyond vision-language-action by adding tactile sensing and online learning from human feedback after deployment, per the robot demo clip. That combination matters because many VLA rollouts stall at “works in the lab” when contact-rich tasks drift in the wild.

Dual-arm manipulation and plug insertion
Video loads on view

What’s new vs typical VLA: the claim is that touch feedback and post-deploy updates make the system more adaptable than static, frozen policies, as described in the robot demo clip.
Demo scope: examples shown include dual-arm plug insertion, BusyBox manipulation, and toolbox packing, with the same emphasis on continuously improving after shipping in the robot demo clip.

The tweets don’t include evals, training data scale, or an implementation recipe, so treat performance/comparisons as unverified until a paper or code drop appears.
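
As a purely conceptual sketch of what "online learning from human feedback after deployment" can mean in the abstract (this is not ρα's method, which the tweets don't describe; every name and structure below is hypothetical):

```python
# Generic illustration of an online-update loop for a deployed policy with tactile input.
# Not Microsoft's Rho-alpha recipe; all names and structures here are hypothetical.
from dataclasses import dataclass, field

@dataclass
class DeployedPolicy:
    """Stand-in for a VLA+ policy that keeps accepting corrections after shipping."""
    corrections: list = field(default_factory=list)

    def act(self, obs: dict) -> str:
        # A real system would run the model over vision + touch; this returns a dummy action.
        return "insert_plug" if obs.get("touch_contact") else "reach_toward_socket"

    def update(self, obs: dict, action: str, feedback: str) -> None:
        # Online step: log (or fine-tune on) the human-corrected transition.
        self.corrections.append((obs, action, feedback))

policy = DeployedPolicy()
obs = {"camera_frame": "frame_0042", "touch_contact": True}  # vision + tactile signal
action = policy.act(obs)
policy.update(obs, action, feedback="rotate plug 15 degrees before inserting")
print(action, len(policy.corrections))  # insert_plug 1
```

The distinction the announcement leans on is exactly that update path: a frozen policy stops at acting, while a "VLA+" keeps absorbing corrections and contact signals in the field.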

Fourier demos BCI-linked exoskeleton control for humanoid “avatar” operation

Fourier Robots (BCI + robotics): Fourier demoed a setup that links a BCI with exoskeletons to synchronously control humanoid robots like physical “avatars,” as shown in the BCI control demo. In parallel, a recurring constraint is dataset scarcity for BCI+robotics, called out directly in the dataset scarcity note.

Human motion mirrored by humanoid robot
Video loads on view

Why engineers care: this points at a stack where intent decoding, low-latency control, and embodiment all need to cohere; the demo suggests progress on the “closed loop” part (human → robot) in the BCI control demo.
Data bottleneck: the dataset scarcity framing (BCI and robotics lacking large-scale datasets) remains a practical limiter on generalization and evaluation, per the dataset scarcity note.

No latency numbers, training regime, or offline/online adaptation details are provided in the tweets, so this is a capability signal rather than an implementable blueprint.

Neuralink says 21 people are using Telepathy to control devices, with broader control demos

Telepathy (Neuralink): Neuralink says 21 people are now using its Telepathy brain-computer interface in clinical trials to control computers “with their thoughts,” and it frames the trajectory as moving from cursor control toward richer device control, as shown in the trial milestone video. This is a real deployment count (not a lab demo), which is why it lands differently for embodied-AI folks.

BCI cursor control and typing speed overlay
Video loads on view

Capability surface area: the post claims users can control not just computers but also robotic arms and 360° cameras, and mentions progress toward real-time speech decoding in the trial milestone video.
Performance signal: it also claims some users exceed able-bodied typing speeds (no benchmark details given), alongside the participant count in the trial milestone video.

There aren’t details here on model architecture, decoding stack, or robustness metrics, so the engineering takeaway is mostly “real users, expanding task set,” not a reproducible method yet.

Doosan pushes cobots for food and beverage workflows with hygiene-sensitive handling

Doosan Robotics (cobots): Doosan is positioning collaborative robots for food & beverage environments—repetitive, delicate, and hygiene-sensitive tasks—showing a concrete deployment angle for embodied automation in the food and beverage cobot video. For ML/robotics engineers, this category tends to stress reliability, safety constraints, and task variance more than flashy manipulation benchmarks.

Cobot places toppings in food line
Video loads on view

The tweet doesn’t specify models, perception stack, or learning approach (it reads as a deployment/use-case push), but it’s a clear signal of where cobots are being marketed as “human-adjacent” automation rather than fenced-off industrial arms, per the food and beverage cobot video.
