Project Genie (Genie 3) hits Google Labs – ~1‑minute playable worlds
Executive Summary
Google DeepMind rolled out Project Genie on Google Labs: a text prompt or photo → approve/modify a Nano Banana Pro preview → Genie 3 generates an explorable real-time world. Access is framed as US-first and tied to the AI Ultra tier, and early demos show a short session window (often described as ~1 minute), making it closer to an interactive shot generator than a persistent sandbox. Hands-on probes focus on “worldness” basics: collision behavior (cars and doors block movement), brief occlusion memory (turn away and return; entities often persist, but “not perfect”), and a directing surface with WASD movement plus arrow-key camera framing, with an unlisted Shift-to-run control also spotted.
• xAI/fal: Grok Imagine Video edit endpoint surfaces pay‑per‑use pricing (example $0.36 for a 6s edit); positioning is surgical continuity edits, not new footage.
• Angles v2: pushes camera-as-parameter after image gen with 360° orbits and lighting adaptation; continuity via coverage rather than re‑prompting.
• Open world-model baselines: LingBot‑World is marketed on offscreen memory/cause‑effect, but no public eval artifacts in the cited posts.
Failure modes are already getting formalized (mirrors; infinite backward walking); capacity warnings/rate limits hint the bottleneck may be availability as much as coherence.
Top links today
- AlphaGenome paper and model weights
- fal hosting for new video models
- Freepik Nano Banana Pro inpainting
- Runway Gen-4.5 image to video
- Luma Dream Machine Ray3.14 1080p I2V
- SciSpace Agent workflow updates
- Zotero reference manager integration
- Kimi Code CLI open-source repo
- PrunaAI acceleration for FLUX.2 flex
- Topaz Labs new upscale model
- Remotion agent skills integrations
- Adobe Firefly boards workflow
- ComfyUI Grok Imagine integration
Feature Spotlight
World models go hands-on: Project Genie (Genie 3) and the push toward interactive worlds
Project Genie turns prompts into explorable worlds with real-time control—an early glimpse of “directable simulation” becoming a mainstream creative canvas.
🌍 World models go hands-on: Project Genie (Genie 3) and the push toward interactive worlds
The dominant story is Google DeepMind’s Project Genie (powered by Genie 3): text/photo → a controllable, explorable world with early signals for physics, memory, and directing controls. Also includes adjacent conversation about “world models” as a 2026 creative medium (excludes non-world-model video tools).
Project Genie launches on Google Labs for AI Ultra users in the US
Project Genie (Google DeepMind): Google DeepMind is rolling out Project Genie, a Google Labs experiment that turns a text prompt (or photo) into an explorable real-time world powered by Genie 3, with an image “preview” step generated by Nano Banana Pro as described in the Launch thread; access is framed as US-first via Google Labs according to the Try it link, with some posts calling out “only in the US” right now as noted in the US-only note. This matters for filmmakers and game artists because it’s not just image-to-video—it adds input control (movement/camera) and short-lived world coherence.

• Pipeline: “Describe a world or upload a photo → approve/modify the Nano Banana Pro preview → Genie 3 generates the world in real time,” as explained in the Launch thread and reiterated in the Pipeline recap.
• Distribution surface: Signup/entry point is centralized on the Labs site, linked in the Project page.
The visible constraint in demos is a short session window (often described as about a minute), so it currently behaves like an interactive shot generator rather than a persistent sandbox.
Genie 3 passes basic “no clipping” collision probes
Genie 3 (Google DeepMind): Early testers are treating “can I walk through solid objects?” as the first reliability check, and Genie 3 is showing collision-style behavior instead of classic video-model hallucination—running into a car blocks movement, and closed doors block traversal per the Physics examples and the follow-up Collision details. This matters for pre-vis because it’s the difference between a moving wallpaper and a space you can actually stage action in.

The current signal is narrow (two simple tests), but it’s still useful: it gives creators a repeatable sanity check to run on any new prompt/world before investing in “directed” takes.
Genie 3 shows short-term scene memory when you turn away
Genie 3 (Google DeepMind): A standout hands-on claim is that Genie 3 can preserve elements across brief occlusion—turn away, then come back, and the scene often still “makes sense,” including recurring characters and some environment continuity as shown in the Memory demo. This matters for storytellers because continuity across camera moves is the core pain point of video generation.

The tester framing is cautious—“not perfect,” but better than typical out-of-sight resets. That nuance is important, and it aligns with the broader “world model” direction discussed in the World model approaches thread.
Project Genie adds a directing layer: WASD navigation plus camera control
Project Genie (Google DeepMind): Users can “direct” inside a generated world using WASD for movement and arrow keys for camera angle changes, with responsiveness highlighted in the Control demo; the workflow of “approve the preview → explore the world” is summarized in the How it works. This matters because it turns prompt-to-video into prompt-to-blocking: you can get multiple angles on the same moment rather than regenerating from scratch.

The demos still look like early tech (motion and stability vary), but the control surface is already legible as a filmmaking primitive.
Genie 3 is being used for bodycam-style pre-vis tests
Genie 3 (Google DeepMind): One emerging use is “found footage” realism probes—ChrisFirst shows a “body camera footage” world prompt (trailer park, daytime) paired with a “police officer holding a taser” character prompt in the Bodycam demo, with the exact prompt text reiterated in the Prompt text. This matters for creators because bodycam/FPV language is a fast way to reveal whether motion, exposure shifts, and micro-jitter feel plausible.

The author notes it’s close but still needs “more motion,” which is consistent with where current world models tend to break: locomotion cadence and camera inertia.
Fire and disaster scenes are being used to probe world dynamics
Project Genie (Google DeepMind): Beyond walking tours, creators are testing “system” scenes like fire spread and response. ChrisFirst posts “fighting fires” inside Project Genie in the Firefighting clip, which matters because fire, smoke, and propagation are a fast way to reveal whether the environment behaves consistently under change.

There’s not enough evidence here to say Genie 3 is doing real physics simulation, but “dynamic hazard” prompts are already becoming a repeatable probe category—similar to mirrors for visuals, but for state changes over time.
Mirror hallways are becoming a stress test for Genie 3 stability
Genie 3 (Google DeepMind): A classic fidelity trap—mirrors—gets used as a deliberate break test. ChrisFirst runs a “hallway of mirrors” scenario and notes two things in the Mirror hallway test: reflections can “slip up” the model, and continuous backward walking doesn’t reliably trigger a new obstacle to stop you.

For creators, this is less about “gotchas” and more about knowing which set designs will sabotage continuity. Mirrors, glass, and recursive reflections remain a high-risk art direction choice in early world models.
Project Genie driving demos collide with early capacity limits
Project Genie (Google DeepMind): Vehicle-like sequences are appearing quickly—ChrisFirst posts a snowy driving set-piece in Sapporo inside Genie in the Sapporo driving demo, including a minor crash at the end as noted in the Fender bender note. Shortly after, the same creator reports generation being blocked by a capacity warning (“more requests than usual”), shown in the Rate limit screenshot.

This matters operationally: when world models are interactive, demand spikes can halt production mid-iteration, so early adopters may need to plan around availability windows and retries.
“Advancing Open-source World Models” circulates as context for the Genie moment
Open-source world models (Research context): A reference point for the broader ecosystem shows up via a share titled “Advancing Open-source World Models,” posted with a paper link and a supporting video in the Paper share. This matters because creators are starting to compare products like Genie against open research baselines, not just against other video generators.

The tweet itself doesn’t include conclusions or metrics, but the circulation is a signal: “world models” are now being treated as a coherent category with shared expectations (interactivity, memory, controllability) rather than a novelty demo.
LingBot-World is marketed as “doesn’t forget offscreen”
LingBot-World (Ant Group): Alongside the Genie hype, open-source world models are being positioned on a specific differentiator: offscreen memory and cause/effect continuity. Posts claim typical world models “forget things as soon as they’re out of sight,” but LingBot-World keeps coherence after you leave and return, as stated in the Offscreen memory claim and echoed by an open-source unveiling mention in the Unveiled note. This matters for interactive storytelling because persistence is the feature that makes “world” different from “video.”
No concrete eval artifact or demo clip appears in these tweets, so treat it as directional marketing signal rather than verified performance.
🎬 AI video tools beyond world models: Grok Imagine edits, Runway Gen‑4.5 I2V, Kling FPV, Luma 1080p
Covers video generation/editing workflows and capability demos that are not about world models (excludes Project Genie/Genie 3, covered in the feature). The feed is heavy on Grok Imagine micro-scene control, plus Runway/Luma/Kling experimentation.
Grok Imagine multi-shot prompting uses [cut] as a shot delimiter
Grok Imagine (xAI): A simple prompting pattern is circulating for packing multiple beats into one generation: insert [cut] tokens to force hard scene changes while staying in a single request, as demonstrated in the three-shot prompt example.

This matters because it turns Grok into a rough shotlist tool: you can specify beat A → beat B → beat C in one line (e.g., “banana drinking coffee [cut] reading a book [cut] singing”), then use the output as an animatic or as selects for a longer edit.
Kling 2.6 FPV prompts are being shared as reusable camera recipes
Kling 2.6 (Kling): A creator shared multiple “FPV camera” recipes as ready-to-reuse prompts—spiral descents around buildings, interior punch-throughs, speed ramps, and exposure shifts—framed as a repeatable way to direct motion, per the FPV thread intro.

One standout structure is: establish a huge landmark → spiral down with near-misses → break into interior corridors → burst back outside. The prompts are long on purpose (they specify camera path and transitions), and the post indicates eight examples are included in the thread, as described in the FPV thread intro.
Luma Ray 3.14 Image-to-Video now outputs native 1080p
Ray 3.14 Image-to-Video (Luma): Luma says Ray 3.14 can now transform still images into video in native 1080p, positioning the upgrade around depth/parallax and clearer scene structure, as shown in the 1080p demo.

The pitch is about holding up detail during camera moves (edges, textures) rather than only generating a good first frame. No pricing or limits are stated in the tweet, so treat output cost/throughput as unknown from today’s posts.
10-second Grok Imagine clips are being used for dialogue beats
Grok Imagine (xAI): Multiple posts converge on 10 seconds as a workable duration for short dialogue exchanges and micro-scenes, with one creator calling out that “dialogues at 10 seconds” are notably strong in the dialogue clip.

A separate performance claim highlights fast generation at 10s and 15s clip lengths when using the API, as described in the API speed note. Taken together, the emergent workflow is “write a line, generate a beat, stitch beats into a scene” rather than trying to one-shot an entire sequence.
Grok Imagine continuity via a single image anchor across multiple gens
Grok Imagine (xAI): Creators are leaning on a continuity method where one reference image becomes the “identity anchor,” then you run multiple prompt-driven generations off it to keep character/style consistent, as shown in the image-anchored sequence demo.

The key detail is that it’s not one long clip; it’s a chain of shorter clips stitched in edit. The method pairs well with multi-shot prompting (using [cut]) when you want continuity plus explicit beat control.
Runway Gen-4.5 shows a photo-to-motion ‘Day at the Museum’ flow
Gen-4.5 Image to Video (Runway): Runway is pushing a straightforward creative loop—start from a real photo, then prompt for camera motion and scene action—to get cinematic movement out of stills, as shown in the museum example.

The clip reads as a practical storyboard starter: you can capture reference lighting/composition with a phone, then iterate on motion directions (push-ins, pans, subject animation) without rebuilding the scene from scratch.
A Runway-generated internal-body camera move gets treated as an editable beat
Runway (Runway): A director shares a generated VFX-style shot where the camera drops, tracks into a woman’s torso, and transitions to a visible beating heart—then notes it was cut from the final short due to pacing/tone mismatch, as described in the heart shot breakdown.

The useful creative signal is editorial: even when a model hits a hard prompt (complex camera travel), the shot still behaves like a normal piece of coverage—something you can keep, trim, or drop depending on the sequence rhythm.
Krea Realtime Edit beta shows fast restyling with texture prompts
Realtime Edit (Krea): A creator reports getting beta access and shows a simple lookdev loop: feed in existing artwork, then drive restyling with a texture/style prompt (e.g., “grunge texture”), iterating quickly to new themes (e.g., “cyberpunk city”), as shown in the realtime restyle clip.

A separate thread frames the next step as applying Realtime Edit to audio-reactive visuals for live restyling, per the audio-reactive goal.
Midjourney Video is being used for abstract motion stress tests
Midjourney Video (Midjourney): Early experiments are showing up as “aesthetic probes”—abstract morphing materials, specular highlights, and fast texture transitions—rather than narrative shots, as shown in the abstract morph clip.

This is a common early-stage behavior for new video models: creators use non-narrative motion to judge temporal stability, aliasing, and how the model handles rapidly changing surfaces before committing to character or story work.
One-shot script-to-animation promo clips are being pitched as ad production
One-shot AI promo generation: A creator claims a “one click” flow that goes script → animation in a single shot, highlighting that camera push-ins and a final logo reveal landed on the same beats as the character actions in the one-shot promo clip.

The post frames this as a response to CapCut subscription pricing and hints at an ad pipeline where the “edit” is mostly prompt iteration rather than cutting multiple clips, as described in the one-shot promo clip.
🧩 Copy/paste prompts & style references (Nano Banana, Midjourney srefs, spec-sheet JSON, cyanotype, FPV recipes)
High-density prompt sharing day: Nano Banana Pro templates for product ads and stylized looks, plus Midjourney style references and structured JSON prompt formats. Excludes tool capability announcements (covered in other categories).
Nano Banana Pro “Submerged Product Effect” prompt enforces true submersion (no platforms)
Nano Banana Pro (Prompt template): A long JSON prompt called Submerged_Product_Effect v10.0_FINAL is designed to stop the common failure where models “rest” products on a surface; it repeatedly specifies the item must be inside an opaque gradient liquid with one end sinking much deeper than the other, as written in the full JSON prompt and illustrated in the example grid. A structural sketch follows the bullets below.
• Key constraint that makes it work: “CRITICAL ASYMMETRIC SUBMERSION” + “NO platform/surface/floor” are repeated with extensive negatives to prevent the default tabletop look, per the full JSON prompt.
• Color discipline for ad work: The template forces a two-color gradient world (liquid = background), which is useful when you need art-directable brand palettes without stray hues, as specified in the full JSON prompt.
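To make the structure concrete, here is a minimal sketch of how a constraint-first template like this tends to be organized; the field names and wording are illustrative, not the shared v10.0 template itself.

```python
# Minimal sketch of a constraint-first image-prompt payload.
# Field names and wording are illustrative, not the shared
# Submerged_Product_Effect v10.0_FINAL template.
import json

submerged_product_prompt = {
    "scene": {
        "subject": "PRODUCT_DESCRIPTION_HERE",        # swap in the product
        "liquid": "opaque two-color gradient fluid",   # liquid doubles as the background
        "submersion": "CRITICAL ASYMMETRIC SUBMERSION: one end sinks much deeper than the other",
    },
    "palette": {
        "colors": ["BRAND_COLOR_1", "BRAND_COLOR_2"],  # two-color gradient world
        "rule": "no hues outside the gradient",
    },
    "negative": [                                      # repeated negatives fight the default tabletop look
        "NO platform", "NO surface", "NO floor",
        "NO table", "NO pedestal", "NO resting on anything",
    ],
}

print(json.dumps(submerged_product_prompt, indent=2))  # paste the JSON into the image tool
```

The point is less the exact keys than the pattern: state the positive constraint once, then repeat the negatives aggressively so the model cannot fall back to its default staging.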
“Accuracy-first” comparison-chart prompt bans guessing and forces official-source specs
Structured prompting (Spec-sheet workflow): A long-form JSON spec for a “Universal Product Comparison Chart (Accuracy-First)” is being shared as a way to generate clean side-by-side comparison tables while explicitly forbidding hallucinated specs and enforcing “Not officially available / Not confirmed” fallbacks, as shown in the full prompt spec and the output example.
• Source governance baked in: The prompt restricts sources to manufacturer docs/press releases and disallows blogs/reviews/forums, per the full prompt spec.
• Design intent: It pairs a neutral studio product visual with a fact-checked table in a consistent 2-column grid—handy for pitch decks and “explainer” creatives where credibility matters, as demonstrated in the full prompt spec.
Nano Banana Pro cyanotype blueprint prompt locks Prussian-blue realism (no warm tones)
Nano Banana Pro (Free-form prompt): A detailed “1842 blueprint” directive is circulating that forces a cyanotype chemical-print look—deep Prussian blues only, coarse watercolor paper texture, and messy emulsion borders—aimed at making outputs feel like scanned physical artifacts rather than clean digital filters, as shown in the cyanotype prompt examples.
• Palette lock: It explicitly bans sepia/greys/any warm tones and insists on Prussian Blue/Indigo/Cyan midtones with blown-out white highlights, per the cyanotype prompt.
• “Physical object” cues: The prompt calls for brush strokes at the borders, uneven coating density, light leaks, dust specks, and deckled edges—details that often sell authenticity in poster/prop workflows, as described in the cyanotype prompt.
Midjourney --sref 1061349747 targets cinematic realistic anime storyboards
Midjourney (Style reference): A “cinematic realistic anime” style code—--sref 1061349747—is being recommended as a middle ground that’s not Ghibli, shonen, cartoon, or photoreal; the share name-checks influences like Cowboy Bebop, Trigun, Vampire Hunter D, Batman TAS, and Berserk for storyboard/opening-sequence framing, as described in the style reference post.
The examples are presented as suitable for action storyboards and animated-short keyframes rather than single hero art, per the style reference post.
Midjourney --sref 2487252379 for character design sheets (anime-influenced Western)
Midjourney (Style reference): A character-design-sheet look is being passed around as a single style reference code—--sref 2487252379—positioned as “don’t specify anything but the character,” with examples showing multi-pose turnarounds and expression studies in a consistent mixed anime/Western illustration finish, as shared in the style reference tip.
The share is explicitly framed as a reusable sheet aesthetic (poses + heads) rather than a one-off illustration style, per the style reference tip.
Nano Banana Pro prompt yields glossy “flag product” 3D ice cream renders
Nano Banana Pro (Prompt example): A copy-paste prompt is being used to generate hyper-realistic vertical 3D “ice cream bar on a stick” concepts with glossy glaze, gold logo patterns, and engraved stick ornamentation—effectively a branded product mockup format—with multiple country variants shown in the image set and a rotating render in the image set.

The prompt structure is notable for consistently asking for national color signatures + gold pattern drips + engraved cultural motifs, per the image set.
Midjourney --sref 1470170 pushes “flawed” crayon texture over 8K polish
Midjourney (Style reference): A style-ref drop argues against “8K perfection” and recommends --sref 1470170 to inject raw crayon texture—jittery edges and naive-art cues—specifically to avoid the “cold AI gloss,” as framed in the style-ref claim and expanded via the breakdown link.
This one is presented as brand-kit/storybook friendly rather than cinematic realism, per the style-ref claim.
Midjourney --sref 20240916 is pitched as a fast “cinematic cyberpunk” shortcut
Midjourney (Style reference): A “cinematic cyberpunk” shortcut is being marketed via --sref 20240916, described as vaporwave-pop with a commercial finish, as claimed in the cyberpunk sref post and linked out in the full breakdown.
No canonical prompt/output set is embedded in the tweet itself; what’s concrete here is the code and the intended aesthetic positioning, per the cyberpunk sref post.
Midjourney --sref 3091309576 aims for Wong Kar-wai-style motion blur
Midjourney (Style reference): A prompt recipe is being shared for “raw” cinematic motion blur—[Prompt] --sref 3091309576 --v 6—explicitly positioned as warm, dreamy, and imperfect (the opposite of hyper-sharp renders), with visual examples collected in the motion blur collage and a longer explainer linked in the style breakdown.
The examples emphasize blur streaks, warm lighting, and smear during motion rather than crisp edges, as shown in the motion blur collage.
Firefly prompt recipe: cinematic first-person still with Portra and camera settings
Adobe Firefly (Prompt snippet): A compact Firefly prompt formula is being shared for “cinematic still first person POV” images; it explicitly includes film emulation (“Kodak Portra”), shutter speed (“1/125s”), depth of field, film grain, and “shot on 35mm,” as written in the prompt line.
It reads like a reusable “camera metadata” block you can swap the subject into, per the prompt line.
🖼️ Image creation & editing: Nano Banana Pro inpainting, FLUX typography speed-up, Firefly/Photoshop maker loops
Image-side capability posts: Nano Banana Pro editing (especially selection-based inpainting), and model/perf updates like FLUX.2 [flex] speeding up for design/typography. Excludes raw prompt dumps (those are in Prompts & Style Drops).
Nano Banana Pro in Freepik advertises unlimited, selection-only inpainting
Nano Banana Pro (Freepik): Freepik is positioning Nano Banana Pro as an “unlimited” inpainting tool where only the masked/selected area changes and the rest of the image stays consistent, as shown in the inpainting demo clip; the framing is explicitly “Only the selected area changes. Nothing else,” including for photos, illustrations, and AI images per the inpainting demo post.

The same idea is being echoed as a Freepik workflow shift—“controllable inpainting … supported by Nano Banana Pro” per the workflow note mention—suggesting the product pitch is less about new styles and more about predictable, localized edits for creatives.
FLUX.2 [flex] is being used for rapid typography poster iteration
FLUX.2 [flex] (Black Forest Labs): After the “up to 3× faster” claim, creators are showcasing the practical payoff as rapid iteration loops for typography-heavy design—poster grids with varied, legible type treatments and brand-style experiments, as shown in the poster grid example share.
Rather than focusing on benchmarking, the examples emphasize a “generate many options, pick a direction” workflow; the original speed and typography focus are reiterated in the speed claim post, with the visible output quality illustrated by the poster grid example images.
Firefly Boards is being used as an image-first storyboard workspace
Adobe Firefly Boards (Adobe): A repeatable workflow is being demoed where a single illustration (or reference image) becomes the anchor asset inside Firefly Boards, then gets expanded into a multi-panel grid as a lightweight storyboard before generating motion beats, as described in the workflow overview thread.

The core move is treating Boards as the “scene planner” rather than a final generator: start from one image, build a simple grid, then generate variations and assemble—steps outlined across the workflow overview and multi-shot step posts.
Krea Realtime Edit is being used for fast “style auditioning” passes
Krea Realtime Edit (Krea): Creators are using Realtime Edit as a rapid “style auditioning” tool—pushing the same likeness/art through many distinct cartoon looks in one session; one example reports testing 15 recognizable cartoon/anime style targets, with “Samurai Jack” called out as a favorite in the style test reel post.

A separate early-use pattern is combining existing artwork (e.g., Niji) with texture prompts to restyle quickly, as shown in the texture restyle demo clip.
Angles v2 turns a single image into a 360° camera exploration asset
Angles v2: The tool is being framed as “take the image first, then move the camera after”—enabling full 360° camera movement around a subject, including behind it, while the lighting adapts to the new angle, per the angles v2 demo explanation.

The claimed creative payoff is reusing one still as a multi-angle source for product, portrait, and architecture coverage, with the “lighting adjusts automatically” behavior emphasized in the angles v2 demo clip and description.
Seedream 4.5 artifact tests show how “low-res snapshot” prompts can break realism
Seedream 4.5: A creator is documenting “interesting errors” by prompting for a deliberately mediocre, noisy phone-snapshot aesthetic—and getting outputs that collapse into heavy pixelation/uncanny facial artifacts, as shown in the error example post.
The value here is diagnostic: the prompt intentionally mixes “Instagram style” language with “low pixel, high noise, film coarse-grain,” and the result illustrates how quickly realism can degrade when models over-index on “bad camera” cues, per the error example note.
Adobe Firefly “Tap the Post” assets evolve into glitch-type template packs
Adobe Firefly (Adobe): The “Tap the Post” loop continues mutating into reusable, multi-frame template packs—glitchy, cropped, high-contrast type and visual fragments meant to be posted as swipeable sets—shown in the glitch template frames image set.
The posts emphasize repeatability over one-off art: “Tap the Post … Made in Adobe Firefly” appears as the consistent label in the template post and glitch template frames drops, suggesting creators are standardizing meme-able asset formats, not just making single images.
🧍 Consistency & control systems: camera-as-parameter, reference-driven video continuity, multi-shot coherence
Posts focused on keeping identity/continuity stable—either via reference-first video iteration or by turning camera placement into an editable parameter. Excludes full world-model discussion (feature) and general video capability chatter.
Angles v2 makes camera placement an editable parameter after image generation
Angles v2: Following up on Angles v2, creators are emphasizing a very specific control loop—generate the image first, then orbit the camera a full 360° and let lighting adapt to the new angle, before committing to a render, as shown in Camera-after-image demo and reiterated in 360° movement demo.

• What it unlocks for continuity: One “hero” image can become multiple consistent angles (front, 3/4, profile, over-shoulder) without re-prompting the subject, which is the kind of coverage that normally breaks identity across generations—see the framing in Lighting adapts per angle and the reuse cases in Reuse constraints list.
Grok Imagine video editing model: targeted edits while keeping the rest consistent
Grok Imagine Video (xAI): A “video editing” mode is being tested in the wild with the key claim that it segments only what you ask to change while preserving the rest of the clip’s continuity, with side-by-side before/after examples in Edit model examples.

• Where it’s being run: One practical path is via fal’s hosted endpoint, as noted in fal hosting note and detailed on the Edit endpoint page (priced per seconds of in/out video; the page example quotes $0.36 for a 6-second job).
The pitch here is less “new footage” and more “surgical continuity edits,” which is a different control problem than text-to-video generation—see Edit model examples.
Grok Imagine multi-shot control via [cut] delimiter inside one prompt
Grok Imagine: A simple delimiter is being passed around as a practical way to force beat-to-beat sequencing—add [cut] between clauses so one 10-second generation contains multiple discrete shots with clearer transitions, as demonstrated in the example prompt shared in Cut token example.

The visible win for storytellers is less “mushy” continuity between actions (drink → read → sing) because the prompt encodes shot boundaries rather than relying on the model to infer edits from prose, as shown in Cut token example.
Grok Imagine: reuse one image across multiple generations to keep a consistent thread
Grok Imagine: A continuity recipe is being described as “single image + multiple prompts + multiple generations” to keep identity stable while iterating motion/style, according to Single image continuity claim.

This is closer to reference-driven lookdev than pure text-to-video: the anchor frame carries character/wardrobe/composition, while prompts steer the next shot or variation, as shown in Single image continuity claim.
“A trailer isn’t an episode”: creators call out episode-level continuity as the real bar
Episode coherence vs trailer polish: A creator is drawing a hard line that “anyone can make a trailer with AI,” but the meaningful test is producing a cohesive full episode (consistent story logic, pacing, and recurring character continuity), as argued in Trailer vs episode critique.
This frames continuity as an end-to-end production constraint (writing + blocking + edit decisions), not a single model capability, per Trailer vs episode critique.
🛠️ Systems that ship: automated content factories, context that compounds, and agent-driven ops
Multi-step, production-minded workflows: automated podcast/news systems, persistent context stacks (SQLite+markdown), and agent-based operations. Excludes coding-assistant product comparisons (covered in Coding Agents & Dev Tools).
Automated hyper-local news podcast pipeline claims $100/mo all-in cost
Automated local news podcast (Workflow): A creator described a daily, hands-free pipeline that scrapes Google News + local event feeds + social platforms, generates “professional scripts with emotion and pacing controls,” then produces audio with ElevenLabs V3—framed as replacing a $50k/year show with “$100/mo” in APIs, per the Pipeline breakdown.

• What’s concrete here: The setup is positioned as “completely hands-free once deployed,” with multi-source ingestion → script generation → TTS output as the repeatable core loop, as stated in the Pipeline breakdown (a skeleton of this loop is sketched after these bullets).
• Business framing: It’s pitched as making “news deserts” economically addressable by scaling the same workflow across many towns, per the Pipeline breakdown.
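The posts don’t include code, so the following is only a skeleton of the described loop (scrape → script → synthesize), with the ingestion and scripting stages stubbed out; the ElevenLabs call follows the public text-to-speech REST path, and the voice and model IDs are placeholders.

```python
# Skeleton of the described scrape -> script -> TTS loop (not the creator's code).
# fetch_headlines() and write_script() are stand-in stubs; the ElevenLabs request
# uses the public text-to-speech REST path, and the IDs below are placeholders.
import requests

ELEVENLABS_API_KEY = "YOUR_KEY"
VOICE_ID = "YOUR_VOICE_ID"

def fetch_headlines(town: str) -> list[str]:
    # Stub: the described system aggregates Google News, local feeds, and social posts here.
    return [f"{town}: council approves new park funding", f"{town}: downtown road closure on Friday"]

def write_script(headlines: list[str]) -> str:
    # Stub: the described system uses an LLM with "emotion and pacing controls" at this stage.
    return "Good morning, here is your local briefing. " + " ".join(headlines)

def synthesize(script: str) -> bytes:
    # Standard ElevenLabs text-to-speech endpoint; the model_id value is an assumption.
    url = f"https://api.elevenlabs.io/v1/text-to-speech/{VOICE_ID}"
    resp = requests.post(
        url,
        headers={"xi-api-key": ELEVENLABS_API_KEY},
        json={"text": script, "model_id": "eleven_v3"},
    )
    resp.raise_for_status()
    return resp.content  # audio bytes

if __name__ == "__main__":
    audio = synthesize(write_script(fetch_headlines("Springfield")))
    with open("daily_episode.mp3", "wb") as f:
        f.write(audio)
```

Everything interesting in the claimed system (source quality, script pacing, scheduling) lives inside the two stubs; the TTS call itself is the cheap part.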
Clawdbot-style ops: route customer requests from anywhere into Discord threads
Clawdbot (Ops pattern): A maker claims they “taught” their Clawdbot to handle customer requests regardless of where they arrive, with everything “managed nicely via Discord,” effectively using Discord threads/channels as the central support inbox, per the Support routing claim.

The tweets don’t include the implementation details, but the operating model is explicit: multi-channel intake → single routed workspace in Discord, as described in the Support routing claim and reinforced by the Follow-up clip.
Floatprompt workflow: SQLite “brain” + git-aware session boot for compounding context
Floatprompt (Workflow pattern): A “continuous compounding context” setup combines a local SQLite database (float.db) + a .float/ file tree + git-aware session startup and logging commands (/float, /float-log) so each AI session resumes where the last left off, as detailed in the System description.
• Mechanics: /float reads recent commits + context files; /float-log commits changes and enriches the DB with transcripts/decisions and a “buoy” table for cross-session carry, according to the System description (a minimal sketch of the boot step follows below).
• Design principle: The author explicitly prefers “retrieval-led reasoning over pre-training-led reasoning,” treating the DB as the long-term memory substrate, per the System description and Context building claim.
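The implementation isn’t public in these posts, so the sketch below only mirrors the described boot step: surface recent git history, pull in the .float/ context tree, and record the session in float.db. The file layout, table schema, and names are guesses.

```python
# Sketch of a /float-style session boot (not the author's code): read recent git
# history plus .float/ markdown context, then log the session into float.db.
# Paths, table name, and columns are assumptions based on the description.
import sqlite3
import subprocess
from datetime import datetime, timezone
from pathlib import Path

def recent_commits(n: int = 5) -> str:
    out = subprocess.run(
        ["git", "log", "--oneline", f"-{n}"],
        capture_output=True, text=True, check=True,
    )
    return out.stdout

def load_context(root: Path = Path(".float")) -> str:
    # Concatenate the markdown context tree so a new session starts with prior decisions.
    return "\n\n".join(p.read_text() for p in sorted(root.rglob("*.md")))

def log_session(db_path: str = "float.db") -> None:
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sessions (started_at TEXT, commits TEXT, context_chars INTEGER)"
    )
    conn.execute(
        "INSERT INTO sessions VALUES (?, ?, ?)",
        (datetime.now(timezone.utc).isoformat(), recent_commits(), len(load_context())),
    )
    conn.commit()
    conn.close()

if __name__ == "__main__":
    print(recent_commits())   # what /float would surface at the top of a session
    log_session()             # what /float-log would append at the end
```

The design choice worth copying is the split: git carries the code history, markdown carries the reasoning, and SQLite carries the cross-session index that ties them together.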
Linah AI workflow claims 400+ UGC ads per brand using Veo 3
Linah AI + Veo 3 (Ad factory workflow): A creator claims they built a system generating “400+ UGC ads per brand” by combining Veo 3 with Linah AI, emphasizing persona/angle variation and high posting volume as the driver—“50–200 creatives per week”—per the UGC ad factory thread.

• Output structure: The described variants include unboxings, problem–solution sequences, testimonials, lifestyle POV, and offer-focused edits, framed as “hundreds of variations” from one product, per the UGC ad factory thread.
• Operating thesis: The pitch is that the bottleneck has shifted from targeting to creative throughput (“volume isn’t optional anymore”), as stated in the UGC ad factory thread.
Prompt-injection hardening claim for message/email-driven bots
Prompt injection hardening (Ops pattern): One builder claims they found a way to make prompt injection “via messages/email virtually impossible” in their bot workflow, but says the technique is only shared privately and provides no method or test evidence in public, per the Hardening claim.
This is a security-relevant signal for anyone running “agent-in-the-inbox” systems, but it’s currently unverifiable from the available material in the Hardening claim.
SciSpace Agent adds library search, Zotero/Drive sync, report sub-agent, and save-to-notebook
SciSpace Agent (Update): A creator testing SciSpace describes four workflow updates aimed at making research work less chat-fragmented: searchable private PDF library, Zotero + Google Drive ingestion, an auto-triggered report-writing sub-agent, and saving outputs to notebooks as markdown with citations, per the Update overview and Feature list recap.

• Ingestion change: Zotero integration is positioned as “zero manual uploads,” with direct pull + analysis, as shown in the Zotero integration clip.
• Persistence change: “Save to Notebook” is framed as keeping work from vanishing in chat history by storing .md plus citations, per the Notebook save clip.
• Promo detail: A discount code is promoted (40% annual; 20% monthly), as stated in the Feature list recap.
Selfie-to-closet automation: clothing item extraction and tracking pitched as app idea
Clothing extraction app (Workflow pattern): A “take a selfie or upload a pic” flow is shown breaking down clothing items and classifying them into a closet-style tracker, framed as a niche but monetizable automation (“extract this into an app and print $”), per the Closet organizer screenshot.
The visible UI suggests stateful inventory (closet, worn, laundry, drying), implying the core system is not just recognition but ongoing item lifecycle tracking, as shown in the Closet organizer screenshot.
“In 2026, build systems” framing: assistant vs agent vs managed SuperAgent
Agent org design (Framing): A simple mental model is circulating—“AI assistant = tool,” “AI agent = employee,” “SuperAgent = managed employee,” with “building systems” positioned as the path to delegating large fractions of work to machines, per the Assistant-agent-superagent framing.
It’s not a product launch; it’s a vocabulary shift that maps directly onto how people describe multi-agent ops stacks in the wild (routing, supervision, and handoff), as stated in the Assistant-agent-superagent framing.
🧊 3D & animation pipelines: text→3D assets, character conversion, and game-ready workflows
3D asset creation and animation workflows are active today: text-to-3D tools, production pipelines (Meshy→ZBrush→Substance), and platform hosting for 3D generation. Excludes pure 2D image prompting.
Meshy shows a full text-to-3D sword workflow with ZBrush + Substance Painter finishing
Meshy (MeshyAI): A practical end-to-end asset workflow is being demonstrated: start from a text prompt in Meshy, generate a stylized sword, then push it through ZBrush for sculpt cleanup and Substance Painter (SP) for materials/texture polish, as shown in the pipeline tutorial clip. It’s positioned explicitly as “game-ready,” i.e., a generation-first step that still assumes traditional finishing.

• Pipeline shape: Text prompt → Meshy base mesh → ZBrush sculpt/refine → Substance Painter textures, per the pipeline tutorial clip.
The tweet doesn’t specify poly/UV/export settings, so treat it as a pipeline pattern rather than a repeatable spec sheet.
Hunyuan 3D 3.1 Pro and Rapid land on fal with fidelity vs speed options
Hunyuan 3D 3.1 (fal): fal is announcing hosted access to Hunyuan 3D 3.1 in two variants—Pro (framed as high-fidelity image-to-3D and text-to-3D) and Rapid (framed as speed-optimized)—as stated in the fal availability post. This matters for creators because it turns “try this 3D model” into “slot it into a pipeline” without local setup details in the tweet.
No pricing, export formats, or topology notes are included in the shared post, so real production fit (riggability, UVs, retopo needs) remains unverified from today’s tweets.
Nano Banana Pro is being used for fast 3D lookdev on characters and product renders
Nano Banana Pro: Creators are posting Nano Banana outputs that look like “instant 3D” lookdev—both character-turnarounds and product-style renders—suggesting it’s being treated as a quick 3D visualization step before animation or further tooling.

• Character example: A detailed TMNT-style humanoid turtle render (goggles, gear, hoverboard) is shared as “another turtle made on Nano Banana pro,” per the character render post.
• Product example prompt: A “Gemini Nano Banana pro 3.0” post includes a copy-paste prompt pattern for “hyper-realistic 3D ice cream on a stick” with country colorways/logos, shown with multiple outputs in the ice cream prompt and results.
• Workspace context: Another creator notes “character design made with Niji 7 and Nano Banana pro on Freepik spaces,” per the workspace screenshot, which frames Nano Banana as part of a mixed-tool design board rather than a standalone generator.
Anima Labs keeps iterating a 2D→3D→animation character pipeline with Kling 2.5
2D→3D→animation character pipeline: Anima Labs shares another concrete “character to animated clip” stack—Midjourney + Nano Banana Pro + Kling 2.5 + Suno—this time centered on a specific persona (“bus driver… nostalgic for the 1980s”), as described in the character concept post.

They also call out their older multi-tool recipe—“Midjourney (2D)… Nano Banana 2 (3D)… Kling 2.5 (Animation)… Topaz (Upscale)… Suno (Music)”—as a repeatable process pattern in the older fusion pipeline example, reinforcing that Nano Banana is functioning as the “2D to 3D look” hinge before motion.
Node Spawner adds a radial menu approach to node creation for graph workflows
Node Spawner (CreativeDash): A radial “node spawner” UI is shown for creating nodes across many categories—click to open, then scroll/swipe to cycle—aimed at speeding up node-graph construction, as demonstrated in the radial menu demo.

This is a small UX change, but it targets a real friction point in ComfyUI-like graph work: reducing time spent hunting node types in menus.
🧪 Finishing passes: upscaling, restoration, and making motion hold up
Finishing and enhancement tools—especially upscaling and detail recovery—show up as creators chase more ‘native’ looking footage. Excludes base video generation itself (handled in Video Filmmaking).
Topaz teases a new Astra upscaler, with “native-looking” textures as the goal
Topaz Astra (Topaz Labs): A creator testing a “new upscale model” reports cleaner edges and more realistic textures on difficult footage, and says the model is “releasing very soon” on Astra, per the hands-on note in Beta upscale impressions.

The broader Topaz Studio positioning frames Astra as a creative video-upscaling surface, alongside other AI enhancement apps (notably “Bloom” up to 8×), as described on the product page linked in Topaz Studio page.
Runway insert-shot prompting: “camera into the body” as a usable VFX beat
Runway (RunwayML): A director shares a prompt-delivered VFX insert—“camera drops… travels into her body… see her fast beating heart”—and frames it as a ~5s shot that worked as requested but still got cut for pacing/tone (“kill your darlings”), per the workflow note in Prompted internal-heart shot.

This is a clean example of generative video being treated like a finishing/VFX option: a specific insert you’d typically plan as post, now coming from prompt iteration, as described in Prompted internal-heart shot.
CapCut price friction shows up as creators test cheaper “editing paths”
CapCut pricing + AI promo workaround: One creator flags CapCut’s “official renewal” at $19/mo and switches to a $7.49/mo plan via a third-party subscription reseller, then tests an AI-generated, one-shot promo video where the script-to-animation timing is meant to land the logo reveal on a precise beat, according to CapCut renewal workaround.

The post’s core claim is less about a new model and more about a finishing constraint: editing subscriptions + turnaround time are becoming part of the creative stack’s bottleneck, as framed in CapCut renewal workaround.
Topaz as the “last mile” in multi-tool character video pipelines
Topaz (finishing step): Creators keep listing Topaz as the final polish stage after image/character generation + animation—e.g., a stack of Midjourney + Nano Banana + Kling + Topaz + Suno is explicitly called out in Tool stack with Topaz, with the output shown at 4K-scale in the accompanying clip.

The same dynamic shows up in the new Astra-model discussion—texture integrity and avoiding an “over-processed” look are the stated goals, as described in Beta upscale impressions.
💻 Coding agents & maker tooling: Claude Code, Codex loops, visual coding, and browser-based AI builders
Distinct coding/dev-tool thread: creators discussing Claude Code/Cursor/Codex workflows, code review loops, and ‘visual coding’ agents that turn screenshots into deployable sites. Excludes non-coding creative workflows.
A Codex↔Claude code-review loop is emerging as a reliability hack
Codex + Claude Code (Workflow): One builder describes a simple “two-model review loop”: run Claude Code to generate/modify code, have Codex review and improve it, then ask Claude whether the Codex version is better—“usually is,” according to the Cross-model review loop. A related sentiment is that Codex’s tone/guardrails feel “firm… professional,” per the Codex firmness note.
This is effectively using model disagreement as a fast quality filter when you don’t have time to do a full human review.
X’s algorithm reportedly scores “profile click” and “follow” separately
X algorithm reverse-engineering (Workflow): A thread claims X models two distinct actions—“will they click the profile?” and “will they follow?”—citing internal names like profile_click_score and follow_author_score in the Two-stage prediction claim. It further claims both signals appear as separate weighted lines in scoring, as described in the Scoring formula note, and that the model is trained on sequences like “saw → clicked → followed” vs “saw → clicked → didn’t,” per the Training sequence breakdown.
• Funnel framing: The practical interpretation is “curiosity about you” and “commitment to you” are different levers, which the thread summarizes directly in the Curiosity vs commitment line.
This is a creator-growth tactic rather than a model/tool release, but it’s actionable for anyone shipping AI work on X because it treats posts and profile as two separately-optimized surfaces.
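None of the weights are public, so the snippet below only illustrates the thread’s claim that the two predictions enter ranking as separate weighted terms; every number in it is invented.

```python
# Purely illustrative of the thread's claim: profile_click_score and
# follow_author_score contribute as separate weighted terms. All weights
# and values here are invented, not X's actual parameters.
def rank_contribution(profile_click_score: float, follow_author_score: float,
                      w_click: float = 1.0, w_follow: float = 4.0) -> float:
    return w_click * profile_click_score + w_follow * follow_author_score

print(rank_contribution(0.08, 0.01))  # curiosity-heavy post -> 0.12
print(rank_contribution(0.05, 0.03))  # commitment-heavy post -> 0.17 (wins here)
```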
“Close the editor”: creators are prioritizing docs and rules over diffs
Docs and agent rules (Workflow): A blunt take argues creators should prepare for a near-term world where “the code won’t matter,” and instead stop obsessing over diffs/Tailwind tweaks and invest in better docs + rules for agents to follow, as stated in the Docs and rules push. The same thread of thought continues with the idea that managing agents + clean markdown context may out-rank raw coding ability, per the Markdown intern comparison.
This is less about a specific tool update and more about what people are optimizing for when the bottleneck is agent alignment, not keystrokes.
Claude Code is being framed as the “build anything” tool for creatives
Claude Code (Anthropic): A creator-facing push frames Claude Code less as “coding help” and more as a way for designers/filmmakers to ship custom tools, portfolios, and interactive web experiences, as argued in the Build anything note. It’s a clean signal that “creative coding” is becoming a default lane for AI-native artists.
The post doesn’t include tactics or templates—just the positioning—but it’s notable because it assumes creatives are now comfortable owning small software artifacts, not just images/videos.
Gemini is being predicted as the default browser-based agent via distribution
Gemini (Agent usage): A prediction frames Gemini’s advantage as distribution: “long horizon tasks directly in the browser,” with the claim that Gemini could become the most-used agent because it’s already becoming the most-used LLM—driven by “distribution and surface area,” according to the Browser agent prediction.
No concrete demo or benchmarks are included in the tweet; it’s a positioning statement about where agents win (default placement) rather than a capability claim.
🧰 Where creators run models: ComfyUI-in-browser, Comfy integrations, and model hosting hubs
Platform availability and ‘run it here’ posts: browser-hosted workflows, ComfyUI integrations, and places to access models without local setup. Excludes the actual model capability deep-dives.
Floyo brings ComfyUI to the browser with a big workflow library
Floyo: Floyo is being pitched as a “run ANY open-source workflow in your browser” layer for ComfyUI—no installs, configs, or local GPU—aimed at making complex node graphs usable for non-technical creators, as described in the Launch claim and reinforced by the Setup-free demo.

• Workflow library as the hook: The pitch leans hard on prebuilt pipelines—Wan 2.6 image-to-video, ControlNet bundles, sketch-to-lineart+color, video outpainting, face swap & inpainting—summarized in the Workflow list and Example workflows.
• Scale/traction signals: Floyo claims “1000+ pre-built workflows” and calls out “4.2k views on their Wan 2.6 workflow alone,” per the Launch claim and View count stat.
This is a distribution play for open workflows: ComfyUI power without local setup, with portability/lock-in framed around exporting your workflows as noted in the Open source positioning.
ComfyUI adds Grok Imagine access as a node workflow option
ComfyUI + Grok Imagine: Grok Imagine is now callable inside ComfyUI, framed as a new “model access via nodes” option rather than a separate web app flow, as announced in the ComfyUI availability note.

The post also hints at qualitative differences (“some models are just more emotional”), which matters mainly because it suggests creators can A/B Grok outputs against other Comfy graphs without leaving the node stack, per the ComfyUI availability note.
fal is a practical place to run Grok Imagine video edits
fal + Grok Imagine Video: Creators are running xAI’s Grok Imagine video editing model on fal (video-to-video edits), emphasizing fast segmentation while keeping the rest of the clip consistent, as shown in the Before-after edits and linked via the edit endpoint in fal edit endpoint.

From the fal page, pricing is framed as duration-based (example: $0.36 for a 6-second edit), with common upload formats called out (mp4/mov/webm/gif) in the pricing and formats blurb on fal edit endpoint.
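Assuming billing scales linearly from that example (about $0.06 per second), a rough cost estimate looks like the sketch below; the endpoint page’s actual pricing governs.

```python
# Back-of-envelope cost estimate, assuming linear per-second pricing derived
# from the quoted example ($0.36 for a 6-second edit). Actual billing may differ,
# e.g., if input and output seconds are metered separately.
EXAMPLE_PRICE_USD = 0.36
EXAMPLE_SECONDS = 6
PRICE_PER_SECOND = EXAMPLE_PRICE_USD / EXAMPLE_SECONDS  # = $0.06/s

def estimate_cost(clip_seconds: float) -> float:
    return round(clip_seconds * PRICE_PER_SECOND, 2)

print(estimate_cost(10))  # 0.6  -> about $0.60 for a 10-second edit
print(estimate_cost(30))  # 1.8  -> about $1.80 for a 30-second edit
```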
Hailuo x Dzine promotes lip sync plus “unlimited” image generations with a time-boxed discount
Hailuo x Dzine: Dzine is promoting Hailuo as a “leading AI lip sync” option, bundled with “unlimited image generations,” alongside a limited-time discount—50% off Dzine’s yearly plan for Hailuo users for 7 days—as stated in the Promo post and detailed on the offer page in Promo landing page.
This is mainly a distribution/access update: it’s about where to run lip sync and related generators, and what the temporary pricing incentive is, per the Promo post.
🎵 Music + audio-reactive worlds: sound design pipelines and beat-driven visuals
Audio is present mainly as workflow glue: music generation/usage and audio-reactive visuals, rather than major new music-model releases. Excludes voice/TTS tools (Voice & Narration category).
Suno MIDI stems are being used to trigger beat-synced 3D worlds in the browser
Suno → MIDI → 3D world (Workflow pattern): A concrete “audio-reactive world” pipeline showed up today: export MIDI stems from Suno, map them to events, and drive a real-time web world that reacts on every beat, as described in the MIDI stems note and demoed in the Audio-reactive world build.
The demo is positioned as a guided, interactive presentation of a specific song (“The Beyond The Beyond”), with the creator saying it’s the first of many worlds—see the live build via the Experience site.
• What creators can copy: the key move is using MIDI (not audio amplitude) so beat/phrase-level structure can trigger deterministic scene events, per the MIDI stems note.
What’s still unclear from the tweets is the exact stack used to map MIDI to scene logic (Three.js/Unity WebGL/etc.), but the proof-of-concept is concrete and already shipped as a playable site, as shown in the Audio-reactive world build.
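The stack for mapping MIDI to scene logic isn’t shown, but the core move (triggering events from MIDI notes rather than audio amplitude) can be sketched with the mido library; the cue format and filenames below are assumptions.

```python
# Sketch of MIDI-driven scene triggers (not the creator's stack): read an exported
# stem with mido and turn note-on events into timestamped cues a renderer could load.
# The cue format, threshold, and filenames are assumptions.
import json
import mido

def midi_to_cues(path: str, min_velocity: int = 1) -> list[dict]:
    cues = []
    elapsed = 0.0
    for msg in mido.MidiFile(path):   # iterating a MidiFile yields tempo-aware delta times in seconds
        elapsed += msg.time
        if msg.type == "note_on" and msg.velocity >= min_velocity:
            cues.append({"t": round(elapsed, 3), "note": msg.note, "velocity": msg.velocity})
    return cues

if __name__ == "__main__":
    cues = midi_to_cues("drums_stem.mid")     # hypothetical Suno-exported stem
    with open("scene_cues.json", "w") as f:   # e.g., loaded by a Three.js or WebGL front end
        json.dump(cues, f, indent=2)
```

Deterministic cues are the payoff: the same stem always produces the same trigger list, so the world reacts identically on every run instead of chasing an amplitude envelope.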
AI character shorts keep standardizing on Suno for the final music layer
Suno (Soundtrack layer): Another clear “default stack” signal: creators building short character pieces with Midjourney + Nano Banana Pro + Kling are listing Suno as the music step (often alongside Topaz upscaling), as shown in the Toolchain credits and reiterated in the Character clip example.

The notable part for audio workflows is the role separation: visuals and motion are generated/animated upstream (Midjourney/Nano Banana/Kling), while Suno gets treated as a modular, swappable soundtrack stage per the Toolchain credits.
Krea Realtime Edit is being aimed at live restyling of audio-reactive scenes
Krea Realtime Edit (Krea): A creator shared a ~48-second batch of Realtime Edit experiments and explicitly framed the next step as “feed it an audio reactive scene to then be restyled live,” according to the Audio-reactive restyle goal.

This is a practical direction for music visualizers: instead of generating the motion from scratch, generate (or render) a beat-driven base scene first, then use Realtime Edit as a style layer on top—matching the intent described in the Audio-reactive restyle goal.
🎙️ Voices, lip sync, and narration: creator tool picks + scaling TTS in production
Voice and lip-sync focused chatter: what tools creators are choosing, plus production systems leaning on TTS (not music composition). Excludes music generation (Audio & Music category).
AI pipeline turns local news feeds into a daily narrated podcast using ElevenLabs V3
ElevenLabs V3 (ElevenLabs): A creator demoed a hands-free “local news podcast factory” that scrapes sources (Google News + local feeds + social), writes a paced/emotional script, then renders “broadcast-quality” narration with ElevenLabs V3, claiming cost drops from $50,000+ per year to about $100/month in APIs, per the Automation breakdown.

• Narration control: The workflow explicitly calls out “emotion and pacing controls” in the script stage, then uses ElevenLabs V3 as the voice engine, according to the Automation breakdown.
• Scale thesis: It’s framed as economically enabling “thousands of hyper-local podcasts,” with no hosts or editors once deployed, as described in the Automation breakdown.
Creators debate best lip-sync stack as HeyGen gets “close” but lacks video input
Lip-sync tool selection: A thread asking “what is the best lipsync tool” (with Kling explicitly excluded) turned into a practical limitation callout: HeyGen is described as “SO close” but missing video-as-input, which blocks some creator workflows, according to the Tool question and the HeyGen limitation.
• Training-heavy alternatives: One creator cites prior tests with Lipdub that required about 30 seconds of lip-movement training footage and could require retraining for lighting changes (taking hours), per the Lipdub workflow notes.
The thread’s consensus is mixed in these snippets: people want pro-grade dubbing with audio clips they control, but tool ergonomics (inputs, retraining, lighting robustness) keep dominating the choice.
Hailuo and Dzine bundle lip sync tooling with a 7-day 50% yearly-plan discount
Hailuo (MiniMax) + Dzine: A partnership promo positions Hailuo as a “leading AI lip sync” tool and pairs it with Dzine plus “unlimited image generations,” advertising a 7-day limited-time offer of 50% off Dzine’s yearly plan for Hailuo users, per the Promo post and the details on the Offer page.
The actual lip-sync capabilities aren’t benchmarked in the tweets, so treat this as pricing/packaging signal rather than a proven quality jump.
📅 Creator challenges & programs: short-deadline drops and paid community pathways
Time-boxed creator opportunities and challenges surfaced today. Excludes general community encouragement posts unless tied to a concrete program/challenge.
Hailuo runs a 48-hour Nano prompt challenge for 50 Ultra memberships
Hailuo (Hailuo AI): Hailuo posted a short-deadline creator challenge—48 hours only—offering 50 free Ultra memberships for people who generate with Nano in Hailuo and then quote-repost results using #Hailuo and tagging the account, as outlined in the challenge instructions and routed via the challenge page.
This is structured like a rapid “prompt proof” sprint: public outputs are the entry mechanic, and the reward is access-tier (Ultra) rather than cash.
Firefly Ambassador wave moves forward as recommendations are submitted
Adobe Firefly Ambassadors (program): Following up on Paid program, one organizer says they’ve now sent their recommendation list for the next wave and hints that additional paid opportunities will come later, according to the recommendations submitted note. The same thread frames creator selection as an ongoing, community-recommendation-driven process rather than a one-off intake, as described in the community builder context.
This update doesn’t add new eligibility details, but it does signal the current wave is already in review.
Satoshi Army exhibition featuring AI-involved work opens at Museo Marte
Satoshi Army (Satoshi Gallery): A show titled Satoshi Army is announced as opening at Museo Marte in El Salvador, positioned as a major museum exhibition featuring “iconic” works, per the museum announcement. In parallel, a participating creator describes a 6-hour AI video collaboration workflow with traditional artists tied to the same gallery ecosystem and invites attendance at a local event, as described in the process note.

This is more gallery-program signal than tool news: museums and curators are programming AI-assisted work into formal exhibition contexts.
📣 Short-form marketing engines: UGC floods, ‘wisdom pages,’ and content monetization playbooks
Marketing-focused creator tactics: scaling UGC ads, monetizable AI ‘wisdom’ accounts, and automation-first distribution. Excludes pure tool capability news.
Linah AI claims a 400+ UGC-ad-per-brand factory built on Veo 3
Linah AI + Veo 3: A creator demo frames high-volume UGC ad generation as the new bottleneck—claiming a system that can produce 400+ UGC-style ads per brand by mixing personas, hooks, and formats, as described in the UGC engine breakdown. Volume is the product.

• What the “factory” actually outputs: The pitch lists unboxings, problem–solution, testimonials, day-in-the-life, before/after, and offer-focused variants—positioned as “50–100 ads per day,” according to the UGC engine breakdown.
• Why it’s framed as urgent: The same post argues TikTok/Meta reward brands publishing 50–200 creatives per week, which is used as the justification for automation, per the UGC engine breakdown.
A parallel “AI ads platform” tease shows up in the local-news automation thread, which signals similar product thinking across creators—see the Ad platform tease.
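Mechanically, the "400+ ads per brand" claim is combinatorics: a short list of personas, hooks, and formats multiplies into hundreds of briefs before any video model is called. Below is a minimal sketch of that expansion step only, with hypothetical persona/hook/format lists (this is not Linah AI's actual system, and the Veo 3 rendering call is omitted):

```python
from itertools import product

# Hypothetical inputs; the real system's persona/hook/format lists are not public.
personas = ["busy parent", "college athlete", "remote worker", "retiree"]
hooks = ["problem-solution", "unboxing", "testimonial", "day-in-the-life", "before/after"]
formats = ["9:16 15s", "9:16 30s", "1:1 15s", "16:9 30s", "9:16 6s offer cut"]

def build_briefs(brand: str):
    """Expand persona x hook x format into one text brief per ad variant."""
    for persona, hook, fmt in product(personas, hooks, formats):
        yield (
            f"{brand} UGC ad | persona: {persona} | angle: {hook} | "
            f"format: {fmt} | style: handheld, natural light, spoken to camera"
        )

briefs = list(build_briefs("AcmeSkincare"))
print(len(briefs))  # 4 * 5 * 5 = 100 variants from tiny lists; 400+ needs only slightly longer ones
# Each brief would then be handed to a video model (e.g. Veo 3) for rendering.
```

The point of the sketch is that the creative bottleneck moves from "making an ad" to curating which of the hundreds of auto-generated variants are worth spending render credits on.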
A $100/month automation stack for daily local-news podcasts
Hyper-local podcast factory: One builder describes a fully automated daily local-news podcast pipeline—scraping Google News and local feeds, generating paced scripts, and producing audio via ElevenLabs V3, with an all-in cost framed as $100/mo in the Workflow breakdown. It’s pitched as a way to serve “news deserts.”

• Pipeline shape: Auto-scrape (News/social/events) → “professional scripts with emotion and pacing controls” → ElevenLabs V3 audio → hands-free deployment, as outlined in the Workflow breakdown.
• Business framing: The same thread claims this drops a typical annual budget (“$50,000+”) into a subscription stack and could scale to “thousands” of local pods, per the Workflow breakdown.
A follow-up post turns it into a lead magnet offering an n8n template + setup video, as stated in the Blueprint offer.
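The pipeline shape is simple enough to mock up end to end. Here is a minimal Python sketch under stated assumptions: the feed URLs and helper names are hypothetical, the scripting step stands in for the LLM call with "emotion and pacing controls," and the ElevenLabs V3 call is stubbed rather than guessed at:

```python
import datetime
import feedparser  # pip install feedparser

# Hypothetical local feeds; the thread doesn't list its actual sources.
FEEDS = [
    "https://news.google.com/rss/search?q=springfield+local+news",
    "https://example-town-paper.com/rss",
]

def scrape_headlines(feeds: list[str], limit: int = 10) -> list[str]:
    """Stand-in for the News/social/events scrape: pull today's top RSS items."""
    items = []
    for url in feeds:
        parsed = feedparser.parse(url)
        items.extend(entry.title for entry in parsed.entries[:limit])
    return items[:limit]

def write_script(headlines: list[str]) -> str:
    """Stand-in for the scripted-with-pacing step; in the described stack this is an LLM call."""
    date = datetime.date.today().strftime("%B %d")
    bullets = "\n".join(f"- {h}" for h in headlines)
    return f"Good morning, it's {date}. Here's what's happening locally today:\n{bullets}"

def synthesize_audio(script: str) -> bytes:
    """Stand-in for the ElevenLabs V3 call; voice and endpoint parameters deliberately omitted."""
    raise NotImplementedError("Call your TTS provider here and return audio bytes.")

if __name__ == "__main__":
    script = write_script(scrape_headlines(FEEDS))
    print(script)  # audio synthesis and hands-free publishing would follow in the real stack
```

The $100/mo framing in the thread is essentially the sum of the subscriptions behind each of these stubs (feeds, LLM, TTS, hosting/automation), not any single tool.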
AI “wisdom pages” are being sold as trust-first monetization machines
AI wisdom pages: A monetization playbook claims “$30k–$75k per month” pages are built around calm monk-style avatars delivering short health/mindset advice—then layering ebooks and affiliate products “naturally over time,” as described in the Wisdom pages breakdown. The tone is intentionally low-pressure.
• Format specifics shown: Example profiles emphasize “grounding and familiar” delivery and use multiple similar accounts/characters, per the Wisdom pages breakdown.
• Scale claim extension: Following up on Distribution engineering (AI influencer networks), the same author cites “over 230m views in December alone” coming from AI influencer mass marketing across “dozens of accounts,” according to the Network scale claim.
The posts are promotional; no verifiable revenue proof is included in the tweets.
Keyword “comment to get DM” gating keeps spreading across AI creator offers
Engagement-gated lead magnets: Multiple creators use a consistent growth mechanic—ask people to comment a keyword (“AI”, “BLUEPRINT”, “wisdom”, “LINAH”) to receive prompts/templates via DM, often paired with “follow so I can DM you,” as shown in the Gemini mega prompts gate and Blueprint gate. It’s a distribution tactic.
• What’s being gated: “300+ mega prompts” for Gemini per the Gemini mega prompts gate; an n8n automation template + prompting pack per the Blueprint gate; and monetization breakdowns for “wisdom pages,” according to the Wisdom page playbook offer.
This pattern shows up again in the UGC-ad-factory pitch, which uses “comment LINAH” for a workflow breakdown in the LINAH keyword gate.
A “one-shot” AI-generated promo ad is pitched as the next ad format
One-shot promo generation: A creator claims they turned a pricing pitch (CapCut renewal $19/mo vs reseller $7.49/mo) into an AI-generated promo video “in one click,” with camera push-in and logo timing matching the character’s actions, as described in the One-shot promo claim. This is framed as an alternative to assembling clips and edits.

The broader context is the same “creative volume” argument seen in UGC-ad-factory posts—compare the High-volume UGC claim framing to this One-shot promo claim.
📈 Creator reality check: algorithm mechanics, engagement loops, and community trust signals
Platform behavior and creator-economy signals: how discovery works (click vs follow), engagement-farming dynamics, and community support posts. Excludes marketing tactics (Social Marketing) and policy (Trust/Safety).
X’s ranking funnel: profile curiosity and follow conversion are scored separately
X algorithm mechanics: A reverse-engineering thread claims X doesn’t just predict “will they follow,” but separately scores “will they click the profile” and “will they follow,” framing it as two weighted steps in distribution—curiosity vs commitment, as laid out in the Two-predictions explainer and the Scoring formula note.
• What the thread names: It cites internal labels like profile_click_score (“Who made this?”) and follow_author_score (“I want more from them.”), per the Phoenix scorer snippet and the Curiosity vs commitment line.
• Practical implication for creators: It argues posts can win twice—once by triggering profile clicks, then again by converting those visits into follows—spelled out in the Two questions recap.
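As a concrete illustration of what "two weighted predictions" would mean, here is a toy scorer that mirrors the labels the thread cites. The weights and numbers are invented for illustration; this is not X's actual ranking code:

```python
from dataclasses import dataclass

@dataclass
class Post:
    author: str
    text: str
    profile_click_score: float  # P("who made this?" -> profile visit), per the thread's label
    follow_author_score: float  # P(that visit converts to a follow), per the thread's label

# Invented weights purely for illustration; the thread doesn't publish real coefficients.
W_CLICK = 1.0
W_FOLLOW = 2.5

def rank_score(post: Post) -> float:
    """Curiosity and commitment are scored separately, then combined."""
    return W_CLICK * post.profile_click_score + W_FOLLOW * post.follow_author_score

posts = [
    Post("a", "viral one-liner, weak profile", 0.30, 0.02),   # score 0.35
    Post("b", "niche thread, strong profile", 0.12, 0.10),    # score 0.37
]
for p in sorted(posts, key=rank_score, reverse=True):
    print(p.author, round(rank_score(p), 3))
# Under these made-up weights, the post that converts visits into follows
# outranks the one that only sparks curiosity.
```

Read as a creator heuristic rather than ground truth: a hook earns the profile click, but the profile and body of work do the second half of the scoring.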
Community support is being framed as creator infrastructure, not “nice to have”
Creator community trust signals: A set of posts frames public support for AI creatives as a practical necessity—inviting followers to share portfolios, offer collaborations/jobs, and normalize using AI in creative workflows, as written in the Collaboration invite and reinforced by the Career doors reflection.
The same account also frames “paid opportunities” as something it will route via recommendations, which positions visibility and mutual support as a resource network rather than pure socializing, according to the Recommendations note.
• Gift mechanics as trust-building: The offer to gift a small number of X Premium subscriptions is presented as a community give-back tied to nominations, per the Premium gifting post.
Creators are using “are you real?” posts as a proof-of-life trust signal
Follower authenticity check: A creator with 113,000 followers publicly asks how many accounts “actually exist” and prompts replies (“Say hi or drop an emoji”), using a simple proof-of-life prompt as both a trust check and engagement reset, as shown in the Follower reality check post and echoed in the 113 follow-up.

The pattern leans on lightweight participation (one comment) rather than a giveaway or external link, and it’s paired with milestone signaling (“113 is my fav number”), per the 113 follow-up.
“I support small accounts” engagement scripts are getting called out as scams
Scam-awareness (engagement bait): A creator mocks the familiar “support small accounts” promise structure (likes/reposts in exchange for attention, love-bombing, and absurd rewards) and explicitly frames it as a con that “big accounts” run daily, calling it “el timo de la estampita” (a classic Spanish street swindle, roughly “the holy-card scam”), as described in the Scam pattern callout.
🎞️ What shipped: interactive experiences, short-form experiments, and creator reels worth studying
Named or clearly-packaged creative releases and showcase pieces from creators (not generic ‘cool clip’ tool demos). Excludes Project Genie experiments (feature).
The Cube Experience ships as a beat-reactive, web-based 3D music world
The Cube Experience (Ben Nash): A web-based, guided 3D audiovisual “world” shipped that reacts to music events in real time, positioned as “the first of many new worlds,” per the launch note in the Project description. It’s built around MIDI-triggered events exported from Suno, as described in the MIDI stems note, which makes it a clear reference for interactive music-video / album-experience formats that run in a browser (see the Experience site). A key detail: it’s presented as an interactive, guided experience rather than a linear render, as shown in the Project description.
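For creators studying the format, the core trick (turning exported MIDI into a timeline of events a renderer can react to) is easy to prototype. A minimal sketch using the mido library follows; the file name and note-to-effect mapping are hypothetical, since the posts don't document the actual Cube Experience build:

```python
import mido  # pip install mido

def midi_to_events(path: str) -> list[dict]:
    """Convert note-on messages into absolute-time events a visual renderer can consume."""
    events, t = [], 0.0
    for msg in mido.MidiFile(path):  # iterating a MidiFile yields messages with delta time in seconds
        t += msg.time
        if msg.type == "note_on" and msg.velocity > 0:
            events.append({"time": t, "note": msg.note, "velocity": msg.velocity})
    return events

# Hypothetical usage: a browser or game-engine layer would schedule one visual effect per event.
if __name__ == "__main__":
    for ev in midi_to_events("suno_stems_drums.mid")[:8]:
        print(f"{ev['time']:.2f}s  note={ev['note']}  vel={ev['velocity']}")
```

Once the events exist as timestamps, the "beat-reactive" part is just scheduling: each event triggers a camera move, light pulse, or geometry change at its time offset.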
Fable Simulation teases Ikiru Shinu as a remix-first AI storytelling format
Ikiru Shinu (Fable Simulation): Fable Simulation posted a teaser positioning Ikiru Shinu as “the future of AI storytelling,” with a specific emphasis that it “dares you to remix with it,” as framed in the Remix-first claim. It’s paired with an early-access funnel pointing at Showrunner, as linked in the Early access link.

The point is authorship workflow: the teaser language in the Remix-first claim frames the product less as a passive feed and more like an interactive story sandbox.
Midjourney is being framed as an ideation tool, not a final renderer
Midjourney ideation workflow: A creator argued that using Midjourney “purely for ideation” is an under-discussed edge, with a quick visual scroll-through of concepts shown in the Ideation montage. It’s a short claim with clear positioning.

This frames Midjourney as a “concept search” layer that feeds downstream tools (animation, layout, 3D) rather than being the end product, as stated in the Ideation montage.
The Artisan – Part 2 gets shared as an AI-involved short film segment
The Artisan – Part 2: A creator shared “The Artisan – Part 2” while explicitly noting it was “created involving AI” and calling out credited music and sound design, as described in the AI-involved credit note. The “Part 2” label is the notable concrete detail: it reads like a case study in presenting AI work with traditional production credits (director + sound) rather than treating it as a tool demo.
🧠 Compute & cost pressure: cheaper VRAM comparisons and local-first creator mindsets
Compute economics that impact creators’ ability to run models: high-VRAM hardware comparisons and local-stack sentiment. Excludes general consumer hardware chatter without AI compute implications.
Huawei 96GB vs Nvidia 96GB VRAM price gap gets fuzzier in viral compare clip
96GB VRAM price pressure: Following up on VRAM gap (the $2k vs $10k narrative), a reposted comparison clip still frames Huawei Atlas DUO 96GB VRAM as “<$2,000” versus Nvidia RTX 6000 96GB VRAM as “>$10,000”, but it also briefly swaps in “$6,000” on the Nvidia side—see the on-screen price cards in price comparison clip.

The only concrete “update” here is the appearance of that $6,000 figure; the tweet doesn’t provide a source (retailer, region, time window), so treat it as price-discourse signal, not verified market pricing.
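Taking the clip's on-screen figures at face value, the comparison reduces to dollars per GB of VRAM; the quick arithmetic below uses only the quoted numbers, none of which are verified market prices:

```python
# Price points as shown in the clip; none are independently sourced or verified.
quotes = {
    "Huawei Atlas DUO 96GB (clip: <$2,000)": 2000,
    "Nvidia RTX 6000 96GB (clip: >$10,000)": 10000,
    "Nvidia RTX 6000 96GB (clip's alternate $6,000 card)": 6000,
}

for label, usd in quotes.items():
    print(f"{label}: ~${usd / 96:.0f} per GB of VRAM")
# Roughly $21/GB vs $104/GB vs $62/GB: the gap narrative survives either Nvidia figure,
# but the unsourced $6,000 cuts the claimed multiple from ~5x to ~3x.
```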
Local-stack adoption gets framed as a moat: most won’t set it up
Local-first creator compute mindset: A recurring take is getting sharper—one creator argues “99%… won’t set up a clawdbot,” “90%… who got a mac mini will forget about it,” and “the rest… will be dangerous,” positioning local automation + local compute as a capability gap rather than a mainstream shift, as stated in local stack adoption take.
This is about compute economics indirectly: if local stacks stay niche, the “power users” end up being the ones who can justify/optimize hardware, orchestration, and run costs.
Mac Studio sellout speculation resurfaces as an AI workload proxy
Mac Studio demand speculation: A creator predicts the “m5 max mac studio will be the quickest sold out computer in history,” framing it as impending scarcity rather than a normal upgrade cycle, per sellout prediction.
There’s no data attached (allocation, lead times, SKU counts), but it’s being used socially as shorthand for growing on-device/desktop AI workloads competing for high-end creator machines.