Lightricks LTX‑2 open-weights 20s 4K AV model – RTX gains 3× speed


Executive Summary

Lightricks is turning LTX‑2 into an open AV stack: full weights, a distilled variant, LoRAs, and a multimodal trainer ship together, targeting ~20s of 4K synchronized audio‑video in a single pass; ComfyUI adds Day‑0 node graphs with canny/depth/pose control and keyframing, while NVIDIA‑tuned NVFP4/NVFP8 checkpoints report ~60% lower VRAM use and up to 3× speedups on RTX GPUs. CEO Zeev Farbman frames the move as a response to calls for “DeepSeek moments” in creative video—shifting LTX‑2 from hosted endpoints to something teams can audit, fine‑tune, and run locally; early community clips emphasize identity persistence, strong lip‑sync, and fp8 “turbo” image‑to‑video flows on consumer cards.

Kling and peers: Runware adds 30s one‑take Motion Control; OpenArt showcases Voice Control for cross‑scene avatars; PixVerse’s MIMIC and BytePlus Seedance 1.5 Pro lean into motion‑clone and millisecond lip‑sync.
Hardware, worlds, policy: Tencent’s HY‑World 1.5 opens its world‑model stack with a 5B Lite; NVIDIA’s Rubin NVL72 hits 3.6 EFLOPS inference; Hollywood’s 2026 AI calendar bunches DOJ, SAG‑AFTRA, Oscars, and contract‑expiry fights.

Open AV and world models plus Rubin pods widen who can run video and spatial workloads.

Feature Spotlight

LTX‑2 goes truly open: local AV video for creators

Open weights + trainer and NVIDIA‑optimized ComfyUI make LTX‑2 the first practical, open, local AV video model—4K, keyframes, and synced audio—putting studio‑grade control on creators’ RTX PCs.


Table of Contents

🎬 LTX‑2 goes truly open: local AV video for creators

Cross‑account story. Lightricks’ LTX‑2 ships open weights + full trainer, Day‑0 ComfyUI support, and NVIDIA‑optimized checkpoints—native, synchronized audio+video at 4K with keyframe/control nodes. Big, practical leap for indie film/music video workflows.

Lightricks open-sources LTX-2 audio-video model with full trainer

LTX-2 (Lightricks): Lightricks has released LTX-2 as a truly open-source audio‑video generation model, including full weights, a distilled variant, controllable LoRAs, and a complete multimodal training stack—following up on synced audio, where fal first exposed LTX‑2 via hosted endpoints (open source thread). The model targets up to ~20 seconds of synchronized motion, dialogue, sound effects, and music in a single pass, with support for native 4K and up to 50fps in downstream pipelines (performance thread).

CEO explains LTX-2 openness
Video loads on view

Full-stack release: Lightricks is shipping full model weights plus a distilled version, camera/structure/conditioning LoRAs, a multimodal trainer, benchmarks, and evaluation scripts, as detailed in the open release breakdown. (open source thread, release contents)
Native AV generation: The model generates tightly synchronized audio and video together (rather than stitching), with multi‑keyframe control and fine‑grained conditioning built in rather than added post‑hoc. (av capabilities, performance thread)
Openness rationale: CEO Zeev Farbman frames this as a response to calls for "DeepSeek moments" in AI and argues that creative AV models must be open, inspectable, and runnable on local hardware to evolve with real production constraints. (openness Q&A, deepseek answer)
Docs and access: Lightricks points creators and developers to the LTX‑2 model page and documentation, which consolidate downloads, trainer instructions, and examples of extending the camera and structure controls. (community message, model page)

The launch shifts LTX‑2 from a closed API experience into something teams can audit, fine‑tune, and slot directly into their own creative pipelines.
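For developers wondering what the local path looks like in code, here is a minimal sketch of loading an open Lightricks checkpoint through Hugging Face diffusers, using the LTXPipeline class that exists for the earlier LTX‑Video release; whether LTX‑2 ships under the same pipeline class and repo ID is an assumption here, not something the release posts confirm, so treat the LTX‑2 model page and trainer docs as the authoritative path.

```python
# Minimal local text-to-video sketch via diffusers. "Lightricks/LTX-Video" is the
# earlier LTX-Video repo; LTX-2 support and its repo ID are assumptions here.
import torch
from diffusers import LTXPipeline
from diffusers.utils import export_to_video

pipe = LTXPipeline.from_pretrained("Lightricks/LTX-Video", torch_dtype=torch.bfloat16)
pipe.to("cuda")  # or pipe.enable_model_cpu_offload() if VRAM is tight

result = pipe(
    prompt="handheld shot of a drummer in a neon-lit rehearsal room, shallow depth of field",
    negative_prompt="worst quality, blurry, jittery motion",
    width=768,
    height=512,
    num_frames=121,            # ~5 s at 24 fps on this older checkpoint; LTX-2 targets ~20 s
    num_inference_steps=40,
)

export_to_video(result.frames[0], "drummer.mp4", fps=24)
```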

ComfyUI and NVIDIA ship optimized local LTX-2 pipelines for RTX GPUs

LTX-2 in ComfyUI (ComfyUI/NVIDIA/Lightricks): ComfyUI now exposes LTX‑2 as a native, Day‑0 pipeline with canny/depth/pose video‑to‑video control, keyframe‑driven generation, built‑in upscaling, and prompt enhancement, while NVIDIA and ComfyUI add NVFP4/NVFP8 checkpoints that cut VRAM use by ~60% and speed up local RTX workflows up to 3×. (comfyui launch, nvfp4 nvfp8 note) NVIDIA is simultaneously pitching RTX AI PCs as capable of "fast" LTX‑2 video generation with an optimized ComfyUI graph and tutorial content for text‑to‑video starters. (rtx ai pcs, nvidia how-to)

ComfyUI LTX-2 control demo
Video loads on view

Control surface for creators: ComfyUI’s node graph supports canny, depth, and pose conditioning for video‑to‑video, keyframe‑driven shots, and native upscaling plus prompt enhancement in a single LTX‑2 pass, aiming at precise camera and motion control for creative work (comfyui launch).
Performance and specs: ComfyUI highlights up to 20 seconds of synchronized audio‑video, native 4K resolution, and up to 50fps output, while also claiming roughly 50% less compute and up to 6× faster generations than comparable models in their tests (performance thread).
RTX‑focused quantization: New NVFP4 and NVFP8 LTX‑2 checkpoints are tuned for RTX consumer GPUs; ComfyUI reports up to 3× faster runs with about 60% lower VRAM usage when using NVFP4 versus full‑precision weights (nvfp4 nvfp8 note).
Local→cloud hybrid pitch: The team describes LTX‑2 as running efficiently on consumer‑grade GPUs while also fitting cleanly into local‑to‑cloud hybrid pipelines, so teams can prototype on desktops and scale or batch jobs remotely as needed (performance thread).

Taken together, LTX‑2 plus ComfyUI and NVIDIA’s quantized checkpoints position high‑quality, controlled AV generation as something that can live on an individual creator’s RTX PC instead of only behind cloud APIs.
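As a rough, hedged back-of-envelope (not a measurement of the actual checkpoints), the snippet below shows how a ~60% total VRAM reduction can fall out of 4-bit weights: it assumes a hypothetical parameter count, 16-bit baseline weights, roughly half a byte per weight plus block scales under NVFP4, and a fixed activation/latent overhead that quantization does not shrink.

```python
# Back-of-envelope for the "~60% lower VRAM" NVFP4 claim.
# Every number here is an illustrative assumption, not a measured LTX-2 figure.
params_b = 19.0              # hypothetical parameter count, in billions
bytes_fp16 = 2.0             # 16 bits per weight
bytes_nvfp4 = 0.5 + 1 / 16   # 4 bits per weight + ~1 byte of scale per 16-value block

weights_fp16 = params_b * bytes_fp16    # GB of weights at 16-bit
weights_nvfp4 = params_b * bytes_nvfp4  # GB of weights under NVFP4
overhead = 8.0               # assumed activations/latents/buffers, mostly precision-independent

total_fp16 = weights_fp16 + overhead
total_nvfp4 = weights_nvfp4 + overhead
saving = 1 - total_nvfp4 / total_fp16
print(f"weights: {weights_fp16:.1f} GB -> {weights_nvfp4:.1f} GB")
print(f"total:   {total_fp16:.1f} GB -> {total_nvfp4:.1f} GB  (~{saving:.0%} lower)")
```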

Early LTX-2 demos show strong identity, lip-sync, and fast I2V

LTX-2 creator adoption (community): Early testers are posting LTX‑2 clips that emphasize identity preservation, facial nuance, and fast image‑to‑video runs, with several calling the open release a major step for independent creators. (ostris demo, kimmonismus reaction) Feedback centers on character stability in 4K, responsive prompt following, and the practicality of fp8 "turbo" variants on strong GPUs.

Creator LTX-2 sample clip
Video loads on view

Perceived quality: One creator notes that "facial mimics, prompt following, identity preservation are all really good, even in 4K" when using LTX‑2 on complex shots (creator feedback).
Speed and fp8 modes: Another tester describes an "fp8 turbo" image‑to‑video configuration as "So fast. So amazing!", underscoring that quantized checkpoints feel viable for real work rather than only benchmarks (fp8 turbo test).
Open-source sentiment: A prominent community voice frames the release as "open source video generation just took a massive leap", tying the creative possibilities directly to the fact that weights and training code are available instead of locked in a SaaS tier (kimmonismus reaction).
Heavy-hardware experiments: Japanese users report running 1280×720, 121‑frame clips with LTX‑2 I2V on ~100GB VRAM DGX‑class setups, signaling interest from high‑end experimenters pushing resolution and duration (japanese i2v test).

These early anecdotes are promotional rather than systematic benchmarks, but they suggest that LTX‑2’s open release is already feeding into image‑to‑video, 4K portrait, and character‑driven workflows across the indie creator community.


🎥 Kling 2.6 adoption: Runware support and CES spotlight

Excludes LTX‑2 feature. Today’s updates push Kling 2.6 beyond a trend: Runware adds Motion Control with 30s takes, OpenArt showcases Voice Control consistency, and Kling AI announces a CES creator panel.

OpenArt showcases Kling 2.6 Voice Control for consistent dialogue across clips

Kling 2.6 Voice Control (OpenArt): OpenArt is promoting Kling 2.6 Voice Control as a way to keep the same synthesized voice and character consistent across multiple clips and scenes, emphasizing cross‑scene identity in its short demo voice control clip. The feature runs as a public preset on OpenArt’s video generator, where creators can try voice‑driven performances from a browser UI video generator page.

Kling 2.6 voice control demo
Video loads on view

Cross‑clip consistency: The demo text stresses "Same voice. Same character. Across clips. Across scenes.", with a ~20‑second sequence showing the same animated persona speaking through several camera setups while retaining timbre and delivery voice control clip.
Creator workflow surface: OpenArt routes users to a dedicated Kling 2.6 I2V page with presets for character, audio, and motion, positioning this as a tool for avatar‑driven explainers, series intros, or recurring characters rather than one‑off tests video generator page.

For storytellers building episodic content or branded hosts, this indicates Kling is being treated not just as a shot generator but as an identity‑aware dialogue engine inside mainstream creator platforms.

Runware adds Kling VIDEO 2.6 Motion Control with 30s one‑take shots

Kling VIDEO 2.6 Motion Control (Runware): Runware has integrated Kling VIDEO 2.6 Motion Control into its model catalog, letting creators drive character performance from a reference video and render up to 30‑second, single‑take clips with synced motion, expressions, and lip sync, as outlined in the launch copy Runware launch. This gives small teams a way to prototype dance, martial arts, or talking‑head pieces without keyframing, inside the same Runware workflow they already use.

Runware Kling Motion Control demo
Video loads on view

Shot duration and control: The integration highlights one‑take outputs "up to 30s", with the model following a driver video for global body movement while preserving the target character’s look Runware launch.
Creator‑oriented stack: By exposing Kling 2.6 alongside other hosted video models, Runware positions it as a plug‑in option for existing pipelines rather than a separate app, and links directly to the Motion Control endpoint for experimentation model page.

For AI filmmakers and motion designers, this adds another accessible surface where Kling’s reference‑driven animation can be tested against existing storyboards and edit timelines.

Kling 2.6 Motion Control keeps spreading through baby dances and glow‑up spells

Motion Control adoption (Kling): Creators continue to push Kling 2.6 Motion Control into everyday meme formats and test shots, extending the trend coverage from Kling trend where it was highlighted as a 2026 staple for CES‑era creators. Kling’s own promo leans into an upload‑photo‑pick‑template flow, showing a baby photo driving a short dance routine, with Motion Control handling the body movement from a template clip baby dance promo.

Baby dance Motion Control demo
Video loads on view

Template‑driven virality: Kling’s marketing invites users to "watch your cute baby bust out a cool dance" by pairing a still image with a curated motion video, illustrating how non‑technical parents or creators can generate ~15‑second character dances without posing rigs baby dance promo.
Precision motion mimic: A Japanese creator testing Motion Control reports that a character image plus a reference video yields motion that follows "その通りに動いてくれた" (moved exactly as intended), calling the accuracy high and remarking that it’s striking this level of control is already available creator test.
Stylized transformations: Community clips like a "magical transformation spell" where a woman changes outfits mid‑shot demonstrate that the same Motion Control core can support anime‑style glow‑ups and cosplay transitions, not only realistic dance transformation demo.

For AI filmmakers and short‑form storytellers, these examples show Kling 2.6 settling into repeatable recipes—dance templates, transformations, and reference‑driven acting—rather than remaining a one‑off technical showcase.

Kling AI schedules CES 2026 creator panel on GenAI’s impact

CES creator panel (Kling AI): Kling AI announced a CES 2026 Creator Stage panel titled "How GenAI Is Transforming the Creative Industry", set for January 7, 2026, 3:00–3:30 pm PST at LVCC Central Level 1 panel details. Speakers include Emmy‑winning director Jason Zada, Genvid CCO Stephan Bugaj, Higgsfield CEO Alex Mashrabov, and Kling’s Tony Pu, signaling a push to frame Kling 2.6‑style tools as part of a broader pro‑creator stack rather than novelty filters.

The announcement positions Kling alongside partners like Higgsfield and other production veterans, with the session billed as a discussion of how GenAI is changing workflows for commercial and entertainment work rather than a product demo panel details. For filmmakers and designers watching Kling’s trajectory, this is another sign that the company is leaning into industry‑facing messaging at CES rather than focusing only on consumer virality.


🧰 Relight and production prompts: faster look control

A workflow‑heavy day: Higgsfield Relight adds studio‑style lighting control to stills; Luma’s Ray3 Modify shows virtual relighting in video; NB Pro posts cover JSON prompting, vector‑to‑plush pipelines, and collage/grids for consistent looks.

Higgsfield’s Relight turns still images into 3D‑lit “virtual sets”

Relight (Higgsfield AI): Higgsfield introduced Relight, a still‑image lighting tool that lets creators pick light direction, adjust intensity and color temperature, and toggle soft‑to‑hard shadows across six presets, effectively treating a flat render like a simple 3D scene according to the feature rundown in Relight launch; Techhalla’s breakdown shows an arrest photo being reshaped by dragging a virtual key light around the subject to add contrast and color without re‑rendering or reshooting in Relight tutorial. This gives photographers, illustrators, and filmmakers late‑stage control over mood and storytelling beats that used to require new lighting setups.

Relit arrest photo walkthrough
Video loads on view

fofrAI turns Nano Banana JSON prompting into a public guide and app

JSON Prompting (Nano Banana Pro): Following up on earlier JSON prompting experiments for vibe‑locked renders in JSON pipeline intro, fofrAI has packaged their Nano Banana Pro specification into a detailed blog post and an AI Studio app, using a starship‑approaching‑black‑hole shot as the running example in JSON prompting demo. The JSON spec nails down view (third‑person from behind), relative scale between ship, settlement, and black hole, plus lens and flare notes so reruns or variations preserve framing and mood, as shown by the matching still, and the full structure is published in the JSON prompting blog.

Space scene JSON walkthrough
Video loads on view
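To make the structure concrete, here is a hypothetical example of such a spec expressed as a Python dict that serializes to JSON; the field names and values are illustrative only and are not fofrAI’s published schema, which lives in the JSON prompting blog.

```python
# Illustrative structured prompt for a locked-down scene; field names are
# hypothetical, not fofrAI's published Nano Banana Pro schema.
import json

scene_spec = {
    "subject": "starship approaching a supermassive black hole",
    "view": {
        "perspective": "third-person, from behind the ship",
        "framing": "ship in lower-left third, black hole dominating upper-right",
    },
    "relative_scale": {
        "ship": 1,
        "ring_settlement": 40,     # settlement reads ~40x the ship
        "black_hole": 10_000,      # black hole dwarfs everything else
    },
    "camera": {
        "lens_mm": 35,
        "flare": "subtle anamorphic streak from the accretion disk",
    },
    "mood": "awe, scale, quiet dread",
}

# Serializing to JSON keeps reruns and variations pinned to the same framing.
print(json.dumps(scene_spec, indent=2))
```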

Vector‑to‑plush: Illustrator + Nano Banana + Weavy yields full merch shots

2D Vector‑to‑Plush (Ror_Fly): Ror_Fly lays out a six‑step workflow where a hand‑drawn Illustrator vector (Cartman) is turned into an 8K plush render with Nano Banana Pro, then pushed through Weavy and a compositor to generate contact sheets, scale diagrams, and lifestyle photos with real dogs in Vector to plush overview. The node graph and resulting grid show how one system prompt enforces material, seams, proportions, and lighting consistency while downstream prompts spin out Mastiff‑cuddling, greyhound‑chasing, and shelf‑display shots at matching scale.

Vector to plush video demo
Video loads on view

Free “Candid Moments” PDF teaches Nano Banana Pro candid photo prompts

Candid Moments Guide (Nano Banana Pro): ai_artworkgen released a free PDF called Mastering Photography with AI: Candid Moments that focuses on prompt structures for NB Pro images that feel like unposed, in‑between shots instead of stiff portraits, as described in Candid guide intro. The thread collects multiple examples and links a consolidated wrap‑up post in Guide wrap thread, and the PDF itself is available via the candid guide.

K‑pop dance spec shows Nano Banana Pro character consistency across a 2×2 grid

K‑pop Grid (Nano Banana Pro): A highly structured prompt for a young K‑pop idol dancer combines wardrobe, camera (Sony A7R IV at 85mm, f/2), studio setting, LED floor strip, and lighting notes with pose and expression details to produce a hyper‑realistic hero shot in K‑pop scene prompt. A companion instruction then asks for a 2×2 grid of 3:4 verticals with different poses and angles while keeping face, outfit, lighting, and overall style locked, producing four consistent frames that could be a mini lookbook or choreography storyboard.

Luma’s Ray3 Modify shows pure virtual relighting of interior footage

Ray3 Modify (LumaLabsAI): Following up on scene swap demo, which highlighted broader environment changes in Dream Machine, Luma’s latest Ray3 Modify clip focuses on virtual relighting alone, fading a modern kitchen from bright daytime to dark, moody, and then warm evening looks while keeping camera motion and composition intact. The new example reinforces that Ray3 can be used as a post‑production lighting pass for interiors, not only for changing locations, as the kitchen’s layout and objects stay stable while only light direction and ambiance shift in Ray3 relight clip.

Kitchen relit through Ray3 Modify
Video loads on view

NB Pro scrapbook prompt builds a 9:16 “day in my life” collage

Scrapbook Collage (Nano Banana Pro): IqraSaifiii shares a detailed JSON‑style prompt for a vertical 9:16 social “scrapbook collage” with four overlapping panels, sticker text like “a day in my life”, and mixed‑media graphics on a blurred foliage background, all driven from one subject spec in Scrapbook prompt. The recipe locks down demographics, hair, wardrobe, matcha cup prop, smartphone‑style lens/aesthetic, and lighting (dappled afternoon sun in a garden café), plus per‑panel pose and expression notes, so the four photos read as a cohesive vlog sequence rather than unrelated shots.

Crystal Figurines (Nano Banana Pro): IqraSaifiii also publishes a prompt pattern that renders characters like Zootopia’s Nick and Judy, Spider‑Man, or Hello Kitty as translucent colored glass figurines with glossy “crystal” material, floating hearts or apples, and pastel pink studio backdrops in Crystal figurine examples. The template leaves slots to insert any subject and preferred decorative elements while keeping material, lighting, and composition consistent so illustrators or merch designers can quickly spin out coherent collectible series.

Creators port Midjourney looks into Nano Banana Pro for cinematic realism

Style Transfer (Nano Banana Pro): WordTrafficker shows a Nano Banana Pro render labeled “Cinematic realistic style render” that matches a Midjourney source style, while sharing that they are doing broader style transfers from MJ to NBP in Style transfer note. The posted frame—a dramatic low‑key portrait—demonstrates that once a style vocabulary is translated into NB Pro prompt language, artists can get a similar look and feel while benefiting from Nano Banana’s controllable camera and production‑oriented JSON setups.

Mixed‑media portrait prompt overlays drawn “glitch frames” on a photo

Glitch Frames Portrait (Nano Banana Pro): Another NB Pro spec from IqraSaifiii defines a tight, golden‑hour headshot of a woman with specific facial features, copper‑lit hair, and chiaroscuro side lighting, then layers in two hand‑drawn oil‑pastel rectangles over one eye and part of the mouth so the sketched features diverge slightly from the underlying photo in Glitch frame recipe. The JSON breaks out subject anatomy, lighting, exact frame placement, and even a red handwritten signature, turning what would be a single still into a repeatable mixed‑media series template people can reuse in Portrait prompt repost.


🎛️ Other gen‑video engines: mimicry, shot control, worlds

Excludes LTX‑2 and Kling. Roundup of non‑feature tools creatives can use today: PixVerse’s MIMIC animates a still from a driving video, Seedance 1.5 Pro emphasizes shot‑level direction + lip‑sync, and HY‑World 1.5 lowers the bar for world generation.

Tencent’s HY-World 1.5 opens its world model stack for creators

HY-World 1.5 (Tencent Hunyuan): Tencent upgrades its HY-World spatial world model with open training code, faster inference, and a new Lite 5B variant aimed at "small-VRAM GPUs," while removing the waitlist for its online app so anyone can try scene generation and interaction in the browser HY-World 1.5 update. This release positions HY-World as an open, community-driven base for 3D world building and "spatial intelligence," rather than a closed demo.

HY-World 1.5 feature reel
Video loads on view

Open training stack: Tencent shares fully customizable training code so teams can build and fine-tune their own world models on top of HY-World’s recipe, instead of treating it as a black box HY-World 1.5 update.
Performance and access: A Lite 5B model targets lower-VRAM GPUs, while accelerated inference and a zero-waitlist web app focus on real-time interaction and quick iteration for designers and game or film pre-vis work HY-World 1.5 update.

The combination of open code, lighter checkpoints, and immediate web access moves HY-World into the same conversation as other creator-facing world engines, but with more emphasis on transparency and tinkering.

BytePlus Seedance 1.5 Pro leans into shot-level control and tight lip-sync

Seedance 1.5 Pro (BytePlus): Building on Seedance intro where BytePlus framed Seedance 1.5 Pro as the way to lose "AI demo" vibes, the latest promo stresses millisecond‑precise lip-sync, multi-speaker multilingual dialogue, shot-level directorial control, and end‑to‑end audiovisual alignment for cinematic work on ModelArk Seedance spec spot. The message is that teams no longer have to trade visual quality for speed, with the model pitched as handling both.

Seedance 1.5 Pro spot
Video loads on view

Audio‑video alignment: The spot calls out "millisecond‑precise" lip-sync plus multi-speaker dialogue and sound design that follow the cut, positioning Seedance as a tool for scripted scenes where audio timing and shot rhythm matter as much as the frames themselves Seedance spec spot.
Director-style controls: BytePlus repeats that creators can describe shot structure and have the model respect those directions, framing it as shot-level control rather than single-prompt clips meant only for demos Seedance spec spot.
Cultural framing: A separate BytePlusX ad recaps 2025’s "brainrot era" of AI memes, micro‑films, and deepfake battles, tying Seedream 4.5 and related tools into a creator culture where one person often stands in for a whole studio Seedream culture ad.

Together, these posts reinforce Seedance 1.5 Pro as a production‑oriented engine for teams that care about dialogue, edit rhythm, and client‑ready polish, not only eye‑catching one‑offs.

PixVerse’s MIMIC turns one driving video into many animated avatars

MIMIC (PixVerse): Following up on MIMIC launch where PixVerse tied its new motion-clone feature to CES credits, today’s posts focus on how creators can feed a single driving clip into AI Motion Mimicry and have a still image copy movements, expressions, and lip-sync for one‑click avatar spokesperson videos PixVerse feature brief. The feature pitches itself as a way to reuse one performance across many characters without keyframing or manual rigging MIMIC analysis.

PixVerse MIMIC demo
Video loads on view

One video, many looks: PixVerse shows a static portrait that precisely follows the head motion, facial expressions, and mouth shapes from an input video, with on-screen copy reinforcing "one video drives your image" as the core mental model MIMIC analysis.
Avatar and spokesperson focus: Marketing copy frames MIMIC as a tool for AI avatar presenters and reaction clips—"one-click generate AI avatar spokesperson videos"—rather than a general-purpose VFX system, pointing creatives toward explainers, shorts, and social formats PixVerse feature brief.
Ecosystem signal: CES photos underline that PixVerse is pushing this as part of a broader "video is something you play with" positioning, connecting MIMIC to live creator workflows rather than isolated tests PixVerse CES photos.

For filmmakers and social video teams, MIMIC’s promise is about scaling performances across characters and formats while keeping the original acting pass as the single source of truth.


🖌️ Reusable looks: Midjourney styles and art prompts

Mostly styles and templates for visual art. New MJ sref with iridescent grain, a versatile charcoal sketch prompt, community geisha/dragon QTs, plus a translucent crystal figurine look for character renders.

Charcoal sketch prompt pack offers reusable raw, textured illustration look

Charcoal sketch prompt (Azed_ai): Azed_ai shares a generic "charcoal sketch" prompt template aimed at any subject, emphasizing raw, textured linework, expressive shading, and smudged gradients that leave visible sketchbook marks, with multiple example outputs across dancers, knights, clowns, and musicians in the prompt share.

Because the core description is subject-agnostic, illustrators and filmmakers can reuse it for concept art, mood boards, and title illustrations while swapping only the subject, keeping a consistent gritty, analog look around otherwise fully digital projects.

Midjourney sref 1275745115 nails classic European animation aesthetics

Midjourney sref 1275745115 (Artedeingenio): Another Midjourney style ref, --sref 1275745115, is highlighted for capturing classic European animation and Franco‑Belgian comic sensibilities—clean lines, painterly shading, and 70s–80s auteur animation vibes that feel close to Tintin-era films in the style breakdown.

For storyboarders and indie animators, this gives a single handle to lock a whole project into a cohesive retro-European look across characters, crowd scenes, and dialogue shots without hand-tuning every prompt.

Midjourney sref 6139537108 adds iridescent grain and foam-like forms

Midjourney sref 6139537108 (Azed_ai): A new Midjourney style reference, --sref 6139537108, focuses on soft, foam-like abstract forms with heavy grain, iridescent color shifts, and spectral flares, giving artists a reusable look for dreamy, tactile motion graphics and key art as shown in the style thread and reinforced by the later style repost.

This look is tuned for close-up, texture-first compositions, which can double as backgrounds, title cards, or overlays in music videos and film posters where designers need consistent but non-literal visual language.

Nano Banana Pro prompt turns characters into translucent crystal figurines

Crystal figurine render (IqraSaifiii): A detailed Nano Banana Pro prompt recipe shows how to turn existing IP or OCs into stylized translucent glass figurines—crystal bodies, glossy specular highlights, floating hearts, stars, or apples, and pastel backdrops under bright studio lighting—demonstrated with Zootopia, Spider‑Man, and Hello Kitty in the crystal figure set.

This gives character artists and product designers a repeatable way to previsualize collectible toy lines, limited-edition merch, or in-world props with a single aesthetic that still respects each character’s recognizable silhouette and palette.

“QT your dragon” prompt crystallizes a pastel plush mini-dragon look

Pastel plush dragon (Azed_ai): The "QT your dragon" call results in a now-replicable style for tiny, plush-like dragons—soft white bodies, fuzzy texture, glittered pink accents, and a neutral studio backdrop—that reads like toy photography in the dragon base and is echoed in community replies such as dragon variation.

For merch mockups, kidlit covers, and mascot concepts, this prompt pattern gives artists a consistent cute-creature look they can remap onto different color schemes or accessories without rebuilding the aesthetic.

“QT your geisha” thread spins up a watercolor ink-splash portrait style

Geisha watercolor look (Azed_ai): A "QT your geisha" prompt thread showcases a reusable watercolor/ink-splash style—off-center geisha figure, loose brushwork, parasol, and dripping blues and pinks—that leans into high-contrast silhouettes and painterly splatters as seen in the geisha starter.

This style is well-suited for poster art, chapter cards, or character introductions where creators want an expressive, semi-abstract portrait treatment rather than literal 3D renders.

Mixed-media glitch-frame portrait prompt blends photo and oil-pastel overlays

Glitch-frame portrait (IqraSaifiii): A long-form Nano Banana Pro prompt describes a mixed-media portrait style where hyper-real photography is overlaid with hand-drawn oil pastel boxes over one eye and part of the mouth, plus crosshair lines and a red handwritten signature, lit by hard golden-hour side light for a chiaroscuro effect, as detailed in the art overlay prompt and echoed in the supporting job thread.

This recipe effectively packages an album-cover-ready look—half real, half sketched—so photographers, musicians, and filmmakers can generate series of portraits that all share the same glitch-art framing and light while changing only the subject.

NB Pro scrapbook collage prompt captures Gen Z “day in my life” aesthetic

Scrapbook collage layout (IqraSaifiii): Another Nano Banana Pro prompt defines a vertical 9:16 "social media scrapbook" style: four overlapping photo panels, blurred green foliage background, Gen Z vlog stickers like "a day in my life" and "sunny monday", plus doodled outlines and camera icons around a flannel-wearing subject holding a matcha drink, all specified down to lens, ISO, and sunlight quality in the scrapbook spec.

Because the structure, overlays, and camera settings are baked into the text, creators can drop in new outfits or faces while preserving a consistent vlog-collage brand look for shorts, story covers, or campaign templates.


🧱 From one image to 3D and stylized character reels

Compact 3D beat today: local Sharp UI for single‑image‑to‑3D, an end‑to‑end day‑build from images to FBX animation, and stylized character vids blending MJ, NB Pro, Hailuo, and ElevenLabs.

Local Gradio UI lets Apple’s Sharp turn a single image into 3D on your PC

Sharp Gradio UI (Apple / community): A community dev has built a local Gradio web UI for Apple’s Sharp model so you can generate 3D assets from a single image directly on your own machine, rather than relying on a hosted demo, as highlighted in the Sharp 3D UI mention. This puts single-image‑to‑3D in reach for artists who already have a decent GPU and want to iterate on characters, props, and product mocks without cloud latency or per‑asset fees.

For AI creatives, this means Sharp shifts from research paper to practical tool: drop in a concept render or photo and get a starting 3D mesh that can feed into Blender, Unreal, or game engines—tightening the loop between 2D ideation and 3D production.
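As a rough illustration of what a local Gradio wrapper involves (not the community project’s actual code), the sketch below wires a stubbed single-image-to-3D function into a two-component Gradio interface; the real UI would replace the stub with Apple’s Sharp inference call.

```python
# Minimal sketch of a local Gradio UI around a single-image-to-3D model.
# `image_to_mesh` is a deliberate stub; nothing here is taken from the
# community project's code or from Apple's Sharp API.
import gradio as gr

def image_to_mesh(image_path: str) -> str:
    """Stand-in for the inference call: should write a .glb/.obj mesh
    generated from the input image and return its file path."""
    raise NotImplementedError("plug the actual single-image-to-3D model in here")

demo = gr.Interface(
    fn=image_to_mesh,
    inputs=gr.Image(type="filepath", label="Concept render or photo"),
    outputs=gr.Model3D(label="Generated mesh"),
    title="Single image to 3D (local)",
)

if __name__ == "__main__":
    demo.launch()  # serves the UI on localhost; no hosted demo required
```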

One‑day pipeline turns Nano Banana images into FBX animated models via Tripo

Image‑to‑FBX pipeline (Techhalla): Techhalla breaks down a one‑day pipeline where Cursor and Nano Banana Pro concepts are turned into usable FBX animated models via Tripo, then dropped into interactive environments, as shown in the Banana FBX workflow. The demo walks from coding the app with Cursor, to generating stylized banana character references in Nano Banana Pro, to converting a single image into a rigged 3D mesh in Tripo, ending with the character walking around in a simple scene.

Banana 3D character demo
Video loads on view

End‑to‑end stack: Cursor handles code for the interactive viewer; Nano Banana Pro provides consistent character art; Tripo performs image‑to‑3D with animation‑ready output; the result is exported as FBX and dropped into a lightweight environment in under a day Banana FBX workflow.
Why it matters: For indie game devs, motion designers, and small studios, this shows that a single creator can go from concept art to a playable, animated character without traditional modeling or rigging—compressing work that used to take days into a single intense build session.

The clip does not expose polygon counts or deformation quality, so mesh cleanliness and suitability for high‑end production remain open questions, but the pipeline demonstrates that image‑driven 3D character prototyping is now realistic on a tight schedule.

“Riven Black” reel showcases MJ → Nano Banana → Hailuo → ElevenLabs character stack

Riven Black character stack (Heydin_ai): Creator Heydin details the full toolchain behind the “Riven Black” character reel, combining Midjourney for the base look, Nano Banana Pro on Freepik for image development, Hailuo 2.0/2.3 for video, and ElevenLabs for sound design Riven Black clip. The final result is a slow, cinematic 3D‑style turn of a white‑haired, armored character in smoke, fully scored with custom SFX.

Riven Black character turn
Video loads on view

Visual pipeline: A still from Midjourney sets the character design; Nano Banana Pro refines detail and consistency on the key frame; Hailuo 2.x animates that frame into a smooth video shot while preserving the stylized look Riven Black clip.
Audio layer: ElevenLabs provides the sound effects bed, turning what might have been a silent character turntable into a more finished teaser shot ready for inclusion in a reel or mood piece.

For filmmakers and character designers, the stack shows how to blend separate best‑in‑class tools into a cohesive stylized character moment: static generative art for design, another model for polish, a third for motion, and an AI audio engine for atmosphere.


📖 Prompted shorts: Grok shots and indie experiments

Narrative experiments rather than model releases. Grok Imagine nails a 360° action spin other models missed; poetic anime OVA‑style disintegrations; plus a $300 Zelda‑style teaser built in 5 days with Freepik.

$300, 5‑day Zelda‑style teaser built entirely in Freepik

Zelda‑style teaser (Freepik): Filmmaker PJ Accetturo posts a Legend of Zelda–inspired micro‑trailer built entirely inside Freepik in five days on roughly a $300 budget, contrasting that with Nintendo taking 40 years to greenlight an official movie zelda thread. He frames the piece as a gritty, grounded re‑imagining of Hyrule—Zelda sees her home destroyed and has to act—mapped across beats of Fear → Ruin → Rage → Capture → Confrontation, with the teaser cutting between shots of Link on horseback, Hyrule architecture, glowing swords, explosions, and bold text callouts like “HYRULE LORE” that read like a studio‑scale blockbuster.

Zelda-style teaser
Video loads on view

For indie filmmakers and trailer editors, this thread functions as a case study that a single creator can now block out a franchise‑level look and feel using stock‑plus‑AI tooling, detailed story beats, and aggressive editing rather than access to a large on‑set crew.

Grok Imagine nails a 360° action spin other models botch

Grok Imagine 360° shot (xAI): Creator cfryant shows Grok Imagine executing a full 360‑degree rotating arena shot around a chrome‑and‑purple robot, keeping limbs, pose, and environment stable as the camera orbits—other “top models” reportedly broke anatomy or continuity on the same specification in his tests (360 spin demo, anatomy reflection). Following up on prompt control, which highlighted Grok’s adherence on complex prompts, this clip underlines that it can maintain coherent geometry during extreme virtual camera moves that push most current video models beyond their implicit 3D understanding.

360 robot arena spin
Video loads on view

For action directors, previs artists, and game trailer teams, this positions Grok Imagine as a candidate for hero shots that orbit characters without cutting away to hide model failures.

Grok Imagine powers poetic 80s OVA‑style vampire and flower shorts

OVA vampire shorts (Grok Imagine, xAI): Artist Artedeingenio uses Grok Imagine to stage an 80s anime OVA‑style scene where a vampire steps from deep shadow into the first light of dawn and slowly disintegrates into drifting ash and light particles, with wind‑blown remnants fading to nothing over a few seconds vampire short. A separate clip shows a glowing flower blooming and rotating against a dark background, which he cites as evidence that he is “managing to create really beautiful, poetic animations” in Grok that he says he could hardly match with other apps flower short.

OVA vampire disintegration
Video loads on view

For storytellers experimenting with anime‑influenced shorts, these prompts demonstrate that the model can handle slow, emotionally driven visual beats—not only fast, flashy transitions—while keeping style, motion, and particle effects aligned with a clear narrative idea.

Diesol and peers stress film craft alongside AI tools

AI + film craft mindset (Diesol): Filmmaker Diesol outlines a 2026 philosophy for creatives, arguing that people learn camera work, lighting, editing, screenwriting, sound, and music in parallel with re‑learning how to do “it all in AI,” because “story is the game; tools will continue to update,” which in his view makes hybrid skills the safest bet craft advice. In a separate reflection on how embedded AI already feels in daily life—from ChatGPT prompts to shopping and memes—he pushes the idea that it is “time to make AI work for you,” rather than treat it as a far‑off trend ai now reflection. TheoMediaAI reinforces the perspective by noting that “you’ve got [a camera] in your pocket right now” and that a phone can be “kitted out to become a cinematic powerhouse” when paired with AI post‑workflows phone camera remark, while Diesol punctuates the ongoing experiments with a simple clapperboard emoji as he continues shipping AI‑assisted shorts film emoji.

For writers, directors, and editors experimenting with AI, these posts frame the technology as an additional layer on top of classical craft rather than a replacement for it, and as something already woven into day‑to‑day creative work rather than a future milestone.

Grok Imagine doubles as an anime villain design and music sketchpad

Anime villain reel (Grok Imagine, xAI): Another Artedeingenio post compiles a rapid sequence of stylized anime villain portraits generated in Grok Imagine, each framed as a tight character shot with distinctive costume design, lighting, and mood, overlaid with the app’s “GROK IMAGINE” branding villain compilation. The same creator also praises Grok Imagine as “really good for music and songs” after having it generate an 80s‑style power ballad about a dog finding a tennis ball, complete with on‑screen lyrics and playback music demo.

Anime villain cuts
Video loads on view

Taken together, these clips present Grok Imagine as a mixed‑media sketch environment where character designers and writers can iterate on villain looks, tone, and soundtrack ideas inside a single tool rather than jumping between separate image and audio workflows.

Mind Tunnels: Extraction leans on Midjourney for sci‑fi set pieces

Mind Tunnels stills (Diesol): Director‑creator Diesol shares additional AI‑generated shots “from the cutting room floor” of his Mind Tunnels: Extraction project, crediting Midjourney with continuing to “create spectacles like no other” for large‑scale sci‑fi imagery (mind tunnels shots, more stills link). The shared work focuses on richly lit, surreal environments and dramatic compositions rather than character animation, signaling that Midjourney is being used here as a concept‑art and keyframe engine feeding a longer pipeline, not as an end‑to‑end video model.

For sci‑fi storytellers, this shows one pattern where AI image tools handle the heavy lifting on vistas and set pieces, while more traditional editing, compositing, or separate video models handle motion and final cuts.

AI‑driven music video teaser mixes live performance and abstract visuals

AI music video WIP (pzf_ai): Creator pzf_ai teases a new music video now in production, sharing a brief clip that opens on a straightforward studio lip‑sync performance before snapping into a bright, abstract, generative environment where the same performer continues singing amid shifting shapes and colors music video tease. The post does not spell out the exact model stack, but his broader feed leans on modern video generators and enhancement tools, so this functions as another example of musicians wrapping full releases in AI‑heavy visual treatments rather than reserving them for lyric clips.

AI music video WIP
Video loads on view

For artists and labels, it highlights how AI visuals are showing up directly in primary music videos, blending recognizably human performances with stylized, model‑driven worlds.


⚖️ Promptcraft integrity and 2026 Hollywood AI calendar

Community debates and policy timelines: a call‑out against prompt plagiarism, a PSA debunking Grok safety‑setting myths, and a timeline of DOJ/SAG‑AFTRA/Oscars milestones that may reshape AI use in film.

Hollywood’s 2026 AI calendar lines up DOJ action, SAG-AFTRA talks, and Oscars test

Hollywood 2026 AI calendar (AI Films/Zaesarius): Commentator Zaesarius lays out a tight 2026 schedule where US regulators, unions, and awards bodies collide over AI in film, highlighting a DOJ AI Task Force move on January 10 targeting state performer‑likeness protections, SAG‑AFTRA negotiations starting February 9 under Sean Astin, a March 15 Oscars test of how AI‑assisted films are treated, and June 30 as the date when major union contracts expire if no new AI language is agreed, as summarized in the timeline tweet and expanded in the Hollywood AI blog. He frames 2026 as a "reset" year where assumptions about AI in performance, post, and authorship either get rewritten in contracts or dug in as lines in the sand.

Key dates: Jan 10 DOJ AI Task Force action against California‑style performer laws; Feb 9 start of SAG‑AFTRA talks on AI use; Mar 15 Oscars decision on what counts as an AI‑eligible film; Jun 30 expiration of current union agreements if no AI clauses are updated, according to the timeline tweet.
Issues in play: The blog points to tensions between federal attempts to pre‑empt strong state likeness protections, the need to define AI use and residuals inside union contracts, and pressure on studios to show real ROI on large AI investments like Disney’s reported OpenAI spend, as discussed in the Hollywood AI blog.

For AI‑heavy filmmakers and studios, this calendar sketches when legal risk and cultural acceptance around AI‑generated actors, voices, and shots are likely to be contested in public rather than in isolated deals.

Prompt originality debate flares as Azed AI condemns prompt copying

Prompt ethics and originality (azed_ai): Azed AI publicly pushes back on the trend of copying other people’s prompts, tweaking a few words, and presenting them as original work, arguing that “make your own prompts” is about basic integrity rather than gatekeeping, and that small rephrases do not change authorship, according to the prompt integrity rant. He says this behavior is common rather than rare and adds that writing your own prompts is actually easier than reverse‑engineering and renaming someone else’s ideas.

He reinforces the point by continuing to share free, detailed prompt templates like his "charcoal sketch" recipe for raw, textured drawings, which he offers as inspiration for people to build from instead of clone, as shown in the charcoal prompt share and echoed in the charcoal prompt repost. For AI artists and filmmakers who rely on promptcraft as a creative skill, the thread frames originality and attribution as social norms the community is still negotiating, not solved problems.

Ozan Sihay debunks Grok “bikini safety” settings and calls for better research

Grok safety myths (Ozan Sihay/xAI): Turkish creator Ozan Sihay criticizes a wave of near‑identical tech videos that claim changing specific Grok settings will stop the model from generating bikini versions of user photos, calling these claims entirely false and explaining that the toggles in question only control whether user data may be used to improve the AI, not its safety boundaries, per the grok settings PSA. He says he has seen the same misleading tutorial repeated across “many” creators, and urges them not to publish configuration advice without actually understanding what the options do.

He follows up by mocking the trend as an "Exorcist"‑style scare story, reinforcing that these switches are about training consent, not content filters, in the exorcist comment. For creatives using Grok Imagine or similar tools on personal imagery, the thread underlines how misreading UX copy can create a false sense of safety and shows how quickly configuration myths can spread when multiple influencers recycle the same script.


🎙️ Voices and quick scores for creators

Few but useful items: a real deployment case for voice agents in sales ops and a community nod to Grok’s fast song creation for temp tracks and memes.

ElevenLabs voice agents now power 3M+ minutes of CARS24 sales calls

ElevenLabs Agents (ElevenLabs): CARS24 reports that its AI voice agents now handle over 3 million minutes of customer conversations, assisting 45% of sales and cutting calling costs by 50%, while escalating complex cases to human teams when needed according to the CARS24 metrics.

Cars24 agents case study
Video loads on view

This deployment is framed as end-to-end assistance—guiding buyers, addressing objections, and routing edge cases—rather than a narrow IVR, and the company links to a longer discussion of the workflow and results in the attached customer interview.

Grok Imagine gets praise for fast, on-style song generation

Grok Imagine music (xAI): A creator highlights Grok Imagine’s ability to generate catchy, on-brief songs—showing an 80s power ballad about a dog and its tennis ball with lyrics appearing in sync as the track plays in the app UI in the music demo.

Dog power ballad demo
Video loads on view

The post emphasizes how well the model hits style plus topic for quick music and meme content, with the creator saying it is "really good for music and songs" as a practical scoring tool for short-form projects and jokes music demo.


📣 Creator promos and CES tie‑ins

Short‑term offers and presence. Today includes Relight credit giveaways, PixVerse’s CES push and media moments, and ApoB’s 24‑hour credit promo for persona‑driven motion posts.

Higgsfield dangles 220‑credit Relight giveaway to push AI lighting tool

Relight (Higgsfield): Higgsfield is promoting its new Relight image‑relighting tool with a social giveaway that offers 220 credits to users who retweet, reply, follow, and like the launch post, tying a concrete on‑ramp to a pro‑grade feature set credit giveaway. The tool gives creators studio‑style control over light direction, intensity, and color temperature on existing images, with six presets, 3D positioning, and soft‑to‑hard shadow control shown in the launch clip.

Relight interface preview
Video loads on view

Creator workflow angle: Independent creators like Techhalla are already using Relight to re‑light news‑style arrest photos by dragging a virtual key light around the frame and showing how shadows update realistically, as detailed in the workflow thread; they then link a step‑by‑step Higgs workflow that walks through going from a base image to a finished relit shot in minutes workflow guide.
Broader framing: Commentators frame Relight as turning lighting from a capture‑time constraint into a post‑production decision, highlighting backlit portraits, product shots, and stylized color washes as key use cases for the credits‑driven trial ai recap.

The net effect is a classic "engage to earn" promo wrapped around a tool that directly targets cinematographers, photographers, and AI illustrators who care about light as the primary storytelling lever.

PixVerse leans on CES 2026 presence to sell “video you play with” vision

PixVerse at CES 2026 (PixVerse): PixVerse is using its CES 2026 presence to pitch the line that "video is no longer just something you watch, it’s becoming something you play with," expanding on its earlier booth‑gift strategy at the show PixVerse CES. The team posted photos of founders and creators around a poolside media area, an indoor presentation screen with the PixVerse logo, and an outdoor interview setup with camera crews filming in front of a pool backdrop ces recap.

MIMIC feature framing (PixVerse): Alongside the on‑site push, AI commentators spotlight PixVerse’s new MIMIC capability, which lets a single driving video control an image’s body movement, facial expressions, and lip‑sync for spokesperson‑style clips mimic explainer; the demo shows a static portrait coming to life in sync with a reference performance, summarized as “one video drives your image.”

MIMIC motion demo
Video loads on view

For filmmakers and social video creators, the combination of CES stage time, live interviews, and highly shareable MIMIC demos positions PixVerse as an AI video brand aimed at play and interactivity rather than pure behind‑the‑scenes tooling.

Apob AI re-ups 1,000‑credit Remotion offer around persona consistency pitch

Remotion (Apob AI): Apob AI is again offering 1,000 credits for 24 hours to users who retweet, reply, follow, and like its Remotion promo, this time framing the tool explicitly as a solution to "the biggest struggle for AI creators"—keeping characters visually consistent across content remotion promo. This builds on the earlier Remotion launch and credit drive that focused on studio‑quality headshots and simple portrait animation ApoB promo.

Remotion persona demo
Video loads on view

Brand persona angle: The new clip shows a stylized 3D avatar speaking with synchronized facial movement and overlays like "Character Consistency Solved" and "Create Once, Deploy Everywhere," positioning Remotion as infrastructure for a single digital persona that can appear across platforms without constantly re‑prompting remotion promo.
Creator economy framing: The messaging ties the short‑term credit boost directly to long‑horizon creator branding, arguing that one persistent avatar can front a "24/7 content house" while remaining on‑brand, which is a clear pitch to influencers, VTubers, and small studios trying to scale output without fracturing identity.

For storytellers building recurring characters or virtual hosts, the refreshed promo highlights that Apob is as focused on marketing the persona use case as on the underlying face‑animation tech.

Dreamina AI wraps New Year fireworks challenge with 1,000‑credit winner

Fireworks Challenge (Dreamina AI): Dreamina AI announced the winner of its New Year "AI Fireworks" challenge, selecting creator @WindEcho87 for a 1,000‑credit prize and closing a week‑long event that asked users to generate fireworks‑themed images or videos with Dreamina winner announcement. The contest ran from Dec 31, 2025 to Jan 7, 2026, with one winner, results promised a day after the event, and rewards to be distributed within three working days as laid out in the original rules winner announcement.

The promo leaned on guest judge @AllaAisling and New Year timing to encourage experimentation with Dreamina’s image/video capabilities, giving AI artists and motion designers a clear, time‑boxed brief plus a tangible credit reward that can roll back into future projects on the platform.

Producer.ai spotlights community-built Spaces after first challenge winners

Spaces community push (Producer.ai): Producer.ai is following its first Spaces community challenge—where they named winners like grufel, ChaoticGood, scuti0, VOXEFX, and SplusT—with a new spotlight on creator‑built Spaces such as a mini piano and a "Keyboard Hero" rhythm game, treating these as early examples of artists turning AI into playable instruments and tools spaces spotlight.

Mini piano and game
Video loads on view

The team emphasizes that Spaces let artists build interactive instruments, games, and plugins that others can use directly on the platform, and the staff‑pick promotion plus earlier challenge prizes create a loop where builders get visibility while Producer.ai curates a gallery of AI‑powered toys and utilities for other creatives to explore.


🏗️ Studios, hardware, and robotics in creative pipelines

Business/infrastructure items relevant to media makers: Runway shares a customer case, NVIDIA Rubin perf numbers surface, and robotics stacks integrate open components for creator‑friendly demos.

NVIDIA Vera Rubin NVL72 specs land: 3.6 EFLOPS inference, 1.6 PB/s bandwidth

Vera Rubin NVL72 (NVIDIA): New slides from CES put NVIDIA’s Vera Rubin NVL72 system at around 3.6 EFLOPS of inference and 2.5 EFLOPS of training performance, with an internal fabric offering 1.6 PB/s of bandwidth, plus 54 TB of LPDDR5X and 20.7 TB of HBM across the pod, according to the shared spec image in the rubin spec slide.

Per‑GPU Rubin numbers: The same material calls out a single Rubin GPU at 50 PFLOPS NVFP4 inference, 35 PFLOPS NVFP4 training, 22 TB/s of HBM4 bandwidth, 3.6 TB/s NVLink, and 336B transistors—roughly 2.8× the HBM bandwidth and 5× the NVFP4 inference of Blackwell according to the slide in the rubin spec slide.
Why creatives care: These numbers frame what "top shelf" compute looks like for next‑gen video and world models—Runway’s Gen‑4.5 and similar systems targeting Rubin-class hardware can reasonably assume multiple EFLOPS of budget for multi-minute, multi-character shots, rather than the tens of PFLOPS available on a single current GPU.

For studios planning 2026+ render farms or cloud deals, Rubin NVL72 effectively sets the ceiling for how heavy an AI-first film or series pipeline can get before compute—not model capacity—becomes the main constraint again.
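As a quick sanity check on the quoted per‑GPU multipliers, dividing the Rubin figures by commonly cited Blackwell ballparks (assumed here, not taken from the slide) lands close to the stated 2.8× and 5×:

```python
# Sanity-check the "2.8x HBM bandwidth" and "5x NVFP4 inference" multipliers.
# Rubin figures come from the slide summary above; the Blackwell baselines are
# assumed ballpark numbers, not values quoted in the source.
rubin_hbm_tb_s = 22.0        # per-GPU HBM4 bandwidth (from the slide)
rubin_nvfp4_pflops = 50.0    # per-GPU NVFP4 inference (from the slide)

blackwell_hbm_tb_s = 8.0     # assumed HBM3e bandwidth per Blackwell GPU
blackwell_fp4_pflops = 10.0  # assumed dense FP4 inference per Blackwell GPU

print(f"HBM bandwidth ratio:   {rubin_hbm_tb_s / blackwell_hbm_tb_s:.1f}x")        # ~2.8x
print(f"NVFP4 inference ratio: {rubin_nvfp4_pflops / blackwell_fp4_pflops:.1f}x")  # 5.0x
```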

Boston Dynamics puts Atlas humanoid into production with Gemini Robotics and Hyundai plan

Atlas humanoid (Boston Dynamics): Boston Dynamics is moving its Atlas humanoid into production with specs like 360° torso rotation, a 4‑hour battery, and a 110‑pound lift capacity, in parallel with a partnership where Google DeepMind’s Gemini Robotics models will handle industrial tasks, according to the summary in the atlas product specs.

Atlas parkour demo
Video loads on view

Following up on Atlas partnership that flagged the Gemini Robotics tie‑in, the new update adds a deployment roadmap: joint research on automotive manufacturing begins this year, with Atlas fleets slated for Hyundai facilities from 2028, giving a concrete timeline for when AI‑driven humanoids might show up alongside humans on factory‑style sets atlas product specs.

For filmmakers, experiential designers, and theme-park style builders, a production Atlas with learned behaviors from Gemini moves robots from one‑off stunt rigs into something that could be rented, scripted, and choreographed like any other performer—backed by a published hardware spec instead of a black‑box prototype.

NVIDIA and Hugging Face fuse open Isaac robotics into LeRobot for creator demos

Isaac × LeRobot (NVIDIA / Hugging Face): NVIDIA Robotics and Hugging Face are integrating NVIDIA’s open Isaac robotics stack directly into the LeRobot library, turning Reachy Mini and similar bots into open, scriptable platforms that can be driven by cloud models and HF tooling, as teased in the isaac lerobot note and amplified in the robotics collab.

Robots as an "app store": HF’s team describes Reachy Mini as heading toward an "App Store of Robotics," where owners can build and share apps that run on real hardware—early signs are community-made reactions and behaviors surfacing over the holidays, as highlighted in the reachy mini apps.
Dev surface for creatives: Arm’s demo space and code for Reachy reactions, wired through Hugging Face Spaces and Isaac, show how animators or interaction designers can prototype gestures and performances in Python notebooks or web UIs instead of proprietary vendor tools, then push those to physical robots on stage or on set reachy demo link.

For studios experimenting with physical AI characters—installations, live events, or hybrid film/robotic performances—this Isaac–LeRobot stack points to a future where "rigging" a robot could feel a lot closer to rigging a character in a DCC than wiring a one‑off ROS stack.

Runway spotlights OBSIDIAN studio’s AI-first campaigns for Disney, Nike, Wrangler, Hyundai

OBSIDIAN workflows (Runway): Runway is showcasing how creative studio OBSIDIAN has built full campaign workflows for brands like Disney, Nike, Wrangler, and Hyundai around Runway’s video models and tools, with 15+ projects already shipped on this stack, according to the customer story in the runway obsidian story and the detailed runway case study.

Director-led, AI-heavy pipeline: Obsidian runs small (10–15 person) director-led teams that mix live action, traditional post, and Runway generative tools in the same timeline, letting directors iterate on casting, character design, and beats directly in the AI stack rather than handing off long spec docs—see the process breakdown in the runway case study.
From prepro to post: The case study describes AI use from previsualization (concept frames, design passes) through production-time shot creation and real-time editing, then into post for VFX and polish, so the same AI environment stays present from first pitch to final delivery runway obsidian story.

For filmmakers and studio leads, this positions Runway not as a single tool or a series of disconnected "AI passes," but as the backbone of a compact, always-on campaign studio.

Hedra recaps Character-3, Hedra Studio, Live Avatars as it tees up 2026

Hedra platform (Hedra Labs): Avatar platform Hedra is marking 2025 as its biggest year so far, calling out the launch of its Character‑3 model, the Hedra Studio web environment, and its Live Avatars feature as key building blocks for creators, in a short recap reel aimed at the community in the hedra 2025 recap.

Hedra 2025 montage
Video loads on view

The team stresses that these tools already underpin a stream of viral clips, animations, and podcast‑style content from users, and that they are "deep in the office" working on the next wave of capabilities for 2026, without yet giving concrete feature or pricing details hedra 2025 recap.

For small studios and solo creators, this reinforces Hedra’s positioning as a dedicated avatar stack rather than a generic video model—something closer to a standing virtual talent roster plus production UI than a raw API.

Pictory case: doctor grows to 2,000 followers with text-to-video health shorts

Pictory AI Studio (Pictory): Pictory is sharing a case study where Dr. Neejad Chidiak scaled from zero to over 2,000 followers by using its text‑to‑video tools to produce hundreds of short, educational health videos without traditional editing experience, as outlined in the pictory doctor case and the linked pictory case study.

The promo ties this back to Pictory’s broader pitch: scripts, blogs, or rough ideas can flow directly into short-form content that feels "ready for social" through automatic editing, voice, and layout, which the team frames as enough for a solo professional to keep pace with platforms that reward near‑daily posting pictory doctor case.

For educators, coaches, and niche experts considering AI-first production, this is another signal that turnkey studios like Pictory are leaning hard on real‑world creator examples rather than pure tech demos to show how far you can push content volume from a laptop.


🧪 Multimodal and control research worth bookmarking

Paper links and demos that could shape tools: unified AR text‑image modeling (NextFlow), open world models (HY‑World 1.5), small‑model reasoning (Falcon‑H1R), long‑context multilingual MoE (K‑EXAONE), scene manipulation via RL (Talk2Move), omni‑modal generation (VINO), and small agentic models (Youtu‑LLM).

NextFlow proposes unified autoregressive model for text–image tokens

NextFlow multimodal transformer: The NextFlow paper proposes a single decoder-only autoregressive model that jointly handles interleaved text and image tokens, trained on 6 trillion discrete tokens to support unified multimodal understanding and generation, as summarized in the paper thread and detailed in the ArXiv paper. For images it uses next-scale prediction rather than raster-scan token ordering, and the authors report 1024×1024 image generation in around five seconds, which positions the model as a candidate backbone for future creative tools that mix long-form text with dense visual content.
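For intuition on what next-scale prediction changes, here is a minimal sketch assuming a coarse-to-fine token pyramid in the style of prior next-scale work; the grid sizes are illustrative and not taken from the NextFlow paper. Instead of emitting one token at a time in raster order, the model emits a whole low-resolution token grid per step and conditions the next, finer grid on it, which is why far fewer sequential steps are needed.

```python
# Illustrative contrast between raster-scan and next-scale token orderings.
# Grid sizes are invented for this example, not taken from the NextFlow paper.

def raster_scan_order(h, w):
    """Classic AR image ordering: one token per step, row by row."""
    return [(y, x) for y in range(h) for x in range(w)]

def next_scale_order(scales):
    """Next-scale ordering: each step emits an entire token grid at the next
    resolution, conditioned on all coarser grids emitted before it."""
    return [[(y, x) for y in range(h) for x in range(w)] for h, w in scales]

print(len(raster_scan_order(32, 32)))                          # 1024 sequential steps
pyramid = next_scale_order([(1, 1), (2, 2), (4, 4), (8, 8), (16, 16), (32, 32)])
print(len(pyramid))                                            # 6 sequential steps
print(sum(len(step) for step in pyramid))                      # 1365 tokens across all scales
```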

Tencent’s HY‑World 1.5 opens code and adds 5B lite world model

HY-World 1.5 world model (Tencent Hunyuan): Tencent’s Hunyuan team announces HY-World 1.5, highlighting open training code for its world model stack, accelerated inference, and a new 5B Lite model designed to fit GPUs with less VRAM while supporting interactive spatial environments, as outlined in the HY-World update. The release also removes the online waitlist so anyone can try the web app, framing HY-World as an open, community-driven platform for spatial intelligence that could underpin future simulation-heavy creative tools.

HY-World 1.5 feature reel
Video loads on view

Falcon-H1R shows small 7B model can rival larger reasoners

Falcon-H1R small reasoning model: The Falcon-H1R report introduces a 7B-parameter language model that aims to match or beat state-of-the-art reasoning performance from models 2–7× larger by focusing on efficient test-time scaling and a tailored training recipe, according to the paper thread and the accompanying ArXiv paper. The work leans on supervised fine-tuning, reinforcement-learning-based scaling, and careful data curation for reasoning-heavy tasks, suggesting that future creative and coding assistants may not always need frontier-sized models to deliver strong step-by-step reasoning.
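As a reference point for what "test-time scaling" buys a small model, the sketch below shows the simplest variant, self-consistency majority voting over multiple sampled answers; this is a generic illustration, not Falcon-H1R's actual recipe, and the stand-in model is a toy.

```python
# Generic test-time scaling via majority voting (self-consistency).
# Not Falcon-H1R's specific method; the "model" here is a toy stand-in.
from collections import Counter
import random

def toy_small_model(question: str) -> str:
    """Stand-in for sampling one reasoning trace + answer from a 7B model;
    assume each independent sample is correct 60% of the time."""
    return "42" if random.random() < 0.6 else str(random.randint(0, 9))

def answer_with_majority_vote(question: str, n_samples: int = 16) -> str:
    votes = Counter(toy_small_model(question) for _ in range(n_samples))
    return votes.most_common(1)[0][0]

random.seed(0)
print(answer_with_majority_vote("What is 6 * 7?"))  # the vote usually recovers "42"
```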

LG’s K‑EXAONE MoE model targets 256k‑token multilingual reasoning

K-EXAONE MoE model (LG AI Research): LG AI Research’s K-EXAONE technical report describes a Mixture-of-Experts language model with 236 billion total parameters but only 23 billion active per inference, built for long-context (up to 256,000 tokens) and six-language coverage including Korean, English, Spanish, German, Japanese, and Vietnamese, according to the paper summary and the linked ArXiv report. Benchmark results in the report place K-EXAONE alongside other large open-weight models on reasoning and general understanding tasks, making it a notable candidate foundation for multilingual creative and coding tools that need very long scripts or documents in context.
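For readers less familiar with Mixture-of-Experts accounting, the gap between 236B stored and roughly 23B active parameters comes from routing each token through only a small subset of experts. The sketch below uses invented expert counts and sizes, chosen purely so the totals land near the report's headline numbers; K-EXAONE's real layer layout will differ.

```python
# Illustrative MoE parameter accounting. Every value below is invented so the
# totals roughly match K-EXAONE's headline figures; none of these are the
# model's actual expert count, expert size, or shared-parameter budget.
SHARED_PARAMS = 8.8e9         # attention, embeddings, non-expert layers (hypothetical)
N_EXPERTS = 128               # routed experts, flattened across layers (hypothetical)
PARAMS_PER_EXPERT = 1.775e9   # hypothetical
TOP_K = 8                     # experts consulted per token (hypothetical)

stored = SHARED_PARAMS + N_EXPERTS * PARAMS_PER_EXPERT
active = SHARED_PARAMS + TOP_K * PARAMS_PER_EXPERT

print(f"stored parameters:       {stored / 1e9:.0f}B")   # ~236B
print(f"active parameters/token: {active / 1e9:.0f}B")   # ~23B
```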

Talk2Move uses RL to move objects in scenes via text

Talk2Move scene control RL: Talk2Move explores reinforcement learning for "text-instructed object-level geometric transformation in scenes," training agents to move objects inside 3D environments in response to natural-language instructions, as introduced in the paper teaser. The demo shows objects being repositioned and rotated to satisfy commands like moving items around a room, signalling a research path toward finer-grained, instruction-driven scene editing for 3D layouts and potentially video.

Talk2Move scene demo
Video loads on view
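The paper teaser gives little implementation detail, so the sketch below is purely hypothetical: it only illustrates the general shape of text-conditioned object repositioning, with a dense distance reward toward a target pose the agent has grounded from the instruction, and a greedy hill-climb standing in for the learned policy. None of it is taken from Talk2Move.

```python
# Hypothetical illustration of text-instructed object repositioning.
# Nothing here is from the Talk2Move paper; the hill-climb stands in
# for a trained RL policy, and the target is assumed to be grounded
# from the instruction by some upstream language module.
import math
import random

def reward(obj_xy, target_xy):
    """Dense reward: closer to the instruction's implied position is better."""
    return -math.dist(obj_xy, target_xy)

def propose_move(_state):
    """Stand-in policy: propose a small 2D translation for the object."""
    return (random.uniform(-0.2, 0.2), random.uniform(-0.2, 0.2))

# "Move the lamp next to the sofa" -> grounded (hypothetically) as a coordinate.
target = (2.0, 1.0)
obj = (0.0, 0.0)

random.seed(1)
for _ in range(200):
    dx, dy = propose_move(obj)
    candidate = (obj[0] + dx, obj[1] + dy)
    if reward(candidate, target) > reward(obj, target):  # keep only improving moves
        obj = candidate

print(f"final object position: ({obj[0]:.2f}, {obj[1]:.2f}), target: {target}")
```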

VINO unifies visual generation with interleaved omni-modal context

VINO unified visual generator: The VINO project presents a "Unified Visual Generator with Interleaved OmniModal Context" that can insert characters, change styles, and swap identities within a scene while keeping layout and lighting consistent, as shown in the Akita dog, anime, and Goku examples in the VINO examples. The grid illustrates how a single living-room shot can be reused while the subject morphs from a dog to different human and anime characters, highlighting a direction where future tools may treat object insertion, character replacement, and style transfer as one coherent generative operation.

Youtu‑LLM explores agentic intelligence in a 2B‑parameter small model

Youtu-LLM lightweight agentic model (Tencent): The Youtu-LLM paper introduces a 1.96B-parameter model trained from scratch with a Multi-Latent Attention architecture, a 128k-token context window, and a roughly 11-trillion-token curriculum that shifts from commonsense to STEM and agent-style tasks, as summarized in the paper highlight and expanded in the ArXiv paper. The authors emphasize "native agentic potential" via a Commonsense-STEM-Agent curriculum and scalable agentic mid-training, suggesting that small models on consumer hardware could increasingly handle planning and tool-using behavior for creative workflows rather than serving only as slim chat front ends.
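The Commonsense-STEM-Agent curriculum described in the paper is, at heart, a staged shift in the training data mixture. The proportions and stage names below are invented purely to illustrate the shape of such a schedule and are not the paper's actual ratios or boundaries.

```python
# Hypothetical curriculum schedule; the stage names echo the paper's
# Commonsense-STEM-Agent framing, but every ratio here is invented.
CURRICULUM = [
    # (stage, commonsense, stem, agent) data-mixture weights
    ("early pretraining",    0.70, 0.25, 0.05),
    ("mid training",         0.40, 0.45, 0.15),
    ("agentic mid-training", 0.20, 0.30, 0.50),
]

for stage, commonsense, stem, agent in CURRICULUM:
    assert abs(commonsense + stem + agent - 1.0) < 1e-9, "mixture weights must sum to 1"
    print(f"{stage:<22} commonsense={commonsense:.2f} stem={stem:.2f} agent={agent:.2f}")
```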
