Dreamverse pairs Hao AI Lab's FastVideo stack with an interface for editing video scenes in a faster-than-playback loop, using quantization and fused kernels to keep latency below viewing time. The stack is worth a look if you are building real-time multimodal generation or multi-user video serving.

Dreamverse is a prototype interface on top of FastVideo that aims to make video generation interactive instead of asynchronous. Hao AI Lab’s launch thread frames the change against current systems that “take minutes” for a 5-second 1080p clip, while Dreamverse is presented as a live loop where users can keep steering the same scene as outputs come back.
The loop is deliberately short: “Generate a clip → watch it → edit,” and the workflow post gives concrete examples such as “Slow the camera” and “Change the background.” That matters because the system is not described as one-shot prompt generation; it is positioned as scene iteration with continuity across revisions. The public demo is available via the Dreamverse app, and Hao AI Lab’s blog post describes this as “vibe directing” rather than prompt-and-wait generation.
Hao AI Lab attributes the speed to a new real-time inference stack inside FastVideo. In the team’s technical thread, the named ingredients are fast attention backends, 4-bit quantization, fused kernels, and “optimized multi-user serving.” That last item is the most deployment-relevant detail in the announcement, because it suggests the work is not only about a single offline benchmark run.
The practical bar here is unusual: generation has to stay below playback time so that, in the technical thread’s phrasing, the “creative loop stays alive.” That makes Dreamverse interesting beyond video UX. If the claim holds under load, the same stack design points toward real-time multimodal apps where responsiveness matters more than maximizing per-clip quality, especially for serving setups that need iterative edits instead of long queued renders.
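Hao AI Lab has not published the details of its quantization scheme, so as a reference point for what “4-bit quantization” typically means, here is a minimal per-tensor symmetric int4 sketch; all names and shapes are illustrative, not taken from FastVideo:

```python
import numpy as np

def quantize_int4(w: np.ndarray):
    # Per-tensor symmetric quantization to the int4 range [-8, 7].
    # scale maps the largest-magnitude weight to the value 7.
    scale = float(np.abs(w).max()) / 7.0
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize_int4(q: np.ndarray, scale: float) -> np.ndarray:
    # Recover an approximation of the original weights.
    return q.astype(np.float32) * scale

w = np.random.randn(64, 64).astype(np.float32)
q, s = quantize_int4(w)
w_hat = dequantize_int4(q, s)
# Worst-case rounding error for values inside the range is half a step.
err = float(np.abs(w - w_hat).max())
```

Real 4-bit serving stacks refine this with per-channel or per-group scales and fused dequantize-matmul kernels, but the storage win (4 bits per weight instead of 16 or 32) comes from exactly this mapping.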
Flash-MoE now shows SSD-streamed expert weights pushing a 397B Qwen3.5 variant onto an iPhone at 0.6 tokens per second, extending its earlier laptop demos. Treat it as a memory-tiering prototype rather than a deployable mobile serving target, because speed, heat, and context headroom remain tight.
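Flash-MoE’s internals are not spelled out in the demo, but the core memory-tiering idea — keep only the recently used experts in RAM and stream the rest from SSD on demand — can be sketched as a small LRU cache. Everything below (class name, loader, capacities) is illustrative, not Flash-MoE’s actual API:

```python
from collections import OrderedDict

import numpy as np

class ExpertCache:
    """Hold up to `capacity` expert weight tensors in memory;
    any other expert is fetched via `loader` (standing in for an SSD read)."""

    def __init__(self, loader, capacity: int):
        self.loader = loader          # callable: expert_id -> weight array
        self.capacity = capacity
        self.cache = OrderedDict()    # insertion/recency-ordered
        self.misses = 0

    def get(self, expert_id: int) -> np.ndarray:
        if expert_id in self.cache:
            self.cache.move_to_end(expert_id)   # mark as recently used
            return self.cache[expert_id]
        self.misses += 1                        # a miss costs an SSD read
        w = self.loader(expert_id)
        self.cache[expert_id] = w
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)      # evict least recently used
        return w

# Hypothetical loader that fabricates a tiny weight matrix per expert.
loader = lambda i: np.full((2, 2), float(i), dtype=np.float32)
cache = ExpertCache(loader, capacity=2)
for eid in (0, 1, 0, 2):
    cache.get(eid)
```

The 0.6 tokens/s figure makes sense in this frame: every cache miss is bounded by SSD bandwidth, so throughput is governed by how many distinct experts each token’s routing touches, not by compute.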
OpenClaw shipped version 2026.3.22 with ClawHub, OpenShell plus SSH sandboxes, side-question flows, and more search and model options, then followed with a 2026.3.23 patch. Teams get a broader plugin surface, but should patch quickly and review plugin trust boundaries as the ecosystem grows.
Cursor shipped Instant Grep, a local regex index built from n-grams, inverted indexes, and Bloom filters that drops large-repo searches from seconds to milliseconds. Faster candidate retrieval shortens the coding-agent loop, especially when ripgrep-style scans become the bottleneck.
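Cursor has not published Instant Grep’s layout, but the candidate-filtering idea behind n-gram indexes is well established: map every trigram to the set of files containing it, intersect the sets for the trigrams a query requires, and only scan that small candidate set. A minimal sketch under those assumptions (class and method names are hypothetical):

```python
from collections import defaultdict

def trigrams(text: str):
    # Every overlapping 3-character substring of the text.
    return {text[i:i + 3] for i in range(len(text) - 2)}

class TrigramIndex:
    """Trigram inverted index for literal-substring candidate filtering.
    A production system would add a regex-to-trigram query planner and a
    Bloom-filter layer; this shows only the core intersection step."""

    def __init__(self):
        self.index = defaultdict(set)   # trigram -> {paths}
        self.files = {}                 # path -> contents

    def add(self, path: str, text: str):
        self.files[path] = text
        for g in trigrams(text):
            self.index[g].add(path)

    def search_literal(self, needle: str):
        grams = trigrams(needle)
        if not grams:                   # query too short to filter; scan all
            candidates = set(self.files)
        else:
            sets = [self.index.get(g, set()) for g in grams]
            candidates = set.intersection(*sets)
        # Verify candidates with a real scan to drop false positives.
        return sorted(p for p in candidates if needle in self.files[p])

idx = TrigramIndex()
idx.add("a.py", "def handle_request(req):")
idx.add("b.py", "print('hello world')")
```

The speedup comes from the intersection step: a miss in any one trigram’s posting set eliminates a file without ever reading it, which is why indexed search scales with result size rather than repo size.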
ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
Epoch AI says GPT-5.4 Pro elicited a publishable solution to one 2019 conjecture in its FrontierMath Open Problems set, with a formal writeup planned. Treat it as an early milestone worth reproducing, not blanket evidence that frontier models can already automate math research.
(1/N) We're launching Dreamverse. Most AI video models take minutes to generate a 5 s 1080p clip. In 4.5 seconds, we can generate 30 s 1080p clips on a single GPU. Our videos generate faster than you can watch them: stop waiting on prompts and start directing scenes live.
(3/N) Under the hood, this runs on our new real-time inference stack in FastVideo (our open-source video model post-training/inference framework): • fast attention backends • 4-bit quantization • fused kernels • optimized multi-user serving • and much more 🤫 Fast enough…