← all repositories

tianweiy/CausVid

CausVid is an autoregressive video diffusion model that distills a 50-step bidirectional model into a 4-step streaming generator capable of 9.4 FPS text-to-video generation on a single GPU.

1.4k stars Python Image · Video · Audio
CausVid
Velocity · 7d
+3.0
★ / day
Trend
steady
star history

The project adapts a pretrained bidirectional diffusion transformer into an autoregressive transformer that generates frames on-the-fly for interactive applications. It extends distribution matching distillation (DMD) to video, compressing a 50-step diffusion model into a 4-step generator. The approach introduces student initialization based on teacher’s ODE trajectories and asymmetric distillation to mitigate error accumulation in autoregressive generation, enabling high-quality long-duration video synthesis.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.