YBYBZhang/ControlVideo
A training-free controllable text-to-video generation system using diffusion models and ControlNet.

Velocity (7d): +0.8 ★/day · Trend: steady
ControlVideo adapts ControlNet to video generation without any fine-tuning, enabling high-quality, temporally consistent text-to-video synthesis. It uses Stable Diffusion v1.5 as the base model, with ControlNet conditioning providing spatial control. It also uses RIFE for frame interpolation and supports both ControlNet 1.0 and 1.1.
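
To give a rough feel for what frame interpolation contributes to temporal consistency, here is a minimal toy sketch. It is not taken from the ControlVideo code: RIFE is a learned interpolation network, and the plain averaging below is only a stand-in for it. The names `blend_midpoint` and `smooth_middle_frames`, the `offset` parameter, and the flattened-list frame representation are all hypothetical and chosen for illustration.

```python
from typing import List

# Toy representation (assumption): a frame is a flat list of pixel intensities.
Frame = List[float]


def blend_midpoint(prev_frame: Frame, next_frame: Frame) -> Frame:
    """Average two frames pixel by pixel.

    Stand-in for a learned interpolator such as RIFE, which would estimate
    motion instead of simply averaging.
    """
    if len(prev_frame) != len(next_frame):
        raise ValueError("frames must have the same number of pixels")
    return [(a + b) / 2.0 for a, b in zip(prev_frame, next_frame)]


def smooth_middle_frames(frames: List[Frame], offset: int = 0) -> List[Frame]:
    """Replace every second frame with the interpolation of its two neighbours.

    `offset` (0 or 1) selects whether odd or even interior frames are replaced.
    Neighbours are always read from the original input, so the result does not
    depend on the order in which frames are processed.
    """
    out = [list(f) for f in frames]  # copy so the input is left untouched
    for mid in range(1 + offset, len(frames) - 1, 2):
        out[mid] = blend_midpoint(frames[mid - 1], frames[mid + 1])
    return out


if __name__ == "__main__":
    # Frames 1 and 3 flicker; interpolation pulls them back in line.
    video = [[0.0], [10.0], [2.0], [10.0], [4.0]]
    print(smooth_middle_frames(video))
```

In this toy clip the two flickering frames are replaced by values that sit between their neighbours, which is the intuition behind using an interpolator to reduce frame-to-frame jitter. A real pipeline would operate on image tensors and call the actual RIFE model rather than averaging.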