zai-org/CogVideo
Open-source text-to-video and image-to-video generation model using diffusion transformers and LLM-based understanding.

Velocity · 7d
+8.7
★ / day
Trend
→steady
star history
CogVideo and CogVideoX are open-source video generation systems that produce videos from text prompts or static images. The models employ diffusion-based architectures with LLM components for video understanding and generation. The repository includes inference code, LoRA fine-tuning capabilities, and DDIM inversion support for the generated videos.