Picsart-AI-Research/Text2Video-Zero
A zero-shot text-to-video generation system that repurposes text-to-image diffusion models for video synthesis.

Velocity · 7d
+3.6
★ / day
Trend
→steady
star history
Text2Video-Zero enables video generation from textual prompts without model fine-tuning by leveraging pre-trained text-to-image diffusion models. It introduces motion dynamics and temporal consistency mechanisms to extend still image synthesis to coherent video output. The system supports multiple generation modes including text-only generation, pose-guided and edge-guided generation, and instruction-guided video editing via Video Instruct-Pix2Pix.