FoundationVision/FlashVideo
A diffusion-based text-to-video generation model that efficiently produces high-resolution videos from text prompts.

Velocity (7d): +0.9 ★/day · Trend: steady
FlashVideo is a diffusion-based model for high-resolution video generation. It works in two stages: the first generates a low-resolution video from the text prompt, and the second upscales that video to high resolution at low additional computational cost. The system targets efficient inference through flow matching and progressive resolution refinement.
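The two-stage idea can be illustrated with a toy sketch. This is not FlashVideo's code or API: every function name here (`generate_low_res`, `upsample_nearest`, `refine_with_flow`, `two_stage_generate`) is hypothetical, the first stage is replaced by random noise, and the learned velocity network is replaced by a closed-form field that points at a known target. It only shows the control flow: produce a low-resolution clip, upsample it, then refine it with a few Euler steps along a flow.

```python
import numpy as np

def generate_low_res(prompt, frames=4, height=8, width=8, seed=0):
    """Stage 1 stand-in (hypothetical): a real model would condition on `prompt`.
    Here we just return a random clip of shape (frames, H, W, 3)."""
    rng = np.random.default_rng(seed)
    return rng.random((frames, height, width, 3))

def upsample_nearest(video, scale):
    """Nearest-neighbour spatial upsampling along the height and width axes."""
    return video.repeat(scale, axis=1).repeat(scale, axis=2)

def refine_with_flow(x0, velocity_fn, num_steps=4):
    """Stage 2 stand-in: Euler integration of dx/dt = v(x, t) from t=0 to t=1."""
    x = x0.copy()
    dt = 1.0 / num_steps
    for k in range(num_steps):
        t = k * dt  # t stays strictly below 1 inside the loop
        x = x + dt * velocity_fn(x, t)
    return x

def two_stage_generate(prompt, velocity_fn, scale=2, num_steps=4):
    """Run both stages and return (low_res_clip, refined_high_res_clip)."""
    low = generate_low_res(prompt)
    coarse = upsample_nearest(low, scale)
    return low, refine_with_flow(coarse, velocity_fn, num_steps)

# Toy velocity field (assumption): a straight-line flow toward a known target.
# A trained model would predict the velocity instead of reading the target.
target = np.full((4, 16, 16, 3), 0.5)
low, high = two_stage_generate("a cat", lambda x, t: (target - x) / (1.0 - t))
print(low.shape, high.shape)
```

With this particular straight-line velocity field, Euler integration lands on the target after any number of steps, which is why a handful of steps suffices in the toy case; a learned field would only approximate this behaviour.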