jy0205/Pyramid-Flow
A flow-matching-based autoregressive video generation model that produces 10-second 768p videos at 24 FPS from text or image inputs.

Velocity (7d): +5.2 ★/day · Trend: steady
Pyramid Flow implements a pyramidal approach to flow matching for training-efficient video generation. The model is trained only on open-source datasets and uses a DiT (Diffusion Transformer) backbone. It supports both text-to-video and image-to-video generation with stable, natural motion, and the project releases model checkpoints and training code for the community.
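
For readers unfamiliar with flow matching, here is a minimal sketch of the generic linear-path objective in NumPy. It is not the repository's pyramidal variant or its API: the function names `flow_matching_pair` and `flow_matching_loss`, the toy array shapes, and the stand-in predictor are all assumptions made for illustration. The idea is to interpolate between a noise sample and a data sample, then regress a model's output onto the constant velocity of that straight path.

```python
import numpy as np

def flow_matching_pair(x0, x1, t):
    """Build the training pair for a straight-line path from x0 (noise) to x1 (data).

    Returns the interpolated point x_t and the target velocity x1 - x0.
    """
    # Reshape t from (batch,) to (batch, 1, ..., 1) so it broadcasts over features.
    t = np.asarray(t, dtype=x0.dtype).reshape(-1, *([1] * (x0.ndim - 1)))
    x_t = (1.0 - t) * x0 + t * x1
    v_target = x1 - x0
    return x_t, v_target

def flow_matching_loss(predict_velocity, x0, x1, t):
    """Mean squared error between a predicted velocity and the straight-path velocity."""
    x_t, v_target = flow_matching_pair(x0, x1, t)
    v_pred = predict_velocity(x_t, t)
    return float(np.mean((v_pred - v_target) ** 2))

# Toy data: 4 samples with 8 features each (hypothetical stand-ins for video latents).
rng = np.random.default_rng(0)
x0 = rng.normal(size=(4, 8))  # noise samples
x1 = rng.normal(size=(4, 8))  # "data" samples
t = rng.uniform(size=4)       # one time value in [0, 1) per sample

# A placeholder model that always predicts zero velocity; a real model would be a network.
loss = flow_matching_loss(lambda x_t, t: np.zeros_like(x_t), x0, x1, t)
```

In an actual training loop the lambda would be replaced by a trainable network (a DiT in this project's case), and the loss would be minimized with a gradient-based optimizer.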