lucidrains/video-diffusion-pytorch
A Pytorch implementation of diffusion models for generating videos from text descriptions.

Velocity · 7d
+0.9
★ / day
Trend
→steady
star history
This repository implements Jonathan Ho’s Video Diffusion Models paper, extending denoising diffusion probabilistic models (DDPMs) from 2D images to 3D video generation. It uses a space-time factored U-net architecture and incorporates BERT-large for text embedding conditioning to enable text-to-video synthesis. The model can be trained on video data and sampled to generate new videos.