← all repositories

nihaomiao/CVPR23_LFDM

A latent flow diffusion model that generates videos from input images using optical-flow-based motion estimation.

473 stars Python Image · Video · Audio
CVPR23_LFDM
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

This repository provides the PyTorch implementation of a CVPR 2023 paper on conditional image-to-video generation. The model uses latent flow diffusion to synthesize realistic videos from single images, incorporating optical flow estimation to model motion patterns. The implementation includes training code and pretrained models for multiple datasets (MUG, MHAD, NATOPS) and supports cross-domain applications such as animating faces from the FaceForensics dataset.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.