Francis-Rings/StableAnimator
A video diffusion model that animates a reference image using pose sequences while preserving identity, published at CVPR 2025.

StableAnimator is an end-to-end video diffusion framework that synthesizes high-quality identity-preserving videos from a reference image and pose sequences without post-processing. It addresses challenges in temporal consistency and facial fidelity during animation using a novel approach that avoids relying on face-swapping or restoration tools. The model generates videos directly from pose guidance while maintaining the subject’s identity.