memoavatar/memo
A diffusion model that generates expressive talking videos from audio and a reference face image.

Velocity · 7d
+1.4
★ / day
Trend
→steady
star history
MEMO is a memory-guided diffusion model for synthesizing realistic talking head videos with natural facial expressions and head movements. It takes audio input and a reference face image to generate temporally consistent video sequences. The model is available on Hugging Face with community integrations including ComfyUI and Gradio apps.