
ali-vilab/dreamtalk

A diffusion-based framework that generates high-quality talking head videos from audio input with expressive speaking styles.

1.8k stars · Python · Image · Video · Audio
Velocity (7d): +2.0 ★/day · Trend: steady

DreamTalk generates expressive talking head videos from audio using diffusion probabilistic models. It combines the input audio with style clips and pose references to produce realistic results. The framework handles diverse audio, including songs, multilingual speech, and noisy recordings, and can animate various portrait types in different speaking styles.
