ali-vilab/dreamtalk
A diffusion-based framework that generates high-quality talking head videos from audio input with expressive speaking styles.

Velocity · 7d
+2.0
★ / day
Trend
→steady
star history
DreamTalk generates expressive talking head videos from audio using diffusion probabilistic models. It takes audio input combined with style clips and pose references to produce realistic talking head videos. The framework handles diverse inputs including songs, multilingual speech, and noisy audio, and can animate various portrait types with different speaking styles.