smthemex/ComfyUI_Sonic
A ComfyUI custom node for audio-driven portrait animation using the Sonic method with Whisper and Stable Video Diffusion models.

Velocity (7d): +2.3 ★/day · Trend: steady
This repository provides a ComfyUI integration for Sonic, an audio-driven portrait animation technique that animates a person’s face from an audio track. It uses OpenAI’s Whisper-tiny model for audio processing and Stability AI’s Stable Video Diffusion (SVD) model for video generation. Given a portrait image and an audio clip, it generates a talking-head animation synchronized to the audio.
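
To make the image-plus-audio input shape concrete, here is a minimal sketch of a ComfyUI custom node that accepts a portrait image and an audio clip and works out how many video frames would be needed to cover the audio. It is not the node shipped in this repository: the class `SonicSketchNode`, the method `plan`, the helper `frames_for_audio`, and the 25 fps default are all hypothetical names and values chosen for illustration. It assumes ComfyUI's usual node conventions (`INPUT_TYPES`, `RETURN_TYPES`, `FUNCTION`, `NODE_CLASS_MAPPINGS`) and that an `AUDIO` input arrives as a dict with a `waveform` tensor and a `sample_rate`.

```python
import math


def frames_for_audio(num_samples: int, sample_rate: int, fps: float = 25.0) -> int:
    """Return the number of video frames needed to cover an audio clip.

    Hypothetical helper: rounds up so the video is never shorter than the audio.
    """
    if sample_rate <= 0 or fps <= 0:
        raise ValueError("sample_rate and fps must be positive")
    duration_s = num_samples / sample_rate
    return math.ceil(duration_s * fps)


class SonicSketchNode:
    """Hypothetical node skeleton; not the actual node from this repository."""

    @classmethod
    def INPUT_TYPES(cls):
        # Declares the sockets and widgets ComfyUI shows for this node.
        return {
            "required": {
                "image": ("IMAGE",),   # portrait image
                "audio": ("AUDIO",),   # driving audio clip
                "fps": ("FLOAT", {"default": 25.0, "min": 1.0, "max": 60.0}),
            }
        }

    RETURN_TYPES = ("INT",)
    RETURN_NAMES = ("frame_count",)
    FUNCTION = "plan"                  # name of the method ComfyUI calls
    CATEGORY = "examples/sonic_sketch"

    def plan(self, image, audio, fps):
        # Assumption: AUDIO is a dict whose "waveform" has shape
        # [batch, channels, samples], alongside an integer "sample_rate".
        num_samples = audio["waveform"].shape[-1]
        return (frames_for_audio(num_samples, audio["sample_rate"], fps),)


# ComfyUI discovers nodes through this mapping in the package's __init__.py.
NODE_CLASS_MAPPINGS = {"SonicSketchNode": SonicSketchNode}
```

A real Sonic node would run the audio encoder and the video diffusion model inside the node method; the sketch stops at frame planning so that it stays runnable without model weights.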