Tencent-Hunyuan/HunyuanVideo-Foley
A Tencent Hunyuan diffusion model that synthesizes high-fidelity foley audio from video or text prompts.

Velocity (7d): +3.5 ★/day · Trend: steady
HunyuanVideo-Foley is a multimodal diffusion model that uses representation alignment to generate professional-grade foley sound effects synchronized with video content. It accepts video, text, or both as input and produces high-fidelity audio tracks for use in video production. Pre-trained weights and demo Spaces for experimentation are available on Hugging Face.