open-mmlab/FoleyCrafter
Diffusion-based video-to-audio generation system that synthesizes realistic, synchronized sound effects from silent video input.

Velocity · 7d
+0.9
★ / day
Trend
→steady
star history
FoleyCrafter is a video-to-audio generation framework that produces realistic sound effects semantically relevant and synchronized with video content. It leverages diffusion models to analyze video frames and generate appropriate audio effects that match the visual actions and movements. The system is designed to automate foley sound synthesis for video editing, content creation, and multimedia production workflows.