← all repositories

open-mmlab/FoleyCrafter

Diffusion-based video-to-audio generation system that synthesizes realistic, synchronized sound effects from silent video input.

651 stars Python Image · Video · Audio
FoleyCrafter
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

FoleyCrafter is a video-to-audio generation framework that produces realistic sound effects semantically relevant and synchronized with video content. It leverages diffusion models to analyze video frames and generate appropriate audio effects that match the visual actions and movements. The system is designed to automate foley sound synthesis for video editing, content creation, and multimedia production workflows.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.