teticio/audio-diffusion
Audio generation model using diffusion architecture to synthesize music from noise.

Velocity · 7d
+0.6
★ / day
Trend
→steady
star history
This project adapts image-based diffusion models to generate audio waveforms instead. It leverages the Hugging Face diffusers library to train and run diffusion models that produce music. The repository provides a Gradio web interface for audio generation and includes pretrained latent audio diffusion models. Users can generate audio samples by sampling from the diffusion process, with support for conditional generation based on text or audio encodings.