Text-to-Audio/AudioLCM
A PyTorch implementation of AudioLCM for efficient, high-quality text-to-audio generation using latent consistency models.

AudioLCM is a generative model that creates audio from text descriptions. It uses latent consistency models, which build on consistency models and distill a latent diffusion model so that sampling takes only a few steps, to make audio synthesis fast while keeping fidelity high. The repository provides a PyTorch implementation along with pretrained models, so researchers and developers can generate audio samples from textual prompts.
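To illustrate why a consistency model can sample in very few steps, here is a minimal sketch of multistep consistency sampling on a toy problem. It is not AudioLCM's code or API. The names `toy_consistency_fn` and `multistep_consistency_sample`, the noise levels `EPS` and `T_MAX`, and the 1-D Gaussian "data" are all assumptions chosen for illustration. In a real system the consistency function would be a trained network operating on audio latents and conditioned on the text prompt.

```python
import math
import random

EPS = 0.002    # smallest noise level (assumed value, for illustration)
T_MAX = 80.0   # largest noise level (assumed value, for illustration)


def toy_consistency_fn(x, t, mu=1.5, s=0.5):
    """Stand-in for a trained consistency model (hypothetical).

    For toy data distributed as N(mu, s^2) and the noising process
    x_t = x_0 + t * z, the probability-flow ODE has a closed-form solution,
    so the map from a noisy point at level t back to level EPS can be
    written exactly. It satisfies the boundary condition f(x, EPS) = x.
    """
    scale = math.sqrt(s ** 2 + EPS ** 2) / math.sqrt(s ** 2 + t ** 2)
    return [mu + (xi - mu) * scale for xi in x]


def multistep_consistency_sample(f, dim, timesteps, rng):
    """Draw one sample of size `dim` using len(timesteps) + 1 calls to f.

    `timesteps` is a decreasing list of noise levels in (EPS, T_MAX).
    An empty list gives one-step generation.
    """
    # Start from pure noise at the largest noise level and jump to a
    # clean estimate with a single evaluation of f.
    x_t = [rng.gauss(0.0, T_MAX) for _ in range(dim)]
    x0 = f(x_t, T_MAX)
    for t in timesteps:
        # Re-noise the current estimate to level t, then denoise again.
        noise_std = math.sqrt(t ** 2 - EPS ** 2)
        x_t = [xi + noise_std * rng.gauss(0.0, 1.0) for xi in x0]
        x0 = f(x_t, t)
    return x0


if __name__ == "__main__":
    rng = random.Random(0)
    one_step = multistep_consistency_sample(toy_consistency_fn, 4, [], rng)
    three_step = multistep_consistency_sample(toy_consistency_fn, 4, [20.0, 5.0], rng)
    print("one-step sample:  ", one_step)
    print("three-step sample:", three_step)
```

Each extra entry in `timesteps` costs one more evaluation of the consistency function, so the step count trades speed against sample quality. With a learned, imperfect model the additional steps typically refine the result. In this toy case the function is exact, so one step already yields samples from the target distribution.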