Text-to-Audio/AudioLCM

A PyTorch implementation of AudioLCM for efficient, high-quality text-to-audio generation using latent consistency models.

1.2k stars · Python · Image · Video · Audio
Velocity (7d): +1.6 ★/day · Trend: steady

AudioLCM is a generative model that produces audio from text descriptions. It uses latent consistency models, a distillation technique derived from consistency models that operates in a compressed latent space, so sampling takes a few steps rather than the many iterations of a standard diffusion sampler while keeping fidelity high. The repository provides a PyTorch implementation and pretrained models, so researchers and developers can generate audio samples from text prompts.
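
As a rough illustration of why a consistency model can sample quickly, the sketch below runs generic multistep consistency sampling on a toy latent. It is not taken from the AudioLCM repository and does not use its API: `consistency_fn`, `multistep_sample`, `TARGET`, and the constants are hypothetical names and values chosen for illustration, and plain Python stands in for PyTorch so the example runs without dependencies. A real model would replace `consistency_fn` with a trained, text-conditioned network.

```python
import math
import random

EPS = 0.002        # smallest noise level; f(x, EPS) must return x unchanged
SIGMA_DATA = 0.5   # assumed data scale used in the skip weighting
TARGET = [0.3, -0.7, 1.1]  # hypothetical "clean latent" the toy model predicts


def consistency_fn(x, t):
    """Toy stand-in for a trained consistency model f(x_t, t) -> x_0."""
    # Skip weight is 1 at t == EPS (boundary condition) and decays as t grows.
    c_skip = SIGMA_DATA**2 / ((t - EPS) ** 2 + SIGMA_DATA**2)
    return [c_skip * xi + (1.0 - c_skip) * ti for xi, ti in zip(x, TARGET)]


def multistep_sample(timesteps, rng):
    """Multistep consistency sampling over decreasing noise levels."""
    t_max = timesteps[0]
    # Start from pure noise at the largest level and jump to an estimate of x_0.
    x = consistency_fn([t_max * rng.gauss(0.0, 1.0) for _ in TARGET], t_max)
    for t in timesteps[1:]:
        # Re-noise the estimate to level t, then denoise again in one call.
        scale = math.sqrt(t**2 - EPS**2)
        x_t = [xi + scale * rng.gauss(0.0, 1.0) for xi in x]
        x = consistency_fn(x_t, t)
    return x


sample = multistep_sample([80.0, 1.0], random.Random(0))  # two model calls
print(sample)
```

Each entry in `timesteps` costs one call to the model, so the list length sets the speed and quality trade-off: a single entry gives one-step generation, and extra entries refine the estimate.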
