← all repositories

declare-lab/tango

Tango is a diffusion-based text-to-audio generation system that uses LLMs to guide the audio synthesis process.

1.2k stars Python Image · Video · Audio
tango
Velocity · 7d
+1.1
★ / day
Trend
steady
star history

Tango is a family of diffusion models for text-to-audio generation. It leverages large language models to guide the diffusion process, enabling generation of audio from textual descriptions. The project includes multiple model versions (Tango, Tango2) and associated datasets like Audio-Alpaca. It provides model weights on HuggingFace and demo applications for generating sound effects and audio from text prompts.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.