← all repositories

ivcylc/OpenMusic

A diffusion transformer model that generates high-quality music from text descriptions.

632 stars Python Image · Video · Audio
OpenMusic
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

OpenMusic provides the official PyTorch implementation of QA-MDT (Quality-Aware Masked Diffusion Transformer), a state-of-the-art text-to-music generation system. The model uses a masked diffusion transformer architecture trained on audio-text pairs to synthesize music from natural language prompts. It achieved top rankings on MusicCaps and Song Describer Dataset benchmarks and supports extended-length music generation in a zero-shot manner.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.