ivcylc/OpenMusic
A diffusion transformer model that generates high-quality music from text descriptions.

Velocity · 7d
+0.8
★ / day
Trend
→steady
star history
OpenMusic provides the official PyTorch implementation of QA-MDT (Quality-Aware Masked Diffusion Transformer), a state-of-the-art text-to-music generation system. The model uses a masked diffusion transformer architecture trained on audio-text pairs to synthesize music from natural language prompts. It achieved top rankings on MusicCaps and Song Describer Dataset benchmarks and supports extended-length music generation in a zero-shot manner.