
ace-step/ACE-Step

Open-source foundation model for music generation using diffusion and deep compression autoencoders.

4.6k stars Python Image · Video · Audio
Velocity (7d): +11 ★ / day · Trend: steady

ACE-Step is a foundation model for music generation that combines diffusion-based generation with Sana’s Deep Compression AutoEncoder and a lightweight linear transformer. During training it uses MERT and m-hubert to align semantic representations, which speeds up convergence. The model can synthesize up to 4 minutes of music in approximately 20 seconds on an A100 GPU, which the project describes as significantly faster than LLM-based baselines while offering better musical coherence and controllability.
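To put the throughput figure in perspective, here is a minimal sketch that converts the quoted numbers into a real-time factor (seconds of audio produced per second of compute). The helper `real_time_factor` is a hypothetical name used only for illustration and is not part of ACE-Step. The inputs are the approximate upper-bound figures from the description, not a measured benchmark.

```python
def real_time_factor(audio_seconds: float, wall_seconds: float) -> float:
    """Return seconds of audio generated per second of wall-clock time."""
    if wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    return audio_seconds / wall_seconds


# Figures taken from the description above (approximate, A100 GPU).
audio_seconds = 4 * 60  # "up to 4 minutes" of music
wall_seconds = 20       # "approximately 20 seconds" of generation time

rtf = real_time_factor(audio_seconds, wall_seconds)
print(f"Roughly {rtf:.0f}x faster than real time")
```

Under these assumptions the model produces about 12 seconds of audio for every second of compute. Actual numbers will vary with hardware, clip length, and sampling settings.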
