lucidrains/musiclm-pytorch
A PyTorch implementation of Google's MusicLM, a transformer-based model that generates music from text descriptions.

Velocity · 7d
+2.7
★ / day
Trend
→steady
star history
This repository contains a PyTorch implementation of MusicLM, Google’s SOTA model for music generation. It combines MuLaN, a text-audio contrastive learning model, with AudioLM’s architecture to generate high-quality music from text descriptions. The system uses AudioSpectrogramTransformer and TextTransformer components trained together to align text embeddings with audio representations for conditional music synthesis.