
lucidrains/musiclm-pytorch

A PyTorch implementation of Google's MusicLM, a transformer-based model that generates music from text descriptions.

3.3k stars · Python · Image · Video · Audio
Velocity (7d): +2.7 ★ / day · Trend: steady

This repository contains a PyTorch implementation of MusicLM, Google’s state-of-the-art model for music generation. It pairs MuLaN, a text-audio contrastive learning model, with AudioLM’s architecture to generate music from text descriptions. MuLaN is built from two components, an AudioSpectrogramTransformer and a TextTransformer, which are trained jointly so that text embeddings and audio embeddings land in a shared space; those embeddings then condition the AudioLM stage for text-driven music synthesis.
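To make the text-audio alignment step more concrete, here is a minimal sketch of a generic symmetric softmax contrastive objective of the kind such models are commonly trained with. It is not the repository's code: the function names, the `temperature` value, and the toy 2-dimensional embeddings are all assumptions chosen for illustration, and it uses only the Python standard library so it runs anywhere.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    peak = max(row)
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in row))
    return [x - log_sum for x in row]


def contrastive_loss(audio_embeds, text_embeds, temperature=0.1):
    """Symmetric contrastive loss (illustrative, hypothetical helper).

    Row i of audio_embeds and row i of text_embeds are assumed to be a
    matching pair; every other row in the batch acts as a negative.
    """
    audio = [l2_normalize(a) for a in audio_embeds]
    text = [l2_normalize(t) for t in text_embeds]
    n = len(audio)

    # Pairwise cosine similarities, scaled by the temperature.
    sims = [
        [sum(a_k * t_k for a_k, t_k in zip(a, t)) / temperature for t in text]
        for a in audio
    ]

    # Audio-to-text direction: each audio clip should pick out its own caption.
    audio_to_text = -sum(log_softmax(sims[i])[i] for i in range(n)) / n

    # Text-to-audio direction: each caption should pick out its own clip.
    columns = [[sims[i][j] for i in range(n)] for j in range(n)]
    text_to_audio = -sum(log_softmax(columns[j])[j] for j in range(n)) / n

    return (audio_to_text + text_to_audio) / 2


# Toy embeddings: in `aligned`, pair i points the same way in both modalities;
# in `shuffled`, the captions are swapped, so the matching pairs disagree.
aligned = contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.1], [0.1, 1.0]])
shuffled = contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.1, 1.0], [1.0, 0.1]])
print(aligned < shuffled)
```

Minimizing a loss of this shape pulls each audio embedding toward its paired text embedding and pushes it away from the other captions in the batch, which is the sense in which the two encoders are "aligned". In a real training setup the embeddings would come from the two transformer encoders rather than hand-written lists.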
