lucidrains/muse-maskgit-pytorch
A PyTorch implementation of Muse, a text-to-image generation model that uses masked generative transformers.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
This repository provides a complete implementation of the Muse text-to-image generation model in PyTorch. It includes a VQGanVAE for encoding images into discrete token sequences and a MaskGitTransformer that generates images from text prompts through iterative masked token prediction. The implementation supports training the VAE component on image folders and then using it with the transformer for text-conditional image synthesis.