lucidrains/DALLE2-pytorch
A PyTorch implementation of OpenAI's DALL-E 2 text-to-image diffusion model.

Velocity (7d): +7.4 ★/day · Trend: steady
This repository implements DALL-E 2, OpenAI’s text-to-image synthesis model, in PyTorch. It replicates the two-stage architecture: a diffusion prior network that predicts a CLIP image embedding from a CLIP text embedding, and a decoder that generates an image from that image embedding. The implementation focuses on the diffusion prior rather than the autoregressive alternative, since the diffusion prior was the best-performing variant.
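To make the two-stage data flow concrete, here is a minimal toy sketch in plain Python. It is not the repository's API: every function name (`toy_text_encoder`, `toy_prior`, `toy_decoder`, `generate`) and every constant is a hypothetical stand-in, and the "denoising" loops simply pull random noise toward a target instead of calling a trained network. It shows only the order of the stages and what each one consumes and produces.

```python
import random

EMBED_DIM = 8    # toy embedding size (assumption; real CLIP embeddings are much larger)
IMAGE_SIZE = 4   # toy "image" is an IMAGE_SIZE x IMAGE_SIZE grid of floats


def toy_text_encoder(text, dim=EMBED_DIM):
    """Hypothetical stand-in for a CLIP text encoder: text -> fixed-length vector."""
    rng = random.Random(text)  # seeding with the string makes the output deterministic
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def toy_prior(text_embed, steps=10, seed=0):
    """Stage 1 stand-in: text embedding -> image embedding.

    Starts from Gaussian noise and refines it step by step, conditioned on
    the text embedding. A trained prior network would predict each update;
    here the update is faked by moving halfway toward the text embedding.
    """
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in text_embed]
    for _ in range(steps):
        x = [xi + 0.5 * (ti - xi) for xi, ti in zip(x, text_embed)]
    return x


def toy_decoder(image_embed, size=IMAGE_SIZE, steps=10, seed=0):
    """Stage 2 stand-in: image embedding -> image.

    Starts from a noisy grid and refines it, conditioned on the image
    embedding. The conditioning here is just the mean of the embedding.
    """
    rng = random.Random(seed)
    target = sum(image_embed) / len(image_embed)
    img = [[rng.gauss(0.0, 1.0) for _ in range(size)] for _ in range(size)]
    for _ in range(steps):
        img = [[p + 0.5 * (target - p) for p in row] for row in img]
    return img


def generate(text):
    """Full pipeline: text -> text embedding -> image embedding -> image."""
    text_embed = toy_text_encoder(text)
    image_embed = toy_prior(text_embed)
    return toy_decoder(image_embed)


image = generate("a corgi playing a trumpet")
print(len(image), "x", len(image[0]))
```

The point of the split is that the decoder never sees the text directly in this sketch: it is conditioned only on the image embedding that the prior produced.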