← all repositories

lucidrains/DALLE2-pytorch

A PyTorch implementation of OpenAI's DALL-E 2 text-to-image diffusion model.

11.3k stars Python Image · Video · Audio
DALLE2-pytorch
Velocity · 7d
+7.4
★ / day
Trend
steady
star history

This repository implements DALL-E 2, OpenAI’s text-to-image synthesis model, in PyTorch. It replicates the two-stage architecture: a diffusion prior network that predicts image embeddings from CLIP text embeddings, and a decoder that generates images from those embeddings. The implementation focuses on the diffusion prior variant, which was the best-performing approach.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.