afiaka87/clip-guided-diffusion
A CLI tool that generates images from text prompts using CLIP-guided diffusion models from OpenAI.

This repository implements CLIP-guided diffusion, a text-to-image synthesis technique in which CLIP's image-text similarity steers a diffusion model's sampling so that the generated image matches the text prompt. Users run the tool from a CLI to produce images from natural language descriptions, optionally logging intermediate outputs to Weights & Biases (wandb) and saving results as GIFs or videos. The project requires PyTorch and downloads pretrained checkpoints published by OpenAI and Katherine Crowson.
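The guidance idea can be illustrated with a toy sketch. It is not this repository's code and uses neither CLIP nor a diffusion model: plain lists of floats stand in for embeddings, and the helper names (`cosine_similarity`, `cosine_grad`, `guide`) and the `guidance_scale` parameter are hypothetical. It shows only the guidance term, that is, repeatedly nudging a sample along the gradient of its cosine similarity to a target embedding, which is roughly the role CLIP plays during each sampling step.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def cosine_grad(a, b):
    """Analytic gradient of cosine_similarity(a, b) with respect to a."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return [
        y / (norm_a * norm_b) - dot * x / (norm_a ** 3 * norm_b)
        for x, y in zip(a, b)
    ]


def guide(sample, target, steps=50, guidance_scale=0.5):
    """Toy guidance loop (assumption: stands in for the CLIP guidance term).

    A real sampler would interleave this nudge with the diffusion model's
    own denoising update; that part is omitted here.
    """
    x = list(sample)
    for _ in range(steps):
        grad = cosine_grad(x, target)
        # Move the sample toward higher similarity with the target.
        x = [xi + guidance_scale * gi for xi, gi in zip(x, grad)]
    return x


if __name__ == "__main__":
    start = [1.0, 0.0, 0.0]   # stand-in for a noisy sample's embedding
    target = [0.0, 1.0, 0.0]  # stand-in for the text prompt's embedding
    result = guide(start, target)
    print("before:", cosine_similarity(start, target))
    print("after: ", cosine_similarity(result, target))
```

In the real technique the similarity is computed by CLIP's image and text encoders and the gradient is taken with respect to the image through automatic differentiation, so the analytic gradient above is only a stand-in.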