← all repositories

lucidrains/deep-daze

A CLI tool that generates images from text descriptions using OpenAI's CLIP model combined with Siren implicit neural representations.

4.3k stars Python Image · Video · Audio
deep-daze
Velocity · 7d
+2.2
★ / day
Trend
steady
star history

Deep Daze is a text-to-image generation tool that optimizes implicit neural representations (Siren networks) guided by CLIP visual-semantic similarity. Users provide text prompts via command line, and the system iteratively updates an image representation to maximize similarity with the text embedding, producing novel images matching the description.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.