← all repositories
zsdonghao/text-to-image

Teaching GANs to read flower catalogs

A 2016 paper implementation that generates flower images from text descriptions, built when TensorFlow 1.x was fresh and "skip thought vectors" sounded futuristic.

text-to-image
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

What it does

Feed it a sentence like “yellow anther red pistil and bright red petals” and it generates a matching flower image using the GAN-CLS algorithm. The repo implements the paper Generative Adversarial Text-to-Image Synthesis atop a DCGAN base, with TensorLayer handling the neural network plumbing.

The interesting bit

The project layers text conditioning onto a GAN in a fairly direct way: encode the caption, feed it to both generator and discriminator, and hope the visual-textual alignment holds. The README’s sample outputs show the model did learn something about color and petal structure, though “blue and white” flowers from 2016-era GANs are best viewed with generous squinting.

Key highlights

  • Implements the full GAN-CLS training pipeline with Oxford-102 flowers dataset
  • Built on DCGAN in TensorFlow, with TensorLayer 1.4+ as the higher-level API
  • Includes data download automation via downloads.py
  • Ships with pre-built model definitions, data loader, and training script
  • Apache 2.0 licensed

Caveats

  • Locked to TensorFlow 1.x and TensorLayer 1.4+ — both are effectively frozen in time
  • Dataset setup requires manual path shuffling even with downloads.py
  • No quantitative metrics or comparison tables in the README; quality assessment is purely visual

Verdict

Worth a look if you’re studying the evolution of text-to-image models or need a clean, small-scale GAN-CLS reference implementation. Skip it if you want production code — this is a 2016 research reproduction with 2016 dependencies, and Stable Diffusion it ain’t.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.