A Dockerized promptsmith for the AI-art lazy
Wraps three Hugging Face models in Docker so you can reverse-engineer or translate image-generation prompts without writing them yourself.

What it does
This project packages a prompt generator, a Chinese-to-English translator, and a CLIP image parser into Docker containers with a Gradio web UI. You feed it an image or a vague Chinese description; it spits out polished English prompts tuned for MidJourney or Stable Diffusion.
The interesting bit
The author treats Docker not as deployment packaging but as the entire development environment — Nvidia PyTorch base images, CPU and GPU variants, the works. It is a personal workflow made public, blog posts and all.
Key highlights
- Reverse-prompts images via CLIP, then embellishes them with a text2image model trained on MidJourney data
- Accepts Chinese input and translates it to English prompt syntax automatically
- Ships separate CPU and GPU Dockerfiles; the CPU path is viable on Apple Silicon
- Web UI runs on port 7860, standard Gradio fare
- Builds from Nvidia’s official PyTorch base image rather than pulling a pre-baked hub image
Caveats
- The README’s CPU
docker runcommand still includes--gpus all, which will fail on CPU-only hosts; copy-paste with caution - No pre-built images on DockerHub; you build everything locally, which “saves a lot of time” only if your bandwidth to Nvidia’s registry is better than to DockerHub
- Documentation assumes you have already bought into the author’s Docker-heavy deep-learning setup
Verdict
Worth a look if you generate a lot of AI art and would rather automate prompt engineering. Skip it if you already have a preferred prompt workflow or if “build three Docker images before lunch” sounds like a chore.