
fpgaminer/joycaption

An open-source image captioning Visual Language Model built for generating training data for Diffusion models.

1.2k stars · Jupyter Notebook · Image · Video · Audio · Language Models
Velocity (7d): +2.0 ★ / day · Trend: steady

JoyCaption is a Visual Language Model designed to generate captions for images. It is built from the ground up as a free, open, and uncensored model to serve the AI art community. The model processes images and outputs detailed textual descriptions, which can be used as training data annotations for Diffusion models. Training scripts and methodology details are released openly with the weights.
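
As a rough illustration of how generated captions might become training annotations, the sketch below writes one `.txt` file next to each image. The sidecar-file layout is an assumption (a convention some diffusion training tools accept), not something this page states, and `write_caption_sidecars`, `placeholder_captioner`, and `IMAGE_SUFFIXES` are hypothetical names. The captioner here is a stand-in and does not call JoyCaption; a real model call would be swapped in for it.

```python
from pathlib import Path
from typing import Callable

# Assumed set of image extensions; adjust to the dataset at hand.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def write_caption_sidecars(
    image_dir: Path, caption_image: Callable[[Path], str]
) -> list[Path]:
    """Write one .txt caption file beside each image; return the files written."""
    written = []
    for image_path in sorted(image_dir.iterdir()):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # skip anything that is not an image
        caption = caption_image(image_path).strip()
        sidecar = image_path.with_suffix(".txt")
        sidecar.write_text(caption + "\n", encoding="utf-8")
        written.append(sidecar)
    return written


def placeholder_captioner(image_path: Path) -> str:
    # Hypothetical stand-in: a real captioning model would be called here.
    return f"a placeholder caption for {image_path.name}"
```

Calling `write_caption_sidecars(Path("dataset"), placeholder_captioner)` on a hypothetical `dataset` directory would leave `photo.txt` beside `photo.png`, and so on. Passing the captioner in as a function keeps the file handling independent of whichever model produces the text.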
