fpgaminer/joycaption
An open-source image-captioning Visual Language Model built to generate training data for diffusion models.

Velocity (7d): +2.0 ★/day · Trend: steady
JoyCaption is a Visual Language Model (VLM) that generates captions for images. It is built from the ground up as a free, open, and uncensored model for the AI art community. Given an image, the model outputs a detailed textual description that can serve as a training-data annotation for diffusion models. The training scripts and methodology details are released openly alongside the weights.
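
As a rough illustration of how generated captions might become training annotations, the sketch below writes one caption text file next to each image. It does not call JoyCaption itself: `caption_fn` is a hypothetical stand-in for whatever function wraps the model, and both the sidecar `.txt` layout and the name `write_sidecar_captions` are assumptions made for this example, not something the project description specifies.

```python
from pathlib import Path
from typing import Callable, List

# Image extensions to caption; an assumption for this sketch.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}


def write_sidecar_captions(
    image_dir: Path,
    caption_fn: Callable[[Path], str],  # hypothetical wrapper around a captioning model
    overwrite: bool = False,
) -> List[Path]:
    """Write a .txt caption file beside each image in image_dir.

    Returns the list of caption files that were written.
    """
    written: List[Path] = []
    for image_path in sorted(image_dir.iterdir()):
        if image_path.suffix.lower() not in IMAGE_EXTS:
            continue  # skip non-image files, including existing caption files
        caption_path = image_path.with_suffix(".txt")
        if caption_path.exists() and not overwrite:
            continue  # keep captions that were already generated or hand-edited
        caption = caption_fn(image_path).strip()
        caption_path.write_text(caption + "\n", encoding="utf-8")
        written.append(caption_path)
    return written
```

Passing the captioner in as a callable keeps the file handling independent of how the model is loaded, so the same loop works with a local model, a remote endpoint, or a stub used for testing.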