ZHO-ZHO-ZHO/ComfyUI-Gemini
A ComfyUI plugin that integrates Google Gemini as a multi-modal LLM node for generating prompts, tagging images, and conversational assistance within AI image generation workflows.

This repository provides custom nodes for ComfyUI that connect to Google Gemini models, enabling AI-assisted prompt generation, image captioning with Gemini Pro Vision, and multi-turn conversational interactions. Users can leverage Gemini to auto-generate stable diffusion prompts, batch-process image descriptions for tagging, and interact via a built-in chatbot interface. The plugin supports system instructions, file uploads, and context-aware multi-modal conversations.