← all repositories

DLYuanGod/TinyGPT-V

An efficient multimodal large language model using small backbones for text and image processing.

1.3k stars Python Language Models
TinyGPT-V
Velocity · 7d
+1.5
★ / day
Trend
steady
star history

TinyGPT-V is an efficient multimodal LLM that processes both text and images using compact backbone architectures. The project provides training code, pretrained model weights on HuggingFace, and interactive demo spaces for model evaluation. According to the authors, it achieves approximately 98% of InstructBLIP’s performance while using smaller model components.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.