microsoft/LLaVA-Med
A multimodal large language-and-vision assistant trained specifically for biomedical applications, published at NeurIPS 2023.

Velocity · 7d
+2.0
★ / day
Trend
→steady
star history
LLaVA-Med is a large language-and-vision assistant built for the biomedicine domain, trained using visual instruction tuning to achieve multimodal GPT-4 level capabilities. It combines vision and language understanding to interpret medical images and answer related queries. The model is available on Hugging Face and supports direct loading without delta weights.