← all repositories

microsoft/LLaVA-Med

A multimodal large language-and-vision assistant trained specifically for biomedical applications, published at NeurIPS 2023.

2.2k stars Python Language ModelsDomain Apps
LLaVA-Med
Velocity · 7d
+2.0
★ / day
Trend
steady
star history

LLaVA-Med is a large language-and-vision assistant built for the biomedicine domain, trained using visual instruction tuning to achieve multimodal GPT-4 level capabilities. It combines vision and language understanding to interpret medical images and answer related queries. The model is available on Hugging Face and supports direct loading without delta weights.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.