← all repositories

xiaoachen98/Open-LLaVA-NeXT

Open-source training implementation for LLaVA-NeXT series, a large vision-language multimodal model combining CLIP vision encoder with Vicuna/LLaMA3 LLMs.

437 stars Python Language ModelsML Frameworks
Open-LLaVA-NeXT
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

This repository provides an open-source implementation for training LLaVA-NeXT, a large multimodal model that combines a CLIP vision encoder with large language model backbones (Vicuna-7B, LLaMA3-8B). It includes pretraining and supervised fine-tuning stages with all training data and checkpoints released. The codebase builds on the original LLaVA with minimal modifications, making it easy to reproduce and extend the model for research purposes.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.