InternLM/InternLM-XComposer
A multimodal large language model from Shanghai AI Laboratory supporting text, vision, and audio interactions.

Velocity (7d): +3.0 ★/day · Trend: steady
InternLM-XComposer-2.5 is a large vision-language model in the InternLM foundation model family. It supports multimodal understanding and generation across text, images, video, and audio. The repository includes training code, evaluation scripts, and model weights for the base model and for specialized variants such as the reward model and the OmniLive streaming system.