BradyFU/Awesome-Multimodal-Large-Language-Models
A curated collection of papers, surveys, and benchmarks on multimodal large language models including MME-Survey and VITA series.

Velocity · 7d
+16
★ / day
Trend
→steady
star history
This repository aggregates the latest advances in multimodal large language models, containing surveys on MLLM evaluation, benchmarks like Video-MME-v2, and research on VITA series omni MLLMs capable of vision and speech interaction. It serves as a knowledge base tracking training methods, evaluation frameworks, and state-of-the-art models in the multimodal LLM space.