
BradyFU/Awesome-Multimodal-Large-Language-Models

A curated collection of papers, surveys, and benchmarks on multimodal large language models, including MME-Survey and the VITA series.

Velocity (7d): +16 ★ / day · Trend: steady

This repository aggregates recent advances in multimodal large language models (MLLMs), including surveys on MLLM evaluation, benchmarks such as Video-MME-v2, and research on the VITA series of omni MLLMs, which support vision and speech interaction. It serves as a knowledge base that tracks training methods, evaluation frameworks, and state-of-the-art models in the multimodal LLM space.
