llm-lab-org/Multimodal-RAG-Survey
A curated academic survey repository collecting and categorizing papers on Multimodal Retrieval-Augmented Generation.
★519 stars Learning

Velocity · 7d
+1.1
★ / day
Trend
→steady
star history
This repository accompanies a published survey paper on Multimodal RAG, collecting and organizing research papers on combining retrieval mechanisms with generative AI across text, image, audio, and video modalities. It provides a taxonomy of recent advances, taxonomy of enhancements, and continuously updated resources for researchers working on RAG systems and multimodal learning.