richard-peng-xia/awesome-multimodal-in-medical-imaging
A curated collection of research papers on multimodal large language models applied to medical imaging tasks.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
This repository aggregates academic papers and resources on applying multimodal learning—including vision-language models and LLMs—to medical imaging. It covers tasks such as medical report generation, visual question answering, and multi-agent reasoning in clinical settings. Papers are organized by conference/journal with links to PDFs and code repositories, and actively tracks recent publications like MMedAgent-RL and MMed-RAG.