Paranioar/Awesome_Matching_Pretraining_Transfering
An awesome list compiling research papers on Large Multimodal Models, Vision-Language Pretraining, and Parameter-Efficient Fine-Tuning.

Velocity (7d): +0.2 ★/day · Trend: steady
This repository is a categorized paper list and tutorial covering foundational and recent research in large multimodal models, vision-language pretraining, and parameter-efficient fine-tuning techniques. It includes sections on large language models, large vision models, text-to-image/video generation, multimodal perception and unification, model distillation, and related benchmarks and surveys.