dk-liang/Awesome-Visual-Transformer
A curated list of academic papers and surveys on Vision Transformers in deep learning.

Velocity · 7d
+1.8
★ / day
Trend
→steady
star history
This repository compiles research papers covering transformer architectures applied to computer vision tasks. It includes foundational papers like the original Attention is All You Need, survey papers on Visual Transformers, arXiv preprints on ViT variants, and technical blog posts in both English and Chinese. The collection spans topics from 3D semantic segmentation to text-VQA systems using transformer-based visual understanding.