← all repositories

dk-liang/Awesome-Visual-Transformer

A curated list of academic papers and surveys on Vision Transformers in deep learning.

Awesome-Visual-Transformer
Velocity · 7d
+1.8
★ / day
Trend
steady
star history

This repository compiles research papers covering transformer architectures applied to computer vision tasks. It includes foundational papers like the original Attention is All You Need, survey papers on Visual Transformers, arXiv preprints on ViT variants, and technical blog posts in both English and Chinese. The collection spans topics from 3D semantic segmentation to text-VQA systems using transformer-based visual understanding.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.