NVlabs/FAN
Fully Attentional Network is a general-purpose vision transformer backbone achieving state-of-the-art results on ImageNet and domain generalization benchmarks.

Velocity · 7d
+0.3
★ / day
Trend
→steady
star history
This repository provides the official PyTorch implementation of FAN, a vision transformer architecture designed to improve self-attention mechanisms for visual recognition tasks. It includes training and evaluation code along with pretrained models for backbone use in image classification, object detection, and semantic segmentation across standard benchmarks like ImageNet and Cityscapes.