← all repositories

SHI-Labs/Neighborhood-Attention-Transformer

A transformer architecture for computer vision that uses localized attention mechanisms, published at CVPR 2023.

1.2k stars Python Computer Vision
Neighborhood-Attention-Transformer
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

Neighborhood Attention Transformer (NAT) and its dilated variant (DNAT) are vision transformer architectures that replace global self-attention with localized neighborhood attention for improved efficiency. The models achieve state-of-the-art performance on multiple computer vision benchmarks including instance segmentation, semantic segmentation, and panoptic segmentation on ADE20K, Cityscapes, and COCO datasets. The implementation includes PyTorch models and a custom CUDA extension (NATTEN) for accelerated neighborhood attention computation.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.