
cheerss/CrossFormer

CrossFormer++ is a vision transformer enabling cross-scale attention for object detection, instance segmentation, and semantic segmentation.

402 stars · Python · Computer Vision · ML Frameworks
Velocity (7d): +0.2 ★ / day · Trend: steady

This repository contains PyTorch implementations of CrossFormer and CrossFormer++, versatile vision transformer architectures designed to build attention across features of different scales. The core innovations are the Cross-scale Embedding Layer (CEL) and the Long-Short Distance Attention (L/SDA) module. The implementation supports classification, object detection and instance segmentation with Mask R-CNN and Cascade Mask R-CNN, and semantic segmentation. Pretrained models are provided in Small, Base, Large, and Huge variants.
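The grouping idea behind L/SDA can be illustrated with a small, framework-free sketch. It follows the description in the CrossFormer paper as commonly summarized: short-distance attention groups adjacent embeddings into windows, while long-distance attention groups embeddings sampled at a fixed interval. This is not the repository's code; the function names `sda_groups` and `lda_groups` and the toy 4x4 grid are hypothetical, chosen only for illustration.

```python
def sda_groups(size, group):
    """Short-distance grouping (illustrative): split a size x size grid
    into adjacent, non-overlapping group x group windows."""
    assert size % group == 0, "grid size must be divisible by group size"
    groups = {}
    for r in range(size):
        for c in range(size):
            # Positions in the same window share the same integer quotient.
            groups.setdefault((r // group, c // group), []).append((r, c))
    return list(groups.values())


def lda_groups(size, interval):
    """Long-distance grouping (illustrative): positions sampled with a
    fixed interval along each axis fall into the same group."""
    assert size % interval == 0, "grid size must be divisible by interval"
    groups = {}
    for r in range(size):
        for c in range(size):
            # Positions in the same group share the same remainder.
            groups.setdefault((r % interval, c % interval), []).append((r, c))
    return list(groups.values())


if __name__ == "__main__":
    print("SDA first group:", sda_groups(4, 2)[0])
    print("LDA first group:", lda_groups(4, 2)[0])
```

In this sketch, attention would be computed only among positions within the same group. Alternating the two groupings lets nearby and distant positions interact while keeping each attention computation small.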
