
JDAI-CV/CoTNet

CoTNet is a contextual transformer network that replaces standard convolutions with self-attention building blocks for visual recognition tasks.

538 stars · Python · Computer Vision · ML Frameworks
Velocity · 7d: +0.3 ★ / day
Trend: steady

CoTNet is a unified self-attention building block that serves as an alternative to standard convolutions in ConvNets. The repository provides the official PyTorch implementations of vision backbone models enhanced with contextualized self-attention, covering image classification, object detection, instance segmentation, and semantic segmentation. The models report competitive accuracy and a favorable trade-off between inference time and accuracy on the ImageNet and MSCOCO benchmarks.
