← all repositories

HRNet/HRFormer

A Vision Transformer architecture that maintains high-resolution representations for dense prediction tasks like pose estimation and semantic segmentation.

522 stars Python Computer VisionML Frameworks
HRFormer
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

HRFormer adapts the transformer architecture for dense computer vision tasks by using multi-resolution parallel design inspired by HRNet, combined with local-window self-attention for computational efficiency. The model performs human pose estimation and semantic segmentation by processing image features at multiple resolutions simultaneously, addressing the memory and computation limitations of standard Vision Transformers that produce only low-resolution outputs.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.