← all repositories

facebookresearch/Mask2Former

A transformer neural network architecture for universal image segmentation across panoptic, instance, and semantic tasks.

3.4k stars Python Computer Vision
Mask2Former
Velocity · 7d
+2.0
★ / day
Trend
steady
star history

Mask2Former is a computer vision model using masked-attention transformers for universal image segmentation. It achieves state-of-the-art performance on panoptic, instance, and semantic segmentation tasks using a single unified architecture. The project supports major benchmarks including ADE20K, Cityscapes, COCO, and Mapillary Vistas, with trained models and a web demo available through Hugging Face Spaces.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.