IDEA-Research/MaskDINO

A unified transformer-based framework for object detection, instance segmentation, semantic segmentation, and panoptic segmentation tasks.

1.5k stars · Python · Computer Vision
Velocity · 7d: +1.1 ★ / day · Trend: steady

MaskDINO extends the DINO detector with a mask prediction head, so a single transformer architecture handles object detection alongside instance, semantic, and panoptic segmentation. The authors report state-of-the-art results on the COCO and ADE20K benchmarks at the time of release. The repository includes training scripts, evaluation tools, and pretrained models for downstream use.
