FoundationVision/GLEE
A foundation model for unified object detection, segmentation, and tracking across images and videos.

Velocity · 7d
+1.3
★ / day
Trend
→steady
star history
GLEE is a general object foundation model designed to handle multiple computer vision tasks at scale including object detection, segmentation, referring expression comprehension, multi-object tracking, and video instance segmentation. It supports open-vocabulary capabilities for zero-shot detection and segmentation, enabling generalization to unseen object categories without additional training.