← all repositories

THU-MIG/yoloe

Real-time object detection and segmentation model supporting text prompts, visual prompts, and zero-shot open-vocabulary detection.

2.2k stars Python Computer Vision
yoloe
Velocity · 7d
+4.8
★ / day
Trend
steady
star history

YOLOE (Real-Time Seeing Anything) is a unified PyTorch-based computer vision model for object detection and segmentation tasks. It supports multiple prompt mechanisms including text prompts, visual inputs, and prompt-free detection with zero-shot capability. The model achieves competitive accuracy while maintaining real-time inference speed, representing advances in open-set object detection.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.