IDEA-Research/T-Rex
A multimodal object detection model that synergizes text and visual prompts for open-set detection and counting tasks.

Velocity · 7d
+2.9
★ / day
Trend
→steady
star history
T-Rex2 is a generic object detection framework that accepts both text and visual prompts to identify objects in images. The model enables interactive detection through visual exemplars and text descriptions, supporting open-set scenarios where classes are not pre-defined. It provides API access for integration and includes demo examples for easy experimentation.