longzw1997/Open-GroundingDino
PyTorch implementation of Grounding DINO for open-set object detection using vision-language grounding.

Velocity · 7d
+0.9
★ / day
Trend
→steady
star history
This repository provides a third-party implementation of Grounding DINO, an open-set object detection model that combines DINO architecture with grounded pre-training to detect arbitrary objects using language descriptions. It supports model fine-tuning on custom datasets and pre-training from scratch, uses BERT for text encoding, and includes training acceleration strategies with multi-machine Slurm support.