DINOv3's self-supervised eyes now spot objects in real time
DEIMv2 grafts a detection head onto DINOv3's frozen features, trading the usual YOLO backbone for a vision transformer that learned without labels.

What it does DEIMv2 is a real-time object detector that uses DINOv3—Meta’s self-supervised vision transformer—as its backbone instead of the convolutional networks typical in fast detectors. It ships eight model sizes from 0.5M-parameter “Atto” (23.8 AP, 1.1ms latency) up to 50M-parameter “X” (57.8 AP), with the S model hitting 50.9 AP on COCO at 5.78ms. The architecture wraps a hybrid encoder and DEIM transformer decoder around DINOv3 features, then post-processes with a standard detection head.
The interesting bit The project bets that DINOv3’s unsupervised pretraining contains richer spatial features than supervised backbones, and tries to prove it by keeping inference fast enough for edge deployment. The “STA” (presumably some attention mechanism) has already been upstreamed into the LightlyTrain distillation library, suggesting the training tricks have legs beyond this repo.
Key highlights
- Eight checkpoints on Hugging Face, loadable via
from_pretrained()with a PyTorchModelHubMixin wrapper - TensorRT deployment supported; FP16 requires TensorRT ≥ 10.6 to avoid silent corruption
- Integrated into X-AnyLabeling for annotation workflows
- Memory usage halved for S/M models via a ViT-Tiny attention optimization (Oct 2025)
- Apache 2.0 licensed; arXiv paper at 2509.20787
Caveats
- The README is thin on how DINOv3 features are actually adapted for detection; the “DINOv3STAs” module name hints at attention manipulation but details aren’t spelled out
- PyTorch 2.5.1 or 2.4.1 recommended; “we have not tried other versions” is not the most reassuring compatibility statement
- The authors now pitch EdgeCrafter as their newer, better, multi-task successor—this repo may be entering maintenance mode
Verdict Worth a look if you’re running detection on DINOv3 embeddings already or want a Hugging Face-native detector with unusual lineage. Skip if you need battle-tested YOLO ecosystem tooling or are starting fresh—EdgeCrafter is the team’s new favorite child.