researchmm/Stark
A spatio-temporal transformer model for visual object tracking in video sequences.

Velocity (7d): +0.4 ★/day · Trend: → steady
This repository is the official implementation of STARK, a transformer-based visual tracker that follows a target object across video frames using a spatio-temporal transformer architecture. The authors report state-of-the-art results, at the time of publication, on benchmarks including LaSOT, GOT-10k, and TrackingNet. The method has since been integrated into the mmtracking library and won challenges at VOT-21.