henry123-boy/SpaTracker
SpatialTracker (CVPR 2024) tracks arbitrary 2D pixels in 3D space using depth estimation and multi-view geometric constraints. This repository is its PyTorch implementation.

SpatialTracker is a deep-learning computer vision system that tracks arbitrary 2D pixels through a video sequence and reconstructs their 3D positions. It uses a monocular depth estimator to obtain metric depth and applies multi-view geometric constraints for 3D reconstruction and tracking. The official implementation provides inference code and pre-trained checkpoints, and supports dense tracking of thousands of points on a single GPU.
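
To give a rough sense of what lifting 2D pixels into 3D with a depth estimate involves, here is a minimal sketch under a standard pinhole camera model. It is an illustration only: the function names `unproject` and `project` and the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical and are not taken from the SpaTracker codebase, whose learned pipeline is considerably more involved.

```python
from typing import List, Sequence, Tuple

Point2D = Tuple[float, float]
Point3D = Tuple[float, float, float]


def unproject(
    pixels: Sequence[Point2D],
    depths: Sequence[float],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point3D]:
    """Lift pixels (u, v) with per-pixel depth z to camera-space (X, Y, Z).

    Assumes a pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy); these intrinsics are illustrative assumptions.
    """
    if len(pixels) != len(depths):
        raise ValueError("pixels and depths must have the same length")
    points = []
    for (u, v), z in zip(pixels, depths):
        x = (u - cx) * z / fx
        y = (v - cy) * z / fy
        points.append((x, y, z))
    return points


def project(
    points: Sequence[Point3D],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point2D]:
    """Project camera-space points back to pixel coordinates (inverse of unproject)."""
    return [(fx * x / z + cx, fy * y / z + cy) for x, y, z in points]
```

With depth values from any monocular depth estimator, `unproject` turns each queried pixel into a 3D point in the camera frame, and `project` maps a 3D track back to the image for visualization.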