← all repositories

google-deepmind/tapnet

DeepMind's computer vision system for tracking arbitrary points across video frames using deep learning models.

1.9k stars Jupyter Notebook Computer VisionDomain Apps
tapnet
Velocity · 7d
+1.5
★ / day
Trend
steady
star history

Tracking Any Point (TAP) is a computer vision system that identifies and follows points through video sequences. The repository contains the TAPIR model, a two-stage algorithm using matching and refinement stages to locate point trajectories, along with the TAP-Vid and TAPVid-3D benchmark datasets for evaluating tracking performance. It also includes RoboTAP, which applies point tracking to real-world robotics manipulation tasks through imitation learning.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.