facebookresearch/TimeSformer

TimeSformer is a transformer-based model for video classification. It uses space-time attention and reported state-of-the-art results on action recognition benchmarks at the time of publication.

1.9k stars · Python · Computer Vision · ML Frameworks
Velocity (7d): +1.0 ★ / day · Trend: steady

This repository provides the official PyTorch implementation of TimeSformer, a model for video understanding. The model splits each frame into non-overlapping patches and applies transformer self-attention over the resulting patch sequence across both the spatial and temporal dimensions. Pretrained models are provided for the Kinetics-400, Kinetics-600, Something-Something-V2, and HowTo100M datasets.
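
To give a feel for what space-time attention costs, here is a minimal sketch in plain Python. It is not taken from the repository. It assumes each frame is cut into a grid of square patches and compares two attention layouts: joint attention over all patches in the clip, and a divided scheme that attends over time first and then over space. The helper names (`token_counts`, `comparisons_per_patch`) and the clip settings (8 frames, 224×224 pixels, 16×16 patches) are illustrative assumptions, as is counting the classification token once per attention step.

```python
def token_counts(num_frames, height, width, patch_size):
    """Return (patches_per_frame, total_patches) for a clip of square patches."""
    if height % patch_size or width % patch_size:
        raise ValueError("frame size must be divisible by patch_size")
    patches_per_frame = (height // patch_size) * (width // patch_size)
    return patches_per_frame, num_frames * patches_per_frame


def comparisons_per_patch(num_frames, patches_per_frame, divided):
    """Number of keys one query patch is compared against.

    Assumption of this sketch: the classification token is counted
    once in every attention step.
    """
    if divided:
        # Temporal step: the same spatial location in every frame, plus cls.
        temporal = num_frames + 1
        # Spatial step: every patch in the same frame, plus cls.
        spatial = patches_per_frame + 1
        return temporal + spatial
    # Joint step: every patch in every frame, plus cls.
    return num_frames * patches_per_frame + 1


# Illustrative clip: 8 frames of 224x224 pixels, 16x16 patches.
n, total = token_counts(8, 224, 224, 16)
print(n, total)                                    # 196 1568
print(comparisons_per_patch(8, n, divided=False))  # 1569
print(comparisons_per_patch(8, n, divided=True))   # 206
```

Under these assumptions, the divided scheme compares each patch against roughly N + F keys instead of N × F, which is why longer clips or higher resolutions stay tractable.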