jfzhang95/pytorch-video-recognition
PyTorch implementations of C3D, R3D, and R2Plus1D models for video-based human activity recognition.

Velocity · 7d
+0.4
★ / day
Trend
→steady
star history
This repository provides deep learning model implementations for video action/activity recognition. It includes C3D, R3D, and R2Plus1D architectures implemented in PyTorch, trained on standard benchmarks like UCF101 and HMDB51. These models use 3D convolutions to capture both spatial and temporal features from video sequences for classifying human actions. The code provides training scripts, pretrained model downloads, and dataset configuration utilities.