A 2017 pose paper, ported to PyTorch with duct tape
This repo re-implements the CVPR'17 OpenPose work in PyTorch, but you'll need to compile C++ helpers and pray your GPU matches the author's four-GTX-1080-Ti shrine.

What it does
Detects multiple people’s body keypoints in real time from images or webcam feeds. It’s a PyTorch re-implementation of the 2017 “Realtime Multi-Person Pose Estimation” paper that introduced Part Affinity Fields — basically, it draws stick figures over crowds.
The interesting bit
The original code was Caffe-based; this is a community port to PyTorch. The README claims a mAP bump from 0.577 to 0.653 simply by adding left-right flip augmentation during evaluation — a nice reminder that paper numbers are often training-curve accidents, not gospel.
Key highlights
- Pre-trained model available via Dropbox (not exactly a CDN, but it works)
- Includes both picture demo and a web demo for live webcam input
- Evaluation script provided against COCO val2017 with reported 0.653 mAP
- Training code included, though you’ll need to hand-edit paths in
train_VGG19.py - C++ postprocessing module (
lib/pafprocess) requires manual compilation
Caveats
- “Other platforms or GPU cards are not fully tested” — the author used 4× GTX 1080 Ti on Ubuntu 18.04, so your RTX 4090 or MacBook are uncharted territory
- The README has a stray “c” floating after the evaluation section, which tells you something about maintenance attention
- No inference time benchmark is actually filled in the results table
Verdict
Worth a look if you specifically need a PyTorch-native OpenPose variant for research or hacking. Skip it if you want production-ready pose estimation — MediaPipe and newer models have made this largely a historical artifact.