← all repositories
tensorboy/pytorch_Realtime_Multi-Person_Pose_Estimation

A 2017 pose paper, ported to PyTorch with duct tape

This repo re-implements the CVPR'17 OpenPose work in PyTorch, but you'll need to compile C++ helpers and pray your GPU matches the author's four-GTX-1080-Ti shrine.

1.4k stars Python Computer VisionML Frameworks
pytorch_Realtime_Multi-Person_Pose_Estimation
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

What it does

Detects multiple people’s body keypoints in real time from images or webcam feeds. It’s a PyTorch re-implementation of the 2017 “Realtime Multi-Person Pose Estimation” paper that introduced Part Affinity Fields — basically, it draws stick figures over crowds.

The interesting bit

The original code was Caffe-based; this is a community port to PyTorch. The README claims a mAP bump from 0.577 to 0.653 simply by adding left-right flip augmentation during evaluation — a nice reminder that paper numbers are often training-curve accidents, not gospel.

Key highlights

  • Pre-trained model available via Dropbox (not exactly a CDN, but it works)
  • Includes both picture demo and a web demo for live webcam input
  • Evaluation script provided against COCO val2017 with reported 0.653 mAP
  • Training code included, though you’ll need to hand-edit paths in train_VGG19.py
  • C++ postprocessing module (lib/pafprocess) requires manual compilation

Caveats

  • “Other platforms or GPU cards are not fully tested” — the author used 4× GTX 1080 Ti on Ubuntu 18.04, so your RTX 4090 or MacBook are uncharted territory
  • The README has a stray “c” floating after the evaluation section, which tells you something about maintenance attention
  • No inference time benchmark is actually filled in the results table

Verdict

Worth a look if you specifically need a PyTorch-native OpenPose variant for research or hacking. Skip it if you want production-ready pose estimation — MediaPipe and newer models have made this largely a historical artifact.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.