ericyangyu/PPO-for-Beginners
A bare-bones, well-documented Proximal Policy Optimization implementation in PyTorch for beginners.

Velocity · 7d
+0.6
★ / day
Trend
→steady
star history
This repository provides a from-scratch implementation of Proximal Policy Optimization, a reinforcement learning algorithm, using PyTorch. It follows the OpenAI Spinning Up pseudocode and targets learners with some Python and RL background. The code prioritizes clarity over performance tricks, with extensive documentation to help beginners understand PPO mechanics.