
ericyangyu/PPO-for-Beginners

A bare-bones, well-documented Proximal Policy Optimization implementation in PyTorch for beginners.

1.2k stars · Python · ML Frameworks · Learning
Velocity · 7d: +0.6 ★ / day · Trend: steady

This repository provides a from-scratch implementation of Proximal Policy Optimization, a reinforcement learning algorithm, using PyTorch. It follows the OpenAI Spinning Up pseudocode and targets learners with some Python and RL background. The code prioritizes clarity over performance tricks, with extensive documentation to help beginners understand PPO mechanics.
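For readers unfamiliar with the mechanics mentioned above, the core of PPO in its common clipped form is a surrogate objective that limits how far a policy update can move from the policy that collected the data. The following is a minimal sketch of that objective for a single sample, written in plain Python rather than PyTorch so it runs without dependencies. It is not taken from the repository: the function name `clipped_surrogate`, its parameters, and the default `clip_eps=0.2` are assumptions chosen for illustration.

```python
import math


def clipped_surrogate(logp_new, logp_old, advantage, clip_eps=0.2):
    """PPO-clip surrogate objective for one (state, action) sample.

    Hypothetical helper for illustration only, not the repository's code.
    logp_new / logp_old are log-probabilities of the taken action under the
    current policy and the data-collecting policy; the result is maximized.
    """
    # Probability ratio pi_new(a|s) / pi_old(a|s), computed from log-probs.
    ratio = math.exp(logp_new - logp_old)
    # Restrict the ratio to the interval [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Taking the minimum keeps the more pessimistic of the two estimates.
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # Ratio 1.5 with a positive advantage: the gain is capped at 1 + clip_eps.
    print(clipped_surrogate(math.log(1.5), 0.0, advantage=1.0))
    # Ratio 1.5 with a negative advantage: the penalty is not capped.
    print(clipped_surrogate(math.log(1.5), 0.0, advantage=-1.0))
```

In a full training loop this quantity would typically be averaged over a batch and negated to form a loss for a gradient-based optimizer; how the repository organizes that step is not shown here.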
