vwxyzjn/cleanrl
A deep reinforcement learning library offering clean, single-file implementations of RL algorithms like PPO, DQN, and SAC.

CleanRL provides high-quality single-file implementations of deep reinforcement learning algorithms including PPO, DQN, C51, DDPG, TD3, SAC, and PPG. Each algorithm variant is contained in a standalone file designed to be readable and research-friendly, making it a reference implementation for understanding RL algorithms. The library integrates with Hugging Face for model sharing and supports scaling experiments across cloud infrastructure using AWS Batch.