rlcode/reinforcement-learning
Educational repository with clean Python implementations of classic and deep reinforcement learning algorithms for Grid World, CartPole, and Atari environments.

Velocity · 7d
+1.1
★ / day
Trend
→steady
star history
Provides minimal, single-file code examples of reinforcement learning algorithms ranging from basic policy and value iteration to deep methods including DQN, A2C, and PPO. Each algorithm is implemented in a standalone Python file with training on Grid World, CartPole, and Atari environments. Includes benchmark results for training on Apple Silicon using the PyTorch MPS backend.