← all repositories

ShangtongZhang/reinforcement-learning-an-introduction

A Python implementation of algorithms from the Sutton & Barto reinforcement learning textbook, covering bandits, dynamic programming, and temporal-difference learning.

14.7k stars Python LearningML Frameworks
reinforcement-learning-an-introduction
Velocity · 7d
+4.1
★ / day
Trend
steady
star history

This repository provides Python implementations of algorithms from the classic reinforcement learning textbook by Sutton and Barto. It reproduces figures and exercises from the book, covering multi-armed bandits, grid-world environments, dynamic programming, and temporal-difference learning methods. The code is structured by chapter and designed for educational use.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.