A reinforcement learning library that outlived its framework
ChainerRL is a comprehensive deep RL toolkit built on Chainer, with a PyTorch successor waiting in the wings.

What it does ChainerRL implements a broad catalog of deep reinforcement learning algorithms—DQN variants, policy gradient methods, actor-critic approaches, and more—on top of the Chainer deep learning framework. It targets standard Gym-style environments and ships with reproduction examples for Atari and MuJoCo benchmarks.
The interesting bit The README quietly notes that PFRL is “the PyTorch analog of ChainerRL,” which tells you everything about where the ecosystem’s momentum went. Chainer itself was sunset by its maintainers in 2019, making this library a kind of well-preserved RL time capsule with a migration path baked in.
Key highlights
- 13+ algorithms with a clear capability matrix (discrete vs. continuous actions, recurrent models, batch/async training)
- Built-in techniques: Prioritized Experience Replay, Dueling Networks, NoisyNet, Deep Recurrent Q-Networks
- Separate browser-based visualization tool for debugging agent behavior
- JMLR-published library with formal citation
- Every algorithm links to paper and runnable example scripts
Caveats
- Python 3.6 is the stated test target; dependencies likely stale
- Chainer framework is no longer actively developed
- Several algorithms (TD3, SAC, IQN) lack support for recurrent models or CPU async training
Verdict Worth studying if you need a clean, well-documented reference implementation of classic deep RL algorithms, or if you’re migrating legacy Chainer code. New projects should probably start with PFRL or a more actively maintained stack.