← all repositories
chainer/chainerrl

A reinforcement learning library that outlived its framework

ChainerRL is a comprehensive deep RL toolkit built on Chainer, with a PyTorch successor waiting in the wings.

1.2k stars Python AgentsML Frameworks
chainerrl
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

What it does ChainerRL implements a broad catalog of deep reinforcement learning algorithms—DQN variants, policy gradient methods, actor-critic approaches, and more—on top of the Chainer deep learning framework. It targets standard Gym-style environments and ships with reproduction examples for Atari and MuJoCo benchmarks.

The interesting bit The README quietly notes that PFRL is “the PyTorch analog of ChainerRL,” which tells you everything about where the ecosystem’s momentum went. Chainer itself was sunset by its maintainers in 2019, making this library a kind of well-preserved RL time capsule with a migration path baked in.

Key highlights

  • 13+ algorithms with a clear capability matrix (discrete vs. continuous actions, recurrent models, batch/async training)
  • Built-in techniques: Prioritized Experience Replay, Dueling Networks, NoisyNet, Deep Recurrent Q-Networks
  • Separate browser-based visualization tool for debugging agent behavior
  • JMLR-published library with formal citation
  • Every algorithm links to paper and runnable example scripts

Caveats

  • Python 3.6 is the stated test target; dependencies likely stale
  • Chainer framework is no longer actively developed
  • Several algorithms (TD3, SAC, IQN) lack support for recurrent models or CPU async training

Verdict Worth studying if you need a clean, well-documented reference implementation of classic deep RL algorithms, or if you’re migrating legacy Chainer code. New projects should probably start with PFRL or a more actively maintained stack.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.