ikostrikov/jaxrl
JAX (Flax) implementations of deep reinforcement learning algorithms including Soft Actor Critic, DDPG, and AWAC for continuous action spaces.

This repository provides clean, minimal implementations of reinforcement learning algorithms written in JAX and Flax. It includes Soft Actor Critic with learnable temperature, Advantage Weighted Actor Critic, Deep Deterministic Policy Gradient with clipped double Q-learning, randomized ensemble double Q-learning, and behavioral cloning. The goal is to offer simple implementations that researchers can build upon for RL research in continuous control environments.