← all repositories

ikostrikov/jaxrl

JAX (Flax) implementations of deep reinforcement learning algorithms including Soft Actor Critic, DDPG, and AWAC for continuous action spaces.

757 stars Jupyter Notebook ML FrameworksAgents
jaxrl
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

This repository provides clean, minimal implementations of reinforcement learning algorithms written in JAX and Flax. It includes Soft Actor Critic with learnable temperature, Advantage Weighted Actor Critic, Deep Deterministic Policy Gradient with clipped double Q-learning, randomized ensemble double Q-learning, and behavioral cloning. The goal is to offer simple implementations that researchers can build upon for RL research in continuous control environments.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.