opendilab/LightZero

A unified benchmark for Monte Carlo Tree Search algorithms in reinforcement learning, implementing AlphaZero, MuZero, and variants for games and control tasks.

1.6k stars · Python · Agents · ML Frameworks · Domain Apps
Star velocity (7d): +1.2 ★/day · Trend: steady

LightZero provides PyTorch implementations of MCTS-based reinforcement learning algorithms including AlphaZero, MuZero, EfficientZero, Gumbel-MuZero, and Stochastic MuZero. It benchmarks these agents across diverse scenarios such as board games (Gomoku, TicTacToe), Atari environments, and continuous control tasks. The project is designed as both a research benchmark and a training framework for self-play decision-making agents.