
s-casci/tinyzero

A framework for training AlphaZero-like reinforcement learning agents on custom environments via self-play and MCTS.

436 stars · Python · Agents · ML Frameworks
Velocity (7d): +0.5 ★/day · Trend: steady

The repository provides a streamlined implementation of the AlphaZero algorithm for training game-playing agents. It uses Monte Carlo Tree Search combined with deep neural networks for self-play training. Users can add custom environments by implementing a defined interface (reset, step, get_legal_actions, to_observation, etc.) and train agents through configurable episodes and simulations. It includes wandb integration for logging training metrics.
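To make the environment interface more concrete, here is a minimal sketch of a toy single-pile Nim game exposing the methods named above (reset, step, get_legal_actions, to_observation). The class name `NimEnvironment`, the `pile_size` parameter, the `(observation, reward, done)` return value of `step`, the observation encoding, and the `random_playout` helper are all assumptions made for illustration. They are not taken from the tinyzero source, whose actual signatures may differ.

```python
import random


class NimEnvironment:
    """Toy single-pile Nim: players alternate removing 1 or 2 stones.

    Whoever takes the last stone wins. Hypothetical example; the
    signatures below are assumed, not copied from tinyzero.
    """

    def __init__(self, pile_size=7):
        self.pile_size = pile_size
        self.reset()

    def reset(self):
        # Restore the initial state and return the first observation.
        self.stones = self.pile_size
        self.turn = 1  # 1 = first player, -1 = second player
        self.done = False
        return self.to_observation()

    def get_legal_actions(self):
        # An action is the number of stones to remove (1 or 2).
        if self.done:
            return []
        return [a for a in (1, 2) if a <= self.stones]

    def step(self, action):
        # Apply the move for the player whose turn it is.
        if action not in self.get_legal_actions():
            raise ValueError(f"illegal action: {action}")
        self.stones -= action
        self.done = self.stones == 0
        reward = 1 if self.done else 0  # the player who just moved wins
        self.turn = -self.turn
        return self.to_observation(), reward, self.done

    def to_observation(self):
        # Fixed-size numeric encoding suitable as neural network input:
        # fraction of stones remaining and the player to move.
        return [self.stones / self.pile_size, float(self.turn)]


def random_playout(env, rng):
    """Play one game with uniformly random legal moves (hypothetical helper)."""
    env.reset()
    moves, reward, done = 0, 0, False
    while not done:
        action = rng.choice(env.get_legal_actions())
        _, reward, done = env.step(action)
        moves += 1
    return moves, reward
```

A random playout such as `random_playout(NimEnvironment(), random.Random(0))` is a quick way to sanity-check an environment before connecting it to self-play training: every game should terminate, and illegal actions should never be offered.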
