opendilab/DI-sheep
A deep reinforcement learning agent trained with PPO to play the 'Sheep Sheep' tile-matching puzzle game.

Velocity · 7d
+0.4
★ / day
Trend
→steady
star history
The project combines a popular puzzle game with deep reinforcement learning, training a PPO agent to play ‘羊了个羊’ (Sheep Sheep). It provides a custom Gym environment for the game, training scripts, and a web-based demo. The repository is built on top of the DI-engine RL framework and includes both a Python service and a React frontend for playing and interacting with the trained agent.