← all repositories

opendilab/DI-sheep

A deep reinforcement learning agent trained with PPO to play the 'Sheep Sheep' tile-matching puzzle game.

508 stars Python AgentsDomain Apps
DI-sheep
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

The project combines a popular puzzle game with deep reinforcement learning, training a PPO agent to play ‘羊了个羊’ (Sheep Sheep). It provides a custom Gym environment for the game, training scripts, and a web-based demo. The repository is built on top of the DI-engine RL framework and includes both a Python service and a React frontend for playing and interacting with the trained agent.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.