open-thought/reasoning-gym
A Python library of RL environments with algorithmically verifiable rewards for training reasoning capabilities in language models.

Velocity · 7d
+2.9
★ / day
Trend
→steady
star history
Reasoning Gym provides procedurally generated dataset generators and verifiable reasoning environments for training and evaluating reasoning models using reinforcement learning. It offers over 100 tasks across diverse domains including algebra, geometry, graph theory, logic, and games. Tasks are designed with single or multiple correct solutions, with a standard interface for procedural verification of model outputs.