
open-thought/reasoning-gym

A Python library of RL environments with algorithmically verifiable rewards for training reasoning capabilities in language models.

Velocity (7d): +2.9 ★ / day · Trend: steady

Reasoning Gym provides procedural dataset generators and verifiable reasoning environments for training and evaluating reasoning models with reinforcement learning. It offers over 100 tasks across diverse domains, including algebra, geometry, graph theory, logic, and games. A task may have a single correct solution or several, and every task exposes a standard interface for verifying model outputs procedurally.
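
To illustrate procedural generation paired with procedural verification, here is a minimal, self-contained Python sketch. It is not Reasoning Gym's actual API: the names `Entry`, `generate`, and `score_answer`, and the toy "pick two numbers that sum to a target" task, are hypothetical and chosen only for illustration. The task can have several correct answers, so the verifier checks the answer's properties instead of comparing it to one stored solution.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Entry:
    """One generated problem (hypothetical structure, for illustration only)."""
    question: str
    numbers: Tuple[int, ...]
    target: int


def generate(size: int, seed: int) -> List[Entry]:
    """Procedurally generate `size` problems; the same seed yields the same data."""
    rng = random.Random(seed)
    entries = []
    for _ in range(size):
        numbers = tuple(rng.sample(range(1, 50), 6))  # six distinct integers
        a, b = rng.sample(numbers, 2)                 # guarantees a solution exists
        target = a + b
        question = (
            f"Pick two different numbers from {list(numbers)} "
            f"that sum to {target}. Answer as 'a b'."
        )
        entries.append(Entry(question, numbers, target))
    return entries


def score_answer(answer: str, entry: Entry) -> float:
    """Return 1.0 for any valid pair, 0.0 otherwise (including malformed input)."""
    try:
        a, b = (int(tok) for tok in answer.split())
    except ValueError:
        # Raised for non-integer tokens or for a token count other than two.
        return 0.0
    valid = (
        a != b
        and a in entry.numbers
        and b in entry.numbers
        and a + b == entry.target
    )
    return 1.0 if valid else 0.0
```

Because the score is computed from the entry itself, it can serve directly as a reward signal: any pair that satisfies the constraints earns full credit, in either order, even if it differs from the pair the generator happened to pick.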
