AgentR1/Agent-R1

Agent-R1 is a unified RL framework for training multi-step LLM agents to use tools and interact with environments.

1.5k stars · Python · Agents · ML Frameworks
Velocity (7d): +3.2 ★ / day · Trend: steady

The framework implements a step-native reinforcement learning loop where LLM agents observe environments, generate actions, and receive tool or environment feedback until task completion. It models each turn as an explicit MDP transition, making reward assignment and policy optimization part of a unified training substrate. The project includes integrations with StepPO training and provides processed datasets for agent tasks like HotpotQA, ALFWorld, and WebShop.
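The observe, act, feedback loop described above can be illustrated with a minimal sketch. This is not Agent-R1's API: `CounterEnv`, `Transition`, `rollout`, and `discounted_returns` are hypothetical names invented for the illustration, and the "policy" is a fixed function standing in for an LLM. It only shows the general idea of recording each turn as an explicit MDP transition and then assigning credit per step.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Transition:
    """One agent turn recorded as an MDP transition (hypothetical structure)."""
    observation: str
    action: str
    reward: float
    next_observation: str
    done: bool


class CounterEnv:
    """Toy environment: the task is complete once the 'inc' tool
    has been called `target` times."""

    def __init__(self, target: int = 3) -> None:
        self.target = target
        self.count = 0

    def reset(self) -> str:
        self.count = 0
        return f"count={self.count}"

    def step(self, action: str) -> Tuple[str, float, bool]:
        if action == "inc":
            self.count += 1
        done = self.count >= self.target
        reward = 1.0 if done else 0.0  # sparse reward on task completion
        return f"count={self.count}", reward, done


def rollout(env: CounterEnv, policy: Callable[[str], str],
            max_steps: int = 10) -> List[Transition]:
    """Run observe -> act -> feedback until the task ends or the budget runs out."""
    obs = env.reset()
    trajectory: List[Transition] = []
    for _ in range(max_steps):
        action = policy(obs)                       # the agent generates an action
        next_obs, reward, done = env.step(action)  # tool/environment feedback
        trajectory.append(Transition(obs, action, reward, next_obs, done))
        obs = next_obs
        if done:
            break
    return trajectory


def discounted_returns(trajectory: List[Transition], gamma: float) -> List[float]:
    """Per-step return G_t = r_t + gamma * G_{t+1}, computed backwards."""
    returns: List[float] = []
    g = 0.0
    for t in reversed(trajectory):
        g = t.reward + gamma * g
        returns.append(g)
    returns.reverse()
    return returns


# A fixed policy stands in for the LLM in this sketch.
trajectory = rollout(CounterEnv(target=3), policy=lambda obs: "inc")
returns = discounted_returns(trajectory, gamma=0.5)
```

Because every turn is stored as its own transition, the per-step returns can feed a policy-gradient style update directly; how Agent-R1 actually assigns rewards and optimizes the policy is not specified here.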
