← all repositories

PrimeIntellect-ai/prime-rl

A framework for large-scale reinforcement learning that trains autonomous agents using asynchronous RL on distributed GPU infrastructure.

prime-rl
Velocity · 7d
+3.0
★ / day
Trend
steady
star history

PRIME-RL is a reinforcement learning training framework designed for agentic systems at scale. It provides fully asynchronous RL training capable of scaling to 1000+ GPUs, supporting mixture-of-experts models with distributed training strategies including FSDP2, EP, and CP parallelism. The framework integrates vLLM for inference, supports FP8 inference and PD disaggregation, and includes native support for agentic and SWE environments through the Environments Hub. It offers end-to-end post-training capabilities including SFT, RL training, and evaluations.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.