← all repositories

alibaba/ROLL

A distributed RL training library by Alibaba for optimizing Large Language Models at scale using RLHF and RLVR techniques.

ROLL
Velocity · 7d
+8.5
★ / day
Trend
steady
star history

ROLL is an open-source library that enables efficient large-scale reinforcement learning training for LLMs on GPU clusters. It implements multi-role distributed architecture powered by Ray for flexible resource scheduling, and integrates modern inference engines like vLLM and SGLang alongside Megatron-Core for model training. The library focuses on improving LLM capabilities in human preference alignment (RLHF), multi-turn agentic interactions, and complex reasoning tasks.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.