Yes — alibaba/ROLL is open source, released under the Apache-2.0 license.

What language is ROLL written in?

alibaba/ROLL is primarily written in Python.

alibaba/ROLL has 3.3k stars on GitHub.

Where can I find ROLL?

alibaba/ROLL is on GitHub at https://github.com/alibaba/ROLL.

alibaba/ROLL

Post-training huge LLMs with RL shouldn't need a PhD in scheduling

ROLL wraps the messy distributed systems work of large-scale LLM reinforcement learning into YAML configs and Ray clusters.

★3.3k stars Python ML Frameworks Agents LLMOps · Eval

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does

ROLL is a training framework from Alibaba for reinforcement-learning post-training of large language models. It coordinates distributed rollout and training across many GPUs using Ray, and stitches together Megatron-Core, vLLM, and SGLang so you can run pipelines like RLVR, agentic RL, DPO, and distillation without building the plumbing yourself. The README emphasizes broad support for the Qwen family and compatibility with NVIDIA, AMD, and Ascend NPUs.

The interesting bit

Instead of building a monolithic new engine, ROLL acts as a multi-role scheduler: it uses Ray to allocate heterogeneous tasks—rollout, inference, training—across a cluster, treating vLLM and Megatron-Core as composable backends. That makes it less a reinvention of the wheel and more an attempt to keep the wheels from falling off at scale.

Key highlights

Integrates Megatron-Core, vLLM, and SGLang rather than replacing them.
Supports RLVR, agentic RL (including async and stepwise learning), SFT, DPO, and distillation pipelines.
Runs on NVIDIA, AMD, and Ascend NPU hardware.
Heavy focus on Qwen model family (Qwen2.5, Qwen3, Qwen3-MoE, Qwen3.5, Qwen-VL, etc.).
Published several accompanying research papers on optimization tricks (APPO, RollPacker, ROLL Flash).

Verdict

Worth a look if you are post-training Qwen-scale models with RL and need a pre-integrated distributed stack. Skip it if you are doing small-model experimentation on a single GPU or want a framework-agnostic, deeply documented alternative with community benchmarks.

Frequently asked

What is alibaba/ROLL?: ROLL wraps the messy distributed systems work of large-scale LLM reinforcement learning into YAML configs and Ray clusters.
Is ROLL open source?: Yes — alibaba/ROLL is open source, released under the Apache-2.0 license.
What language is ROLL written in?: alibaba/ROLL is primarily written in Python.
How popular is ROLL?: alibaba/ROLL has 3.3k stars on GitHub.
Where can I find ROLL?: alibaba/ROLL is on GitHub at https://github.com/alibaba/ROLL.