Is atropos open source?

Yes — NousResearch/atropos is open source, released under the MIT license.

What language is atropos written in?

NousResearch/atropos is primarily written in Python.

How popular is atropos?

NousResearch/atropos has 1.3k stars on GitHub.

Where can I find atropos?

NousResearch/atropos is on GitHub at https://github.com/NousResearch/atropos.

← all repositories

NousResearch/atropos

Atropos Is the Gym; You Bring the LLM and the Trainer

A microservice framework that supplies the worlds, benchmarks, and judges for LLM reinforcement learning, leaving the training and inference to you.

★1.3k stars Python Agents ML Frameworks LLMOps · Eval

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does

Atropos is an environment framework for asynchronous reinforcement learning with language models. It runs environments as microservices and exposes a trajectory API that feeds data to your trainer while pulling batches from an inference engine you host yourself. The package ships with pre-built environments spanning static benchmarks like GSM8K and MMLU, interactive games such as Blackjack and Taxi, code execution tasks, multimodal VQA, and even RLAIF personality-shaping judges.

The interesting bit

The framework deliberately stays out of the model-training business: it is only the gym, not the athlete or the coach. That separation lets researchers swap in any OpenAI-compatible inference server—vLLM, SGLang, or a remote API—and focus on curating the world the LLM acts inside rather than wiring distributed RL plumbing.

Key highlights

Ships with diverse ready-to-use environments: dataset evaluation, online games, tool calling, code execution (coding_server.py), and multimodal tasks (multimodal_dpo/).
Claims concrete task-specific improvements: up to 4.6× improvement on parallel tool-calling and 2.5× on financial directional prediction, with published model artifacts on Hugging Face.
Includes an RLAIF environment the authors used to produce “interesting and weird” personality variants (their words) such as the DeepHermes Egregore and Ascension Maze models.
Designed as microservices with automatic token and logprob tracking via the ManagedServer approach.
Distributed-inference friendly, with documentation for Slurm clusters.

Caveats

The README is upfront that trainer and inference engine are not included; this is strictly the environment layer.
Several experimental results are shown, but they are domain-specific (tool calling, financial prediction); generalization to other tasks is not demonstrated in the provided sources.

Verdict

Researchers and engineers who already have a training stack and need a standardized, extensible way to evaluate or fine-tune LLMs across interactive tasks should look here. If you want an all-in-one RL training platform with a built-in model runner, this is only half the puzzle.

Frequently asked

What is NousResearch/atropos?: A microservice framework that supplies the worlds, benchmarks, and judges for LLM reinforcement learning, leaving the training and inference to you.
Is atropos open source?: Yes — NousResearch/atropos is open source, released under the MIT license.
What language is atropos written in?: NousResearch/atropos is primarily written in Python.
How popular is atropos?: NousResearch/atropos has 1.3k stars on GitHub.
Where can I find atropos?: NousResearch/atropos is on GitHub at https://github.com/NousResearch/atropos.