Is open-r1 open source?

Yes — huggingface/open-r1 is open source, released under the Apache-2.0 license.

What language is open-r1 written in?

huggingface/open-r1 is primarily written in Python.

How popular is open-r1?

huggingface/open-r1 has 26.4k stars on GitHub.

Where can I find open-r1?

huggingface/open-r1 is on GitHub at https://github.com/huggingface/open-r1.

← all repositories

huggingface/open-r1

Open R1 is the missing manual for DeepSeek's reasoning pipeline

Because DeepSeek-R1 is impressive, but its training cookbook shouldn't be a black box.

★26.4k stars Python Language Models ML Frameworks

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does

Open R1 is Hugging Face’s attempt to reverse-engineer the entire DeepSeek-R1 pipeline and release the pieces openly as they are built. The repo provides training scripts for supervised fine-tuning and GRPO, synthetic data generation via Distilabel, and pre-made recipes tuned for H100 clusters. As of late May, the team has completed Step 1: they replicated the R1-Distill 7B model and released the Mixture-of-Thoughts dataset, a curated set of 350,000 verified reasoning traces.

The interesting bit

This is not a single model drop; it is a methodical deconstruction of a closed training process. The project treats DeepSeek’s technical report as a literal blueprint, breaking the reproduction into three stages—distillation, pure RL, and multi-stage tuning—and publishing the intermediate artifacts as they go. The result reads more like a public laboratory log than a finished product.

Key highlights

Step 1 is complete and benchmarked: the team’s OpenR1-Distill-7B matches DeepSeek’s own 7B distilled model on AIME 2024, MATH-500, GPQA Diamond, and LiveCodeBench v5.
Released datasets include Mixture-of-Thoughts (350k traces), CodeForces-CoTs (10k problems, 100k solutions), and OpenR1-Math-220k.
A 7B model trained on the CodeForces dataset outperforms Claude 3.7 Sonnet on the new IOI24 benchmark; a 32B version reportedly beats R1 itself.
The codebase is intentionally minimal—just sft.py, grpo.py, generate.py, and a Makefile—designed to be read and modified rather than wrapped.

Caveats

Steps 2 and 3 (replicating the pure RL pipeline and the full multi-stage training from base model to RL-tuned) are still listed as work in progress.
The README is explicit that this is a work in progress, and most training recipes are currently hard-wired for 8× H100 nodes.

Verdict

Grab this if you are a researcher or practitioner who wants to understand—or fork—how modern reasoning models are distilled and tuned. Skip it if you are looking for a drop-in chatbot; the project is scaffolding, not a finished endpoint.

Frequently asked

What is huggingface/open-r1?: Because DeepSeek-R1 is impressive, but its training cookbook shouldn't be a black box.
Is open-r1 open source?: Yes — huggingface/open-r1 is open source, released under the Apache-2.0 license.
What language is open-r1 written in?: huggingface/open-r1 is primarily written in Python.
How popular is open-r1?: huggingface/open-r1 has 26.4k stars on GitHub.
Where can I find open-r1?: huggingface/open-r1 is on GitHub at https://github.com/huggingface/open-r1.