Is ZeroSearch open source?

Yes — Alibaba-NLP/ZeroSearch is open source, released under the Apache-2.0 license.

What language is ZeroSearch written in?

Alibaba-NLP/ZeroSearch is primarily written in Python.

How popular is ZeroSearch?

Alibaba-NLP/ZeroSearch has 1.3k stars on GitHub.

Where can I find ZeroSearch?

Alibaba-NLP/ZeroSearch is on GitHub at https://github.com/Alibaba-NLP/ZeroSearch.

← all repositories

Alibaba-NLP/ZeroSearch

Training LLMs to search by keeping them offline

A reinforcement learning framework that teaches language models to query and reason over search results without ever calling a live search API during training.

★1.3k stars Python Language Models Agents LLMOps · Eval

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does

ZeroSearch is a reinforcement learning pipeline that trains language models to interact with search engines. Instead of burning API credits on live Google or Wikipedia calls during RL training, it uses a dedicated simulation LLM to generate synthetic retrieval results—both relevant documents and deliberate noise. A curriculum mechanism gradually ramps up the difficulty of these fake retrieval scenarios to force the policy model to improve its reasoning.

The interesting bit

The twist is that the simulator itself is an LLM fine-tuned to mimic a retriever, effectively creating a closed-loop gym where the student practices on a counterfeit internet. The authors claim this setup not only eliminates API costs but outperforms models trained with real search engines—a result that either indicts live search reliability or validates the fidelity of their synthetic retrieval.

Key highlights

Supports REINFORCE, GRPO, and PPO for training the policy model.
Provides fine-tuned simulation LLMs (3B, 7B, and 14B) for both Google and Wikipedia retrieval flavors.
Released policy models for Qwen2.5 and Llama 3.2 in multiple sizes.
Includes a curriculum rollout with adjustable difficulty thresholds to progressively stress-test the model.
Built atop Search-R1, veRL, and RAGEN rather than from scratch.

Caveats

Hosting the simulation LLM alongside the policy model demands significant GPU resources; the docs suggest multi-GPU serving even for the simulator.
Quantitative claims rely on figures rather than prose; the README cites outperformance of real search but omits specific metrics in text.
Tightly coupled to Google and Wikipedia simulation backends; transfer to other retrieval sources is unexplored.

Verdict

Worth a look if you’re training tool-use LLMs and want to dodge search API bills during RL. Skip it if you need a plug-and-play search agent or lack the hardware to host two large models simultaneously.

Frequently asked

What is Alibaba-NLP/ZeroSearch?: A reinforcement learning framework that teaches language models to query and reason over search results without ever calling a live search API during training.
Is ZeroSearch open source?: Yes — Alibaba-NLP/ZeroSearch is open source, released under the Apache-2.0 license.
What language is ZeroSearch written in?: Alibaba-NLP/ZeroSearch is primarily written in Python.
How popular is ZeroSearch?: Alibaba-NLP/ZeroSearch has 1.3k stars on GitHub.
Where can I find ZeroSearch?: Alibaba-NLP/ZeroSearch is on GitHub at https://github.com/Alibaba-NLP/ZeroSearch.