← all repositories

Alibaba-NLP/ZeroSearch

ZeroSearch trains large language models to develop search capabilities using simulation-based reinforcement learning without real search engines.

ZeroSearch
Velocity · 7d
+3.2
★ / day
Trend
steady
star history

ZeroSearch is a training framework that incentivizes LLMs to acquire search-like reasoning capabilities through reinforcement learning with simulation LLMs. Rather than using actual search engines during training, it trains policy models on simulated search environments, then deploys them with real search APIs. The project releases policy models, simulation LLMs, and datasets compatible with Wikipedia and Google Search on Hugging Face.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.