
xhluca/bm25s

An ultrafast pure-Python implementation of the BM25 ranking function for text retrieval, commonly used in RAG pipelines.

1.7k stars · Python · RAG · Search
Velocity (7d): +2.2 ★/day · Trend: steady

BM25S provides an efficient implementation of BM25, a classic probabilistic ranking function for keyword-based text retrieval. The library leverages sparse matrices and Numba JIT compilation to achieve high performance for document ranking tasks. It is frequently used as the retrieval backend in Retrieval-Augmented Generation (RAG) pipelines alongside vector search systems.
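To make the ranking function concrete, here is a minimal sketch of BM25 scoring in plain Python. It is not the library's implementation, which relies on sparse matrices for speed. The function name `bm25_scores`, the whitespace tokenization, the Lucene-style IDF variant, and the defaults `k1=1.5` and `b=0.75` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against a tokenized query with BM25.

    Hypothetical helper for illustration; k1 and b are assumed defaults.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Lucene-style IDF (an assumed variant), always non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


corpus = [
    "a cat is a feline and likes to purr",
    "a dog is the human's best friend",
    "a bird is a beautiful animal that can fly",
]
# Naive whitespace tokenization, assumed for brevity.
docs = [text.split() for text in corpus]
scores = bm25_scores("does the cat purr".split(), docs)
best = max(range(len(docs)), key=scores.__getitem__)
print(corpus[best])
```

In a RAG pipeline, the top-scoring documents from a keyword ranker like this would typically be passed to the generator, often merged with results from a vector search.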
