← all repositories

stanford-futuredata/ColBERT

A neural retrieval model that encodes passages and queries into token-level embedding matrices and uses late interaction to efficiently compute fine-grained similarity at scale.

3.9k stars Python RAG · SearchLanguage Models
ColBERT
Velocity · 7d
+1.8
★ / day
Trend
steady
star history

ColBERT is a fast and accurate retrieval system built on BERT that encodes each passage into a matrix of token-level embeddings. At search time, it embeds queries into similar matrices and employs scalable vector-similarity MaxSim operators to efficiently find contextually matching passages. This late-interaction approach enables it to surpass single-vector representation models in quality while maintaining the ability to scale to large document collections.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.