← all repositories

MaartenGr/KeyBERT

KeyBERT is a keyword extraction library that uses BERT embeddings to identify the most similar words and phrases to an entire document.

4.2k stars Python RAG · Search
KeyBERT
Velocity · 7d
+2.0
★ / day
Trend
steady
star history

It leverages transformer-based embeddings to create semantic representations of both documents and candidate phrases, then uses cosine similarity to rank phrases by their relevance to the source. The library supports configurable embedding models and includes techniques like Max Sum Distance and Maximal Marginal Relevance to improve diversity in extracted keywords.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.