← all repositories

shibing624/text2vec

Text2vec is a Python library that converts text into vector embeddings using models like Word2Vec, Sentence-BERT, and CoSENT, enabling text similarity computation and semantic search.

text2vec
Velocity · 7d
+2.1
★ / day
Trend
steady
star history

The library provides implementations of multiple text representation models including Word2Vec, RankBM25, BERT-based sentence encoders, and CoSENT for semantic similarity. It supports multi-GPU inference and includes a CLI tool for batch text vectorization. The project also publishes pre-trained Chinese matching models on HuggingFace and enables training of FlagEmbedding models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.