← all repositories

inejc/paragraph-vectors

A PyTorch implementation of Paragraph Vectors (doc2vec) for generating vector representations of text documents.

415 stars Python Language ModelsML Frameworks
paragraph-vectors
Velocity · 7d
+0.1
★ / day
Trend
steady
star history

This library implements the Paragraph Vectors algorithm (also known as doc2vec) for creating dense vector representations of variable-length text documents. It minimizes the Negative Sampling objective as proposed by Mikolov et al., enabling efficient sparse updates during training. The implementation supports parallel batch generation on CPU while training on GPU, providing a flexible tool for unsupervised document embedding generation.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.