
blmoistawinde/HarvestText

A Python toolkit for text mining and preprocessing using unsupervised and weakly-supervised methods.

2.6k stars · Python · Data Tooling
Velocity (7d): +1.0 ★/day · Trend: steady

HarvestText is a text mining library focused on unsupervised and weakly-supervised approaches for domain-specific text processing. It provides capabilities including text cleaning, new word discovery, sentiment analysis, named entity recognition and linking, keyword extraction, knowledge extraction, and syntactic parsing. The library integrates domain knowledge such as entity types and aliases for specialized analysis tasks.
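To make the idea of integrating entity types and aliases concrete, here is a minimal sketch of dictionary-based entity linking in plain Python. It does not use HarvestText's API: the names `ENTITY_ALIASES`, `ENTITY_TYPES`, `build_alias_index`, and `link_entities` are hypothetical, and the sample entities are invented for illustration. It only shows the general technique of mapping surface forms back to a canonical entity and its type.

```python
import re

# Hypothetical domain knowledge (not taken from HarvestText):
# canonical entity -> known aliases, and canonical entity -> type.
ENTITY_ALIASES = {
    "Lionel Messi": ["Messi", "Leo Messi"],
    "FC Barcelona": ["Barcelona", "Barca"],
}
ENTITY_TYPES = {
    "Lionel Messi": "player",
    "FC Barcelona": "team",
}


def build_alias_index(entity_aliases):
    """Map every surface form (canonical name or alias) to its canonical entity."""
    index = {}
    for entity, aliases in entity_aliases.items():
        for form in [entity, *aliases]:
            index[form] = entity
    return index


def link_entities(text, entity_aliases, entity_types):
    """Return (start, end, canonical_entity, entity_type) for each mention found."""
    index = build_alias_index(entity_aliases)
    # Try longer surface forms first so "Leo Messi" is preferred over "Messi"
    # when both could match at the same position.
    forms = sorted(index, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(form) for form in forms))
    mentions = []
    for match in pattern.finditer(text):
        entity = index[match.group()]
        mentions.append((match.start(), match.end(), entity, entity_types[entity]))
    return mentions


if __name__ == "__main__":
    for mention in link_entities("Leo Messi scored for Barca.", ENTITY_ALIASES, ENTITY_TYPES):
        print(mention)
```

Exact string matching is the simplest possible linker. A real toolkit would also need to handle ambiguous aliases and tokenization, which this sketch leaves out.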
