blmoistawinde/HarvestText
A Python toolkit for text mining and preprocessing using unsupervised and weakly-supervised methods.

Velocity · 7d
+1.0
★ / day
Trend
→steady
star history
HarvestText is a text mining library focused on unsupervised and weakly-supervised approaches for domain-specific text processing. It provides capabilities including text cleaning, new word discovery, sentiment analysis, named entity recognition and linking, keyword extraction, knowledge extraction, and syntactic parsing. The library integrates domain knowledge such as entity types and aliases for specialized analysis tasks.