
jbesomi/texthero

A Python library for fast text preprocessing, representation, and visualization built on top of Pandas.

2.9k stars · Python · Data Tooling · Language Models
Velocity · 7d: +1.3 ★ / day · Trend: steady

Texthero provides utilities for cleaning, preprocessing, and representing text data for NLP and ML workflows. It offers functions for tokenization, stemming, named entity recognition, and text vectorization including word embeddings. The library integrates with Pandas for easy manipulation of text columns and includes visualization tools for exploring text datasets.
