← all repositories

clips/pattern

A Python web mining library with built-in NLP and ML tools for text classification, sentiment analysis, POS tagging, and clustering.

8.9k stars Python ML FrameworksData Tooling
pattern
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

Pattern is a comprehensive web mining module for Python that bundles data extraction, NLP, and ML capabilities. It provides part-of-speech taggers, sentiment analysis, WordNet integration, and classical ML algorithms including KNN, SVM, and Perceptron classifiers. The library also includes tools for web crawling, HTML parsing, network graph analysis, and vector space modeling, making it suitable for building text-based ML pipelines and data mining workflows.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.