clips/pattern
A Python web mining library with built-in NLP and ML tools for text classification, sentiment analysis, POS tagging, and clustering.

Pattern is a web mining module for Python that bundles data extraction, natural language processing (NLP), and machine learning (ML) tools. It provides part-of-speech taggers, sentiment analysis, WordNet integration, and classical ML classifiers including k-nearest neighbors (KNN), support vector machine (SVM), and Perceptron. The library also includes tools for web crawling, HTML parsing, network graph analysis, and vector space modeling, which makes it suitable for building text-based ML pipelines and data mining workflows.
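
To make the vector space modeling and KNN classification mentioned above more concrete, here is a minimal sketch of the underlying technique in plain Python. It deliberately does not use Pattern's own API: the function names (`bag_of_words`, `cosine_similarity`, `knn_classify`) and the toy training data are hypothetical and exist only for this illustration. Documents become word-count vectors, similarity is the cosine between two vectors, and a new text receives the majority label of its k most similar training documents.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text, split it into word tokens, and count them."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


def knn_classify(text, labeled_docs, k=3):
    """Return the majority label among the k training docs most similar to text."""
    query = bag_of_words(text)
    ranked = sorted(
        labeled_docs,
        key=lambda doc: cosine_similarity(query, bag_of_words(doc[0])),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy training data, made up for this illustration.
TRAINING = [
    ("great movie with a wonderful cast", "positive"),
    ("a wonderful and touching story", "positive"),
    ("great acting and a great script", "positive"),
    ("boring plot and terrible acting", "negative"),
    ("a terrible waste of time", "negative"),
    ("dull boring and far too long", "negative"),
]

if __name__ == "__main__":
    print(knn_classify("a wonderful cast and great script", TRAINING))
    print(knn_classify("terrible and boring", TRAINING))
```

A library such as Pattern typically adds term weighting, stop-word filtering, and stemming on top of this basic idea, so treat the sketch as a conceptual outline rather than a description of the library's internals.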