A PHP-native library that brings text analysis, sentiment scoring, and document classification to codebases that can't justify a Python microservice.
Language Models
A Python tool that parses natural language questions and turns them into executable SQL using only a database dump, with no live connection required.
A clean reference implementation of DSSM/CLSM for learning text similarity, minus the proprietary Bing data you'd need to actually run it.
A 2015-era Keras notebook that teaches RNNs to summarize news articles by learning when to paraphrase and when to just lift words from the source text.
A from-scratch walkthrough that treats backprop through time as a computation graph problem, not a math ritual.
A Python library that treats Hindi, Tamil, Bengali, and friends as a family rather than isolated problems.
A reference implementation of multilayered bidirectional LSTMs for named entity recognition, with a side of embedding archaeology.
A Go-native toolkit for training word embeddings when you don't want to leave the gopher ecosystem.
MeTA bundles tokenization, search indexes, topic models, and CRFs into one compiled toolkit for researchers who'd rather fight algorithms than package managers.
A research-grade neural machine translation framework built before Transformers ate the world, still supporting RNNs, attention variants, and interactive translation.
One Python toolkit that wraps classical ML, deep learning, and transformers behind a uniform API so you can swap algorithms without rewriting plumbing.
A thin Python wrapper around Stanford's Java CoreNLP server, now officially abandoned in favor of Stanza.
A Java-based bot platform that predates the LLM era and still runs on JUnit and Objective-C.
A Python library that reverse-engineers the 5W1H structure from news articles, because someone finally decided to treat reporters' training as a spec.
Someone finally organized the firehose of NLP research into a single, obsessively maintained list.
A thin wrapper that lets beginners train chatbots and phoneme converters without drowning in TensorFlow boilerplate.
ChatLearner bolts rule-based reasoning onto TensorFlow's seq2seq model so it can tell time, solve arithmetic, and read jokes—things pure neural chatbots flunk.
A scikit-learn-flavored toolkit that turns messy conversations into measurable social signals.
A self-contained Java morphological analyzer that ships its own dictionaries so you don't have to wrestle with MeCab.
A Chinese question-answering system whose author tells you not to use it.










