Is xmnlp open source?

Yes — SeanLee97/xmnlp is open source, released under the Apache-2.0 license.

What language is xmnlp written in?

SeanLee97/xmnlp is primarily written in Python.

How popular is xmnlp?

SeanLee97/xmnlp has 1.3k stars on GitHub.

Where can I find xmnlp?

SeanLee97/xmnlp is on GitHub at https://github.com/SeanLee97/xmnlp.


SeanLee97/xmnlp

A Swiss-Army knife for Chinese text that fits in one import

xmnlp bundles a dozen Chinese NLP tasks—segmentation, NER, sentiment, pinyin, even radicals—behind a single pip install, with ONNX models you download separately.

★ 1.3k stars · Python · Language Models


What it does

xmnlp is an all-in-one Chinese NLP toolkit. It handles word segmentation, part-of-speech tagging, named-entity recognition, sentiment analysis, text correction, keyword/keyphrase extraction, pinyin conversion, and even Chinese character radical lookup. Most heavy lifting runs through RoBERTa + CRF models exported to ONNX, with faster rule-based fallbacks (reverse maximum matching) when you don’t need neural precision.
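To make the rule-based path concrete, here is a minimal sketch of reverse maximum matching in plain Python. It is not xmnlp's implementation: the function name, the `max_len` parameter, and the toy dictionary are assumptions chosen for illustration.

```python
def reverse_max_match(text, vocab, max_len=4):
    """Segment text right to left, taking the longest dictionary word each time."""
    tokens = []
    end = len(text)
    while end > 0:
        # Try the longest candidate ending at `end` first, then shrink it.
        for size in range(min(max_len, end), 0, -1):
            candidate = text[end - size:end]
            # A single character is always accepted so the scan cannot stall.
            if size == 1 or candidate in vocab:
                tokens.append(candidate)
                end -= size
                break
    tokens.reverse()  # tokens were collected from the end of the text
    return tokens


# Toy dictionary (hypothetical; a real lexicon would be far larger).
VOCAB = {"研究", "研究生", "生命", "起源", "的"}
print(reverse_max_match("研究生命的起源", VOCAB))
```

Scanning from the right keeps 生命 together here, whereas a left-to-right longest match would greedily take 研究生 first.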

The interesting bit

The “speed vs. accuracy” dial is explicit: the segmentation and POS tagging interfaces expose both fast_* and deep_* variants, so you can trade neural nuance for throughput without swapping libraries. The radical and pinyin features are plain lookups, a hash map for radicals and a trie for pinyin: simple, but oddly hard to find bundled with modern transformer-based tools.
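The hash map and trie lookups mentioned above can be sketched in a few lines. This is an illustrative toy, not xmnlp's code: the `PinyinTrie` class, its methods, and the tiny tables are all assumptions, and tones are omitted.

```python
class TrieNode:
    __slots__ = ("children", "value")

    def __init__(self):
        self.children = {}
        self.value = None  # list of pinyin syllables if a word ends here


class PinyinTrie:
    def __init__(self):
        self.root = TrieNode()

    def insert(self, word, pinyin):
        node = self.root
        for ch in word:
            node = node.children.setdefault(ch, TrieNode())
        node.value = pinyin

    def longest_match(self, text, start):
        """Return (length, pinyin) of the longest entry starting at `start`."""
        node = self.root
        best = (0, None)
        for i in range(start, len(text)):
            node = node.children.get(text[i])
            if node is None:
                break
            if node.value is not None:
                best = (i - start + 1, node.value)
        return best

    def convert(self, text):
        out, i = [], 0
        while i < len(text):
            length, pinyin = self.longest_match(text, i)
            if length == 0:
                out.append(text[i])  # pass unknown characters through
                i += 1
            else:
                out.extend(pinyin)
                i += length
        return out


# Hypothetical entries; the multi-character entry resolves the polyphonic 行.
trie = PinyinTrie()
for word, py in [("行", ["xing"]), ("银", ["yin"]),
                 ("银行", ["yin", "hang"]), ("人", ["ren"])]:
    trie.insert(word, py)

# Radical lookup as a plain dict (hash map).
RADICALS = {"银": "钅", "河": "氵", "林": "木"}

print(trie.convert("银行人行"))
print(RADICALS.get("河"))
```

The trie earns its place through longest-prefix matching: 行 reads as "hang" inside 银行 but falls back to the single-character entry elsewhere.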

Key highlights

Segmentation, POS tagging, and NER via RoBERTa + CRF finetuning, with custom dictionary support (jieba-compatible format)
Sentiment analysis and spell-checking (detector + corrector) included
Keyword/keyphrase extraction via TextRank
Sentence embeddings and similarity calculation
ONNX Runtime inference; supports Python 3.6–3.8 on Linux, Windows, macOS
Models downloaded separately via Feishu or Baidu Netdisk—version-locked to the package
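For readers unfamiliar with TextRank, the keyword step listed above can be approximated as PageRank over a word co-occurrence graph. The sketch below works on already-segmented tokens; the function name, parameters, and defaults are assumptions, and xmnlp's own implementation may differ in weighting and filtering.

```python
from collections import defaultdict


def textrank_keywords(tokens, window=2, damping=0.85, iterations=50, top_k=3):
    # Build an undirected co-occurrence graph: words within `window`
    # positions of each other are linked.
    neighbors = defaultdict(set)
    for i, word in enumerate(tokens):
        for j in range(i + 1, min(i + window + 1, len(tokens))):
            if tokens[j] != word:
                neighbors[word].add(tokens[j])
                neighbors[tokens[j]].add(word)

    # Iterate the PageRank update; each node shares its score equally
    # among its neighbors.
    scores = {word: 1.0 for word in neighbors}
    for _ in range(iterations):
        new_scores = {}
        for word in neighbors:
            rank = sum(scores[n] / len(neighbors[n]) for n in neighbors[word])
            new_scores[word] = (1 - damping) + damping * rank
        scores = new_scores

    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]


tokens = ["自然", "语言", "处理", "语言", "模型", "语言", "分词"]
print(textrank_keywords(tokens, window=1, top_k=1))
```

In this toy input 语言 neighbors every other word, so it accumulates the highest score.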

Caveats

Deep model interfaces are Simplified-Chinese only; no Traditional Chinese support
Model weights are hosted on Chinese cloud services (Feishu/Baidu), not HuggingFace or GitHub releases
Python 3.6–3.8 support suggests the project may not be actively tracking newer releases

Verdict

Good fit if you need one library to cover the full Chinese NLP pipeline without orchestrating multiple dependencies. Skip it if you require Traditional Chinese, want models pip-installable from PyPI, or need the bleeding-edge accuracy of dedicated single-task libraries.
