← all repositories
FudanNLP/fnlp

FudanNLP: A Chinese NLP toolkit frozen in 2018

A once-active Java toolkit for Chinese NLP that now serves mainly as a signpost to its successor.

2.7k stars Java Other AI
fnlp
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

What it does FudanNLP (FNLP) is a Java toolkit for Chinese natural language processing. It handles word segmentation, part-of-speech tagging, named entity recognition, keyword extraction, dependency parsing, and temporal phrase recognition, plus text classification and clustering. Models ship separately from source code due to size.

The interesting bit The README’s top line is a eulogy: the project was superseded by FastNLP in December 2018 and “no longer updated.” That makes this less a living tool than a historical artifact — one that still draws stars (2,690) and occasional curiosity from developers maintaining legacy Java stacks or studying how Chinese NLP was done before the PyTorch era.

Key highlights

  • Pure Java implementation with LGPL 3.0 licensing
  • Covers the full classic Chinese NLP pipeline: seg → POS → NER → parsing
  • Ships with pre-trained models (seg.m, pos.m, dep.m) via GitHub Releases
  • Published at ACL 2013, giving it academic pedigree
  • Successor FastNLP is the actively maintained replacement

Caveats

  • Explicitly unmaintained since 2018; no bug fixes or model updates
  • Model download and manual “models” directory setup required
  • Documentation and community support (QQ group, Google Code links) show their age

Verdict Worth a look if you’re maintaining a legacy Java system or studying NLP toolkit evolution. Everyone else should head straight to FastNLP.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.