yaoguangluo/Deta_Parser
Chinese word segmentation library that reportedly processes over 16 million characters per second using HMM and neural network algorithms.

Velocity (7d): +0.2 ★/day · Trend: steady
A high-speed Chinese text segmentation library for NLP tasks. It implements word segmentation, part-of-speech (POS) tagging, and text mining using Hidden Markov Models (HMMs) and neural network approaches. The project claims a peak throughput of 16.3 million Chinese characters per second and support for mixed-language text spanning dozens of languages.
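
To illustrate what HMM-based segmentation generally looks like, here is a minimal sketch of the common character-tagging formulation: each character is labeled B (begin), M (middle), E (end), or S (single-character word), and Viterbi decoding picks the most likely tag sequence. This is not Deta_Parser's code or API. The function names, the BMES scheme, and every probability below are assumptions chosen for illustration, and a real model would estimate these values from a tagged corpus.

```python
import math

STATES = "BMES"
NEG_INF = float("-inf")
FLOOR = math.log(1e-6)  # assumed fallback log-probability for unseen characters

# Hypothetical, hand-picked log-probabilities (not trained values).
START = {"B": math.log(0.6), "S": math.log(0.4)}  # a word cannot start in M or E
TRANS = {
    "B": {"M": math.log(0.3), "E": math.log(0.7)},
    "M": {"M": math.log(0.4), "E": math.log(0.6)},
    "E": {"B": math.log(0.6), "S": math.log(0.4)},
    "S": {"B": math.log(0.6), "S": math.log(0.4)},
}
EMIT = {
    "B": {"中": math.log(0.4), "分": math.log(0.4)},
    "E": {"文": math.log(0.4), "词": math.log(0.4)},
    "S": {"很": math.log(0.3), "快": math.log(0.3)},
    "M": {},
}


def emit(state, ch):
    """Log-probability that `state` emits `ch`, with a floor for unseen pairs."""
    return EMIT[state].get(ch, FLOOR)


def viterbi(text):
    """Return the most likely BMES tag sequence for `text` under the toy model."""
    if not text:
        return []
    scores = [{s: START.get(s, NEG_INF) + emit(s, text[0]) for s in STATES}]
    back = []  # back[i][s] = best predecessor of state s at position i + 1
    for ch in text[1:]:
        prev, cur, ptr = scores[-1], {}, {}
        for s in STATES:
            best = max(STATES, key=lambda p: prev[p] + TRANS[p].get(s, NEG_INF))
            cur[s] = prev[best] + TRANS[best].get(s, NEG_INF) + emit(s, ch)
            ptr[s] = best
        scores.append(cur)
        back.append(ptr)
    # A complete segmentation must finish on a word boundary: E or S.
    tags = [max("ES", key=lambda s: scores[-1][s])]
    for ptr in reversed(back):
        tags.append(ptr[tags[-1]])
    tags.reverse()
    return tags


def segment(text):
    """Split `text` into words by cutting after every E or S tag."""
    words, start = [], 0
    for i, tag in enumerate(viterbi(text)):
        if tag in "ES":
            words.append(text[start:i + 1])
            start = i + 1
    return words
```

Calling `segment("中文分词很快")` returns a list of substrings that concatenate back to the input. Where the cuts fall depends entirely on the toy probabilities above. Decoding makes one pass over the text with a constant number of states, so it is linear in the input length.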