yaoguangluo/Deta_Parser

Chinese word segmentation library processing over 16 million characters per second using HMM and neural network algorithms.

480 stars · Java · Data Tooling · Language Models
Velocity (7d): +0.2 ★/day · Trend: steady

Deta_Parser is a high-speed Chinese word segmentation library for NLP tasks. It implements word segmentation, part-of-speech (POS) tagging, and text mining using Hidden Markov Models (HMMs) and neural network approaches. The project claims a peak throughput of 16.3 million Chinese characters per second and support for mixed-language text spanning dozens of languages.
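
To make the HMM approach mentioned above concrete, here is a minimal sketch of character-level segmentation with B/M/E/S tags (word begin, middle, end, single-character word) decoded by the Viterbi algorithm. It is not Deta_Parser's API or implementation: the class name `HmmSegmenterSketch`, the `segment` method, and every probability in it are hypothetical toy values chosen for illustration, where a real model would estimate them from a tagged corpus.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of Deta_Parser.
final class HmmSegmenterSketch {
    // Tag indices: B = word begin, M = word middle, E = word end, S = single-character word.
    private static final int B = 0, M = 1, E = 2, S = 3;

    // Toy start probabilities (assumed): a sentence can only start with B or S.
    private static final double[] START = logs(0.6, 0.0, 0.0, 0.4);

    // Toy transition probabilities TRANS[from][to]; zeros forbid invalid tag sequences.
    private static final double[][] TRANS = {
        logs(0.0, 0.3, 0.7, 0.0), // B -> M or E
        logs(0.0, 0.3, 0.7, 0.0), // M -> M or E
        logs(0.5, 0.0, 0.0, 0.5), // E -> B or S
        logs(0.5, 0.0, 0.0, 0.5), // S -> B or S
    };

    // Toy emission probabilities per character, ordered {B, M, E, S}.
    // Characters are written as Unicode escapes so the file compiles under any source encoding.
    private static final Map<Character, double[]> EMIT = Map.of(
        '\u6211', logs(0.10, 0.05, 0.05, 0.80),  // U+6211 "wo" (I): usually a single-character word
        '\u7231', logs(0.20, 0.05, 0.05, 0.70),  // U+7231 "ai" (love): usually a single-character word
        '\u5317', logs(0.80, 0.05, 0.05, 0.10),  // U+5317 "bei": usually begins a word
        '\u4eac', logs(0.05, 0.05, 0.80, 0.10)); // U+4EAC "jing": usually ends a word

    // Characters outside the toy table are equally likely under every tag.
    private static final double[] UNKNOWN = logs(0.25, 0.25, 0.25, 0.25);

    // Convert probabilities to log space; Math.log(0) is -Infinity, which rules a path out.
    private static double[] logs(double... p) {
        double[] out = new double[p.length];
        for (int i = 0; i < p.length; i++) out[i] = Math.log(p[i]);
        return out;
    }

    // Viterbi decoding: find the most likely tag sequence, then cut after every E or S tag.
    static List<String> segment(String text) {
        List<String> words = new ArrayList<>();
        int n = text.length();
        if (n == 0) return words;

        double[][] score = new double[n][4]; // best log-probability of a path ending in each tag
        int[][] back = new int[n][4];        // predecessor tag on that best path

        double[] e0 = EMIT.getOrDefault(text.charAt(0), UNKNOWN);
        for (int s = 0; s < 4; s++) score[0][s] = START[s] + e0[s];

        for (int t = 1; t < n; t++) {
            double[] e = EMIT.getOrDefault(text.charAt(t), UNKNOWN);
            for (int s = 0; s < 4; s++) {
                int arg = 0;
                double best = Double.NEGATIVE_INFINITY;
                for (int p = 0; p < 4; p++) {
                    double cand = score[t - 1][p] + TRANS[p][s];
                    if (cand > best) { best = cand; arg = p; }
                }
                score[t][s] = best + e[s];
                back[t][s] = arg;
            }
        }

        // A sentence must end on a completed word, so only E and S are valid final tags.
        int[] tags = new int[n];
        tags[n - 1] = score[n - 1][E] > score[n - 1][S] ? E : S;
        for (int t = n - 1; t > 0; t--) tags[t - 1] = back[t][tags[t]];

        int start = 0;
        for (int t = 0; t < n; t++) {
            if (tags[t] == E || tags[t] == S) {
                words.add(text.substring(start, t + 1));
                start = t + 1;
            }
        }
        return words;
    }

    public static void main(String[] args) {
        // Segments the four-character sentence "wo ai bei jing" (I love Beijing).
        System.out.println(segment("\u6211\u7231\u5317\u4eac"));
    }
}
```

With these toy parameters, the input 我爱北京 decodes to the tag sequence S S B E and is split into three words: 我, 爱, 北京. The sketch works in log space to avoid underflow on long inputs and indexes by `char`, so it assumes text in the Basic Multilingual Plane.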
