
speechio/chinese_text_normalization

A ready-to-use Chinese text normalization module designed for Automatic Speech Recognition (ASR) text processing pipelines.

730 stars · Python · Data Tooling · Domain Apps
Velocity (7d): +0.3 ★ / day · Trend: steady

This project provides a language-specific text normalization module for Chinese speech processing applications. It handles conversion of non-standard words such as cardinals, dates, digits, fractions, and other numeric expressions into their spoken forms, which is a critical preprocessing step for ASR output. The module integrates with Kaldi-ASR and uses Sparrowhawk/Thrax grammar-based rule processing.
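To illustrate the kind of conversion described above, here is a minimal sketch of cardinal normalization. It is not this project's code or API: the function names `cardinal_to_chinese` and `normalize_cardinals` are hypothetical, the rules cover only integers from 0 to 9999, and longer digit runs are simply read digit by digit as a simplifying assumption.

```python
import re

DIGITS = "零一二三四五六七八九"
UNITS = ["", "十", "百", "千"]  # ones, tens, hundreds, thousands


def cardinal_to_chinese(n: int) -> str:
    """Spell an integer in 0..9999 as a spoken Chinese cardinal (illustrative only)."""
    if not 0 <= n <= 9999:
        raise ValueError("this sketch only handles 0..9999")
    if n == 0:
        return "零"
    digits = str(n)
    out = []
    pending_zero = False
    for i, ch in enumerate(digits):
        d = int(ch)
        unit = UNITS[len(digits) - 1 - i]
        if d == 0:
            # A run of zeros is spoken as a single 零, and only if a
            # non-zero digit follows it.
            pending_zero = True
            continue
        if pending_zero:
            out.append("零")
            pending_zero = False
        out.append(DIGITS[d] + unit)
    result = "".join(out)
    # 10..19 are spoken as 十, 十一, ... rather than 一十, 一十一, ...
    if result.startswith("一十"):
        result = result[1:]
    return result


def normalize_cardinals(text: str) -> str:
    """Replace each run of ASCII digits in text with a spoken form (hypothetical helper)."""

    def spell(match: re.Match) -> str:
        run = match.group(0)
        if len(run) <= 4 and not (len(run) > 1 and run[0] == "0"):
            return cardinal_to_chinese(int(run))
        # Assumption: long or zero-prefixed runs are read digit by digit.
        return "".join(DIGITS[int(c)] for c in run)

    return re.sub(r"[0-9]+", spell, text)


print(normalize_cardinals("我有105个苹果"))
```

A real normalizer also has to decide from context whether a digit string is a cardinal, a date, a phone number, or a fraction, which is the part that grammar-based rules are meant to handle.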
