← all repositories

nobody132/masr

A Mandarin Chinese speech recognition system using gated convolutional neural networks trained on the AISHELL-1 dataset.

2k stars Python Image · Video · Audio
masr
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

The project implements an end-to-end deep learning model for Chinese Mandarin speech recognition based on gated convolutional networks similar to Facebook’s Wav2letter architecture. It uses Gated Linear Units (GLU) as activation functions for faster convergence. The model is trained on the AISHELL-1 dataset containing 150 hours of recordings covering over 4000 Chinese characters. Evaluation uses Character Error Rate (CER) as the metric, with the validation set reaching approximately 11% CER.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.