← all repositories

mravanelli/pytorch-kaldi

A speech recognition toolkit that combines PyTorch deep neural networks with the Kaldi toolkit for acoustic modeling and decoding.

2.4k stars Python Domain AppsLanguage Models
pytorch-kaldi
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

PyTorch-Kaldi is an open-source toolkit for building state-of-the-art DNN/RNN hybrid speech recognition systems. The project integrates PyTorch for managing deep neural network components (DNN, RNN, LSTM, GRU) with the Kaldi toolkit for feature extraction, label computation, and decoding. It implements the DNN-HMM hybrid approach common in ASR systems, where neural networks provide acoustic predictions that feed into a hidden Markov model decoder. The toolkit aims to improve transparency and replicability of speech recognition research.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.