← all repositories

gooofy/zamia-speech

Toolkit for building and training automatic speech recognition models from speech corpora.

447 stars Python Image · Video · Audio
zamia-speech
Velocity · 7d
+0.1
★ / day
Trend
steady
star history

This repository provides scripts for constructing ASR audio models (Kaldi nnet3 chain, wav2letter++), language models (KenLM in ARPA format), and grapheme-to-phoneme models (sequitur g2p) from speech data sources like VoxForge. It processes speech and text corpora to train models that decode audio to text. Pre-trained models and a Docker-based STT demo service are also provided.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.