SeanNaren/deepspeech.pytorch
A PyTorch implementation of the DeepSpeech2 end-to-end speech recognition model.

Velocity (7d): +0.6 ★/day · Trend: steady
This repository implements the DeepSpeech2 architecture for automatic speech recognition (ASR). It covers training (including distributed multi-node training via PyTorch Lightning), testing, and inference. During decoding, the model can optionally use a KenLM language model to improve transcription accuracy. Dataset support is included for the AN4, TEDLIUM, VoxForge, and Common Voice speech benchmarks.
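To illustrate how a language model can improve transcriptions at decode time, here is a minimal sketch in plain Python. It is not the repository's decoder. It assumes the acoustic model emits per-frame character probabilities trained with CTC (as in DeepSpeech2), that index 0 is the blank symbol, and that a language model is available as a function returning a log-probability for a string. The names `greedy_ctc_decode`, `rescore`, `alpha`, and `beta` are hypothetical and chosen for illustration only.

```python
from typing import Callable, List, Sequence, Tuple

BLANK = 0  # assumption: index 0 is the CTC blank symbol


def greedy_ctc_decode(frame_probs: Sequence[Sequence[float]],
                      labels: Sequence[str]) -> str:
    """Pick the most likely label per frame, collapse repeats, drop blanks."""
    best = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    out: List[str] = []
    prev = None
    for idx in best:
        # Emit a character only when the label changes and is not blank.
        if idx != prev and idx != BLANK:
            out.append(labels[idx])
        prev = idx
    return "".join(out)


def rescore(candidates: Sequence[Tuple[str, float]],
            lm_log_prob: Callable[[str], float],
            alpha: float = 0.5,
            beta: float = 1.0) -> str:
    """Return the candidate with the best combined score.

    Each candidate is (text, acoustic_log_prob). The combined score adds a
    weighted language-model log-probability and a word-count bonus.
    """
    def total(item: Tuple[str, float]) -> float:
        text, acoustic = item
        return acoustic + alpha * lm_log_prob(text) + beta * len(text.split())

    return max(candidates, key=total)[0]


# Toy inputs: 5 frames over the alphabet ["_", "a", "b"], "_" being blank.
labels = ["_", "a", "b"]
frames = [
    [0.1, 0.8, 0.1],
    [0.1, 0.8, 0.1],
    [0.9, 0.05, 0.05],
    [0.1, 0.8, 0.1],
    [0.1, 0.1, 0.8],
]
decoded = greedy_ctc_decode(frames, labels)

# Toy language model that prefers "the cat" (a stand-in for a real LM).
toy_lm = lambda text: -1.0 if text == "the cat" else -5.0
picked = rescore([("the cat", -4.0), ("the cut", -3.8)], toy_lm)
```

In the toy example, "the cut" has the slightly better acoustic score, but the language-model term tips the choice to "the cat". A real setup would generate candidates with a beam search and tune the weights on held-out data.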