← all repositories

salute-developers/GigaAM

Open-source foundational acoustic model for speech processing with self-supervised pre-training and CTC/RNN-T fine-tuning.

GigaAM
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

GigaAM provides a family of open-source acoustic models for speech recognition tasks. It uses self-supervised learning for pre-training and supports both CTC and RNN-T decoding heads. The models achieve state-of-the-art results for Russian language speech recognition and include emotion recognition capabilities. The repository includes model fine-tuning scripts, Triton Inference Server integration with TensorRT optimization, and ONNX export support.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.