salute-developers/GigaAM
Open-source foundational acoustic model for speech processing with self-supervised pre-training and CTC/RNN-T fine-tuning.

GigaAM provides a family of open-source acoustic models for speech recognition tasks. It uses self-supervised learning for pre-training and supports both CTC and RNN-T decoding heads. The models achieve state-of-the-art results for Russian language speech recognition and include emotion recognition capabilities. The repository includes model fine-tuning scripts, Triton Inference Server integration with TensorRT optimization, and ONNX export support.