k2-fsa/sherpa-onnx
A next-gen Kaldi-based speech processing toolkit running ONNX models for offline ASR, TTS, speaker diarization, VAD, and speech enhancement.

sherpa-onnx is a speech processing library that uses ONNX Runtime to run neural network models for automatic speech recognition, text-to-speech synthesis, speaker diarization, voice activity detection, keyword spotting, and speech enhancement. It runs on platforms ranging from embedded devices to servers and can be used from multiple programming languages, including C++, Python, Java, JavaScript, C#, Kotlin, and Swift.