← all repositories

k2-fsa/sherpa-ncnn

A real-time speech-to-text, TTS, and VAD system built on neural network models for offline local execution across mobile, desktop, and embedded platforms.

sherpa-ncnn
Velocity · 7d
+1.3
★ / day
Trend
steady
star history

Sherpa-ncnn provides streaming and offline speech recognition capabilities by implementing modern Kaldi-style neural network models that run entirely on-device without internet connectivity. It supports voice activity detection using models like Silero VAD and text-to-speech using VITS models from the Piper project. The system uses the ncnn neural network inference library for efficient cross-platform deployment, targeting resource-constrained environments including mobile devices, embedded systems, and WebAssembly.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.