echogarden-project/echogarden
Cross-platform speech toolset providing text-to-speech, speech-to-text, forced alignment, and related audio processing via ML models.

Echogarden is a TypeScript/Node.js library and CLI tool for speech processing. It integrates multiple speech synthesis and recognition engines, including OpenAI Whisper (ported to TypeScript/ONNX), Kokoro, VITS, and whisper.cpp for offline operation. It also supports cloud services from Google, Microsoft, Amazon, OpenAI, and ElevenLabs. The project runs entirely in Node.js without Python dependencies by leveraging ONNX runtime and WebAssembly for portable ML inference.