FireRedTeam/FireRedASR2S
A production-grade all-in-one ASR pipeline providing speech-to-text, voice activity detection, language identification, and punctuation restoration.

FireRedASR2S is a speech recognition system combining multiple neural network modules: FireRedASR2 for ASR supporting Chinese, English, and code-switching with 2.89% CER on Mandarin benchmarks; FireRedVAD for voice activity detection in 100+ languages with 97.57% F1; FireRedLID for spoken language identification; and FireRedPunc for punctuation prediction. The system includes both streaming and non-streaming inference modes and supports vLLM deployment for the LLM variant.