modelscope/FunASR
An end-to-end speech recognition toolkit providing transcription, speaker diarization, and emotion detection across 50+ languages.

Velocity · 7d
+13
★ / day
Trend
→steady
star history
FunASR is an industrial-grade speech recognition toolkit built on PyTorch that provides automatic speech recognition (ASR), speaker diarization, voice activity detection, and emotion recognition. It supports streaming and batch processing with claimed 170x realtime performance and offers an OpenAI-compatible API for serving. The project includes integration with vLLM for inference and MCP server support for agent toolchains.