MoonshotAI/Kimi-Audio
An open-source audio foundation model for understanding, generating, and conversing with audio.

Velocity · 7d
+11
★ / day
Trend
→steady
star history
Kimi-Audio is a 7B parameter audio foundation model developed by MoonshotAI that supports audio understanding, generation, and conversational interactions. The repository includes inference code, pretrained and instruct model weights, fine-tuning examples, and a comprehensive evaluation toolkit for benchmarking audio models across speech recognition, audio understanding, and conversation tasks.