MoonshotAI/Kimi-Audio

An open-source audio foundation model for understanding, generating, and conversing with audio.

4.6k stars Python Image · Video · Audio
Velocity (7d): +11 ★ / day · Trend: steady

Kimi-Audio is a 7B-parameter audio foundation model developed by MoonshotAI that supports audio understanding, generation, and conversation. The repository includes inference code, pretrained and instruct model weights, fine-tuning examples, and an evaluation toolkit for benchmarking audio models on speech recognition, audio understanding, and conversation tasks.
