QwenLM/Qwen-Audio
Large audio language model that processes speech and other audio for chat and audio understanding tasks.

Velocity (7d): +2.0 ★/day · Trend: steady
Qwen-Audio is a pretrained large audio language model developed by Alibaba Cloud that extends LLM capabilities to audio understanding. It accepts a range of audio inputs, including speech, and handles tasks such as speech recognition and conversational interaction. The model is released in base and chat variants; the chat variant supports natural-language interaction with audio content.