QwenLM/Qwen2-Audio
A large audio-language model from Alibaba's Qwen team capable of accepting speech and audio inputs for conversational interaction and audio analysis.

Velocity · 7d
+2.9
★ / day
Trend
→steady
star history
Qwen2-Audio is a large-scale audio-language model that processes various audio signals and responds to speech instructions. It supports two interaction modes: free-form voice chat without text input, and audio analysis where users provide audio paired with text queries. The model is released in 7B parameter versions for both pretrained and instruction-tuned variants.