← all repositories

QwenLM/Qwen2-Audio

A large audio-language model from Alibaba's Qwen team capable of accepting speech and audio inputs for conversational interaction and audio analysis.

Qwen2-Audio
Velocity · 7d
+2.9
★ / day
Trend
steady
star history

Qwen2-Audio is a large-scale audio-language model that processes various audio signals and responds to speech instructions. It supports two interaction modes: free-form voice chat without text input, and audio analysis where users provide audio paired with text queries. The model is released in 7B parameter versions for both pretrained and instruction-tuned variants.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.