deepseek-ai/DeepSeek-VL2
DeepSeek-VL2 is a mixture-of-experts vision-language model designed for advanced multimodal understanding across text and images.

Velocity (7d): +9.8 ★/day · Trend: steady
DeepSeek-VL2 processes images and text together. Its mixture-of-experts architecture activates only a subset of expert sub-networks for each input rather than the full parameter set, which keeps vision-language inference efficient. The repository provides model weights, inference code, and Hugging Face integration for deployment.
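To make the mixture-of-experts idea concrete, here is a minimal sketch of generic top-k expert routing in plain Python. It is an illustration only and is not taken from the DeepSeek-VL2 code. The names `route_top_k` and `moe_forward`, the toy scalar experts, and the gate scores are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k):
    """Pick the k highest-scoring experts.

    Returns a list of (expert_index, weight) pairs, with weights
    renormalised so that they sum to 1 over the selected experts.
    """
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, gate_scores, experts, k=2):
    """Evaluate only the selected experts and mix their outputs."""
    out = 0.0
    for idx, weight in route_top_k(gate_scores, k):
        out += weight * experts[idx](x)
    return out


# Hypothetical toy setup: four scalar "experts" and fixed gate scores.
experts = [lambda x: x + 1, lambda x: 2 * x, lambda x: x * x, lambda x: -x]
gate_scores = [0.1, 2.0, 1.5, -1.0]

selected = route_top_k(gate_scores, k=2)
y = moe_forward(3.0, gate_scores, experts, k=2)
print(selected, y)
```

Only two of the four experts run for this input, which is the source of the efficiency gain: compute scales with `k`, not with the total number of experts. In a real model the gate scores would come from a learned layer applied to each token, which this sketch assumes away.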