deepseek-ai/DeepSeek-VL
DeepSeek-VL is a vision-language foundation model for real-world multimodal comprehension.

Velocity · 7d
+5.0
★ / day
Trend
→steady
star history
DeepSeek-VL is a vision-language model that combines visual and language understanding capabilities. The repository provides model weights, training code, and inference tools for the VL model. It enables tasks like visual question answering and image-text comprehension across real-world scenarios.