OFA-Sys/Chinese-CLIP
A Chinese-language CLIP model enabling cross-modal retrieval and zero-shot image classification.

Velocity · 7d
+4.1
★ / day
Trend
→steady
star history
This project is a Chinese adaptation of OpenAI’s CLIP model, pre-trained on approximately 200 million Chinese image-text pairs. It provides APIs and training code for tasks including image-text similarity computation, cross-modal retrieval, and zero-shot classification in the Chinese language domain. The implementation builds on the open_clip project with optimizations for Chinese data and is implemented in PyTorch using transformer architectures.