← all repositories

mlfoundations/open_clip

Open-source PyTorch implementation of CLIP, a contrastive vision-language foundation model enabling zero-shot image classification and image-text matching.

open_clip
Velocity · 7d
+7.8
★ / day
Trend
steady
star history

OpenCLIP provides an open-source implementation of CLIP (Contrastive Language-Image Pre-training), a multi-modal model that learns to associate images with natural language descriptions through contrastive learning. The library includes training infrastructure with FSDP2 support, NaFlex image pipelines, CLAP audio model integration, and torch.compile strategies. It offers pretrained image/text models for inference and supports zero-shot classification tasks.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.