Is fashion-clip open source?

Yes — patrickjohncyh/fashion-clip is open source, released under the MIT license.

What language is fashion-clip written in?

patrickjohncyh/fashion-clip is primarily written in Python.

How popular is fashion-clip?

patrickjohncyh/fashion-clip has 529 stars on GitHub.

Where can I find fashion-clip?

patrickjohncyh/fashion-clip is on GitHub at https://github.com/patrickjohncyh/fashion-clip.

← all repositories

patrickjohncyh/fashion-clip

CLIP gets a wardrobe upgrade

FashionCLIP fine-tunes CLIP on hundreds of thousands of fashion image-text pairs so your zero-shot search can finally tell a bomber jacket from a blazer.

★529 stars Python Language Models Image · Video · Audio Domain Apps

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does

FashionCLIP is a domain-specific adaptation of CLIP, fine-tuned on over 700,000 image-text pairs from the Farfetch catalog (pending official release). It generates aligned image and text embeddings tuned for apparel concepts—think silhouettes, textures, and garment categories—and applies them to zero-shot classification, multi-modal retrieval, and fashion parsing. The project provides a Python API built atop Hugging Face transformers, with helper classes for catalog management and approximate nearest-neighbor search.

The interesting bit

The team recently replaced the original OpenAI CLIP backbone with LAION’s CLIP-ViT-B-32 checkpoint—trained on roughly five times the data—and re-fine-tuned to create FashionCLIP 2.0. The architecture stayed identical, yet weighted macro F1 scores jumped across all three fashion benchmarks, a pragmatic case of upgrading the foundation and running the same playbook twice.

Key highlights

Fine-tuned on 700K+ fashion image-text pairs; the underlying Farfetch dataset is awaiting official release.
Weights hosted on Hugging Face; supports private repositories via authentication tokens.
FCLIPDataset and FashionCLIP classes handle catalog ingestion, embedding generation, and approximate nearest-neighbor retrieval.
Outperforms both OpenAI CLIP and the LAION base model on FMNIST, KAGL, and DEEP benchmarks according to the project’s reported weighted macro F1 scores.
Accompanied by a Nature Scientific Reports paper that discloses training costs and environmental impact.

Caveats

Pre-computed vectors and full training reproducibility are on hold until the Farfetch dataset is officially released.
The repository overview labels the Streamlit demo as “coming soon,” even though an external badge links to a Hugging Face space, so the demo status is unclear.
The library is largely a convenience wrapper and dataset handler around Hugging Face CLIP; the heavy embedding work is delegated to the underlying transformer.

Verdict

Worth exploring if you are building ecommerce search, automated product tagging, or content moderation for apparel and need zero-shot capability without training a vision-language model from scratch. For general vision tasks outside fashion, the underlying LAION or OpenAI CLIP checkpoints are likely sufficient.

Frequently asked

What is patrickjohncyh/fashion-clip?: FashionCLIP fine-tunes CLIP on hundreds of thousands of fashion image-text pairs so your zero-shot search can finally tell a bomber jacket from a blazer.
Is fashion-clip open source?: Yes — patrickjohncyh/fashion-clip is open source, released under the MIT license.
What language is fashion-clip written in?: patrickjohncyh/fashion-clip is primarily written in Python.
How popular is fashion-clip?: patrickjohncyh/fashion-clip has 529 stars on GitHub.
Where can I find fashion-clip?: patrickjohncyh/fashion-clip is on GitHub at https://github.com/patrickjohncyh/fashion-clip.