Yes — salesforce/BLIP is open source, released under the BSD-3-Clause license.

What language is BLIP written in?

salesforce/BLIP is primarily written in Jupyter Notebook.

salesforce/BLIP has 5.7k stars on GitHub.

Where can I find BLIP?

salesforce/BLIP is on GitHub at https://github.com/salesforce/BLIP.

salesforce/BLIP

BLIP: unified vision-language understanding and generation, retired

A unified vision-language model that bootstraps its pre-training with filtered synthetic captions and handles both understanding and generation.

★5.7k stars Jupyter Notebook Language Models Image · Video · Audio

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does BLIP is a single transformer architecture that jointly tackles vision-language understanding—image-text retrieval, visual question answering, and natural-language visual reasoning—and generation tasks such as image captioning. This repository holds the original PyTorch research implementation, complete with pre-trained and fine-tuned checkpoints, bootstrapped pre-training datasets, and interactive demos via Colab, Hugging Face Spaces, and Replicate.

The interesting bit Instead of treating understanding and generation as separate problems, BLIP unifies them under one model and improves its training data by bootstrapping: the README lists filtered web captions alongside synthetic captions generated by its own ViT-B and ViT-L variants, effectively letting the model refine its own pre-training diet. It also supports zero-shot video-text retrieval, extending the same architecture beyond static images.

Key highlights

Explicitly deprecated and unsupported; Salesforce has merged BLIP into the maintained LAVIS library
One model handles both understanding and generation: retrieval, VQA, NLVR2, and image captioning
Ships with bootstrapped pre-training datasets including filtered web captions and synthetic captions derived from CC3M+CC12M+SBU and LAION115M
Pre-trained and fine-tuned checkpoints available for ViT-B and ViT-L backbones
Interactive demos run without a local GPU via Colab, Hugging Face Spaces, or Replicate

Caveats

The authors explicitly warn against using this repository for production or sensitive purposes
Active development has moved to LAVIS, leaving this repo as unsupported, frozen research code
Fine-tuning and pre-training configurations reference eight to sixteen A100 GPUs

Verdict Paper reproducers and vision-language researchers will find the original checkpoints and bootstrapped datasets valuable. Everyone else—especially anyone shipping production code—should head straight to LAVIS, as the authors recommend.

Frequently asked

What is salesforce/BLIP?: A unified vision-language model that bootstraps its pre-training with filtered synthetic captions and handles both understanding and generation.
Is BLIP open source?: Yes — salesforce/BLIP is open source, released under the BSD-3-Clause license.
What language is BLIP written in?: salesforce/BLIP is primarily written in Jupyter Notebook.
How popular is BLIP?: salesforce/BLIP has 5.7k stars on GitHub.
Where can I find BLIP?: salesforce/BLIP is on GitHub at https://github.com/salesforce/BLIP.