johanmodin/clifs
CLIFS is a proof-of-concept video search engine that uses OpenAI's CLIP model to find frames matching free-text queries.

Star velocity (7d): +0.3 ★/day · Trend: steady
The system extracts features from video frames using CLIP’s image encoder, then converts search queries to embeddings via CLIP’s text encoder. Matching is performed by computing cosine similarity between query and frame embeddings, returning top results above a configurable threshold. A Django web server provides a REST interface to the search engine, deployed via Docker containers with optional GPU support.
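
The matching step described above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: the function names `cosine_similarity` and `search_frames`, the frame identifiers, and the `threshold` and `top_k` defaults are all hypothetical, and short toy vectors stand in for real CLIP embeddings, which would come from the model's image and text encoders.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector is treated as matching nothing
    return dot / (norm_a * norm_b)


def search_frames(
    query_embedding: Sequence[float],
    frame_embeddings: Dict[str, Sequence[float]],
    threshold: float = 0.3,  # hypothetical default, not taken from CLIFS
    top_k: int = 5,          # hypothetical default, not taken from CLIFS
) -> List[Tuple[str, float]]:
    """Rank frames by similarity to the query, keeping those at or above the threshold."""
    scored = [
        (frame_id, cosine_similarity(query_embedding, embedding))
        for frame_id, embedding in frame_embeddings.items()
    ]
    hits = [(frame_id, score) for frame_id, score in scored if score >= threshold]
    hits.sort(key=lambda pair: pair[1], reverse=True)  # best match first
    return hits[:top_k]


# Toy 3-dimensional vectors standing in for CLIP frame embeddings.
frames = {
    "video1.mp4@00:03": [1.0, 0.0, 0.0],
    "video1.mp4@00:07": [0.6, 0.8, 0.0],
    "video2.mp4@01:12": [0.0, 0.0, 1.0],
}
query = [1.0, 0.0, 0.0]  # stand-in for an encoded text query

results = search_frames(query, frames, threshold=0.5, top_k=2)
for frame_id, score in results:
    print(f"{frame_id}: {score:.2f}")
```

Scoring every frame in a loop keeps the sketch dependency-free. With a real index of many frames, the same computation would more plausibly be done as a single matrix product over normalized embeddings.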