infiniflow/infinity
A high-performance vector and full-text database engineered for LLM applications, supporting hybrid search across dense embeddings, sparse embeddings, tensors, and text.

Infinity is an AI-native database purpose-built for RAG and LLM applications. It provides hybrid search capabilities combining dense vector retrieval (HNSW), sparse embeddings (BM25-style), tensor/multi-vector search, and full-text search in a single system. The database targets low-latency retrieval at scale with claimed sub-millisecond query latency on million-scale datasets. It serves as the vector storage and retrieval backbone for question-answering, conversational AI, copilot, and content generation pipelines.