
superlinear-ai/raglite

A Python RAG toolkit backed by DuckDB or PostgreSQL, with support for multiple LLM providers, rerankers, and late chunking embeddings.

1.2k stars · Python · RAG · Search · LLMOps · Eval
Velocity (7d): +1.6 ★/day · Trend: steady

RAGLite provides a complete pipeline for retrieval-augmented generation workflows. It supports multiple LLM providers via LiteLLM, including local llama-cpp-python models. It stores and searches vectors using DuckDB or PostgreSQL with pgvector, and it can rerank search results with a reranker of your choice. The toolkit also includes PDF-to-markdown conversion, multi-vector late chunking for improved embeddings, and hardware acceleration via Metal on macOS and CUDA on Linux/Windows.
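To make the late chunking idea more concrete, here is a minimal sketch of its pooling step in plain Python. It is not RAGLite's API or implementation. It assumes that a long-context embedding model has already produced one contextual vector per token for the whole document, and that chunk boundaries are known as token index ranges. The names `mean_pool`, `late_chunk`, `token_embeddings`, and `spans` are hypothetical and exist only for illustration.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def mean_pool(vectors: Sequence[Vector]) -> Vector:
    """Average a non-empty sequence of equal-length vectors."""
    if not vectors:
        raise ValueError("cannot pool an empty span")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def late_chunk(
    token_embeddings: Sequence[Vector],
    spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Pool document-level token embeddings into one vector per chunk.

    Each span is a half-open (start, end) range of token indices.
    Because the token vectors were computed over the whole document
    (an assumption of this sketch), each pooled chunk vector can carry
    context from outside its own span.
    """
    return [mean_pool(token_embeddings[start:end]) for start, end in spans]


# Toy data standing in for real model output: 5 tokens, 2 dimensions.
token_embeddings = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0], [0.0, 6.0]]
spans = [(0, 2), (2, 5)]  # two chunks: tokens 0-1 and tokens 2-4

chunk_vectors = late_chunk(token_embeddings, spans)
print(chunk_vectors)  # [[2.0, 0.0], [0.0, 4.0]]
```

The difference from conventional chunking is the order of operations: here the document is embedded first and split afterwards, whereas embedding each chunk in isolation would discard the surrounding context before the model ever sees it.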
