RAG · Search

big names · picking up speed

+888 ★/day↗accelerating

It builds a persistent structural map of your codebase so AI assistants read only the files in a change's blast radius instead of the whole repo.

★ 26.4k Python Coding Assistants · explained

topoteretes/cognee

+160 ★/day↗accelerating

Cognee gives AI agents persistent memory by treating documents as a living knowledge graph rather than a vector dump.

★ 29.3k Python Agents · explained

getzep/graphiti

+73 ★/day↗accelerating

Graphiti continuously ingests structured and unstructured data into temporal context graphs so agents can query what is true now—or was true at any specific moment—without rebuilding the graph from scratch.

★ 29.2k Python RAG · Search · explained

HKUDS/LightRAG

+69 ★/day↗accelerating

LightRAG exists because pure vector search often misses the big picture; it auto-maps entities and relationships into a knowledge graph for richer retrieval.

★ 38.1k Python RAG · Search · explained

VectifyAI/PageIndex

+67 ★/day↗accelerating

PageIndex replaces vector similarity search with hierarchical tree reasoning to retrieve complex documents more accurately.

★ 34.6k Python RAG · Search · explained

langgenius/dify

+140 ★/day↗accelerating

Dify exists because stitching together LLM workflows, RAG, agents, and observability by hand is a full-time job.

★ 150.2k TypeScript Agents · explained

NirDiamant/RAG_Techniques

+41 ★/day↗accelerating

It catalogs 42 advanced RAG techniques as runnable notebooks so developers can compare chunking, reranking, and graph retrieval strategies without wiring up every pipeline from scratch.

★ 28.8k Jupyter Notebook RAG · Search · explained

microsoft/graphrag

+49 ★/day↗accelerating

GraphRAG exists to give LLMs a structured memory layer for reasoning over messy, private narrative text.

★ 34.8k Python RAG · Search · explained

QuivrHQ/quivr

+25 ★/day↗accelerating

Quivr wraps retrieval-augmented generation in a Python library with YAML-configurable workflows, letting you swap LLMs and vector stores without rewriting plumbing.

★ 39.4k Python RAG · Search · explained

infiniflow/ragflow

+92 ★/day↗accelerating

RAGFlow fuses deep document parsing with agentic workflows so LLMs can answer from messy corporate documents without making things up.

★ 86k Go RAG · Search · explained

mindsdb/mindshub

+18 ★/day↗accelerating

Minds Platform exists so teams can self-host autonomous AI agents and semantic search without shipping data to a managed cloud.

★ 39.5k Makefile Agents · explained

ItzCrazyKns/Vane

+25 ★/day↗accelerating

A privacy-focused AI answering engine that runs entirely on your own hardware so your searches never leave the building.

★ 35.9k TypeScript RAG · Search · explained

langflow-ai/langflow

+54 ★/day↗accelerating

Langflow is a visual schematic editor for AI workflows that deploys your graphs as APIs or MCP servers, saving you from writing yet another Python script to chain LLM calls together.

★ 152.4k Python Agents · explained

onyx-dot-app/onyx

+30 ★/day↗accelerating

Onyx bundles chat, RAG, agents, and sandboxed code into a single self-hosted interface so teams can use any LLM without shipping data to a SaaS black box.

★ 31.2k Python RAG · Search · explained

MemPalace/mempalace

+41 ★/day↗accelerating

MemPalace is a local-first AI memory system that stores conversations verbatim and retrieves them with semantic search backed by published, reproducible benchmarks.

★ 57.7k Python RAG · Search · explained

huggingface/transformers

+38 ★/day↗accelerating

It centralizes model definitions so the same architecture works across PyTorch, JAX, vLLM, and llama.cpp without rewrites.

★ 163k Python Language Models · explained

stanford-oval/storm

+25 ★/day↗accelerating

STORM simulates expert research conversations so LLMs can write long, cited articles from scratch.

★ 30.3k Python Agents · explained

open-webui/open-webui

+120 ★/day↗accelerating

Open WebUI wraps every LLM backend, RAG pipeline, and enterprise auth scheme into a single self-hosted interface.

★ 146.7k Python Chat Assistants · explained

mem0ai/mem0

+79 ★/day↗accelerating

Mem0 gives AI agents a persistent memory layer that accumulates user context across sessions instead of overwriting it every time.

★ 61.7k TypeScript Agents · explained

milvus-io/milvus

+16 ★/day↗accelerating

Milvus is a Go/C++ vector database built for billion-scale ANN search with Kubernetes-native horizontal scaling.

★ 45.4k Go RAG · Search · explained
