A self-hosted AI workspace that bolts chat, agents, email triage, calendars, and deep research onto your own hardware.
RAG · Search
heavyweights · velocity + momentumA Rust vector index that squeezes 31 GB of float32 embeddings into 4 GB without a training phase, then outruns FAISS on the query.
Graphify turns any project folder into a queryable knowledge graph that plugs into Claude Code, Cursor, Codex, and a dozen other AI assistants.
This plugin turns sprawling repos into explorable knowledge graphs inside your AI coding tool of choice.
Agent Reach installs scrapers, MCP servers, and CLI tools so Claude Code or Cursor can browse Twitter, Reddit, Bilibili, and more without API fees.
PaddleOCR turns scans and PDFs into structured Markdown or JSON using a tiny vision-language model that punches above its weight class.
A pluggable, benchmarked memory layer for LLMs that stores conversation history verbatim and retrieves it without API calls.
Supermemory is a hosted memory layer that lets LLMs remember facts, preferences, and context across conversations instead of starting from scratch every time.
A personal knowledge base that actually answers questions instead of dumping search results on you.
Clone a travel agent, earnings-call analyst, or multi-agent team in three commands instead of rebuilding from scratch.
Open-source tool extracts structured data from PDFs and auto-tags them for accessibility, backed by benchmark claims and PDF Association collaboration.
A plugin that records what Claude Code (and friends) actually did, compresses it, and feeds relevant history back into future sessions so you don't keep re-explaining your codebase.
Ruflo turns Claude Code from a solo assistant into a self-organizing, cross-machine agent swarm with memory that persists across sessions.
A single web UI that swallows Ollama, OpenAI APIs, RAG, image generation, and enterprise auth into one Docker container.
Dify bundles workflows, RAG, agents, and observability into one visual IDE for teams that want to ship without wiring infrastructure by hand.
Twenty-five bite-sized projects showing how to wire up LLMs, RAG, and agents into things that actually do work.
A seven-layer local memory stack that makes Hermes Agent actually remember your projects, decisions, and reasoning across sessions.
A desktop app that turns Andrej Karpathy's LLM wiki pattern into a persistent, self-organizing knowledge base with graph analysis and a two-step ingest pipeline.
DataTalksClub's open course teaches RAG, agents, and vector search by making you ship a working AI assistant in 10 weeks.
Mem0 gives AI assistants long-term recall with a single LLM call—no update/delete churn, no multi-step reasoning overhead.


