A Rust vector index that squeezes 31 GB of float32 embeddings into 4 GB without a training phase, then outruns FAISS on the query.
RAG · Search
heavyweights · gaining speedAgent Reach installs scrapers, MCP servers, and CLI tools so Claude Code or Cursor can browse Twitter, Reddit, Bilibili, and more without API fees.
PaddleOCR turns scans and PDFs into structured Markdown or JSON using a tiny vision-language model that punches above its weight class.
Supermemory is a hosted memory layer that lets LLMs remember facts, preferences, and context across conversations instead of starting from scratch every time.
Twenty-five bite-sized projects showing how to wire up LLMs, RAG, and agents into things that actually do work.
Open-source tool extracts structured data from PDFs and auto-tags them for accessibility, backed by benchmark claims and PDF Association collaboration.
DataTalksClub's open course teaches RAG, agents, and vector search by making you ship a working AI assistant in 10 weeks.
A curated hub of notebooks, apps, and guides that treat Oracle AI Database as the single backbone for vectors, graphs, memory, and SQL.
Honcho treats memory as structured reasoning over time, not just vector search with extra steps.
Mem0 gives AI assistants long-term recall with a single LLM call—no update/delete churn, no multi-step reasoning overhead.
EverOS gives agents long-term memory so they stop forgetting who you are between sessions.
A Zotero plugin that wraps search, translation, and AI-generated surveys into one workflow, backed by a 240-million-paper knowledge graph.
Yuxi is a full-stack platform for building knowledge-backed agents with LightRAG, LangGraph, and a Vue frontend—aimed at teams who need more than a chatbot wrapper.
Qdrant stores neural network outputs as searchable vectors and lets you filter them with SQL-like payload queries, bridging the gap between embedding models and production search.
An MCP server that turns your research database into something AI assistants can actually query, annotate, and argue with.
Self-hostable personal AI assistant bundling LLMs, RAG, browser automation, and MCP tools into one deployable executable.
A multi-agent pipeline that automates the full math-modeling competition workflow—from problem analysis to code execution to formatted paper generation.
A TypeScript monorepo that treats long-form fiction as a production pipeline, not a chat session.
A CLI and library that turns prompt evaluation and red-teaming into repeatable, automatable engineering instead of vibe-based guesswork.
A Python library that lets you point an LLM at a website and ask for what you want, instead of hand-crafting selectors.



