← all repositories
1Panel-dev/MaxKB

The 21k-star Chinese answer to enterprise RAG you probably missed

MaxKB wraps LangChain, Django, and pgvector into a one-click Docker platform for building knowledge-backed AI agents without writing glue code.

21.2k stars Python AgentsRAG · Search
MaxKB
Velocity · 7d
+21
★ / day
Trend
steady
star history

What it does

MaxKB (“Max Knowledge Brain”) is a self-hostable platform that bundles document ingestion, vector search, workflow orchestration, and multi-model chat into a single container. You upload documents or point it at URLs, it splits and vectorizes them via pgvector, then exposes a web UI where users query against grounded context. It also supports MCP tool-use and claims zero-coding integration into existing business systems.

The interesting bit

The project sits at a pragmatic intersection: it is essentially polished glue code around LangChain and Django, but that is exactly what most enterprises actually want. The README notes native multimodal I/O (text, image, audio, video) and explicit support for both local models (DeepSeek, Llama, Qwen) and APIs (OpenAI, Claude, Gemini), which suggests the authors understand the hybrid deployment reality inside Chinese and global orgs alike.

Key highlights

  • One-liner Docker start with a pre-baked admin account (username: admin, password: MaxKB@123.. — yes, with two dots)
  • Built-in RAG pipeline: direct upload, web crawling, auto-splitting, vectorization
  • Workflow engine with function library and MCP tool-use for multi-step agentic flows
  • Model-agnostic backends; topics tag DeepSeek-R1, Qwen3, Llama3, Ollama specifically
  • GPL v3 licensed, 21k+ GitHub stars, active Chinese and English documentation

Caveats

  • The README uses the word “seamless” unironically, which historically correlates with integration friction
  • Default credentials in quick-start docs are a footgun; rotate them immediately
  • Actual benchmark numbers or latency claims are absent — performance is unclear

Verdict

Worth a spin if you need a self-hosted, RAG-backed internal knowledge base or customer-service bot and would rather not wire LangChain yourself. Skip it if you need fine-grained control over retrieval algorithms or already run a custom pipeline you are happy with.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.