← all repositories
apocas/restai

Self-hosted AI platform that actually ships with a UI

RESTai wraps LLMs, RAG, agents, and even a visual Blockly IDE into a single deployable service with built-in analytics and RBAC.

restai
Velocity · 7d
+0.5
★ / day
Trend
steady
star history

What it does

RESTai is a Python-based “AI as a Service” platform you run yourself. It exposes projects—RAG pipelines, ReAct agents, direct LLM inference, even visual logic flows—as REST endpoints, and bundles a React admin dashboard for managing them. Supports essentially every LLM backend the README mentions: OpenAI, Anthropic, Ollama, vLLM, Gemini, LiteLLM, Azure, etc.

The interesting bit

The breadth is almost comical. Knowledge graphs with NER and entity disambiguation? Check. Headless browser agents with encrypted credential vaults and per-project domain allowlists? Check. A built-in MCP server so Claude Desktop can call your projects as tools? Also check. Plus a Blockly visual IDE for non-LLM data pipelines. Someone clearly got tired of stitching half a dozen tools together.

Key highlights

  • RAG with extras: multiple vector stores, ColBERT reranking, optional SQL-to-natural-language via MySQL/PostgreSQL, and knowledge graph layering with a force-directed visual explorer
  • Agentic browser: Playwright-driven headless Chromium in per-chat Docker containers, with credential injection that keeps secrets out of LLM context entirely
  • Built-in analytics: token tracking, cost and latency dashboards, per-project rate limiting—ops concerns usually left as “exercise for the reader”
  • Deployment options: PyPI package (restai-core), Docker multi-arch images, Kubernetes-ready, includes pre-built React frontend
  • Enterprise-ish features: RBAC, teams, OAuth/LDAP, TOTP 2FA, white-label branding, prompt versioning

Caveats

  • The README is long on feature lists and short on architecture or performance numbers; don’t expect benchmark comparisons
  • “Image/audio generation with dynamic loading generators” is claimed in the repo description but the README section appears truncated—unclear how mature this is
  • At 509 stars, it’s early enough that “production” claims deserve your own validation

Verdict

Worth a look if you need to stand up an internal AI platform fast and would rather not wire LangChain, a vector DB, auth, and a dashboard yourself. Skip it if you want a focused, single-purpose tool—this is a kitchen sink, not a scalpel.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.