Your data analyst that actually runs the code it writes
An open-source agent that connects to databases, writes SQL and Python, executes it in a sandbox, and turns results into charts and reports.

What it does
DB-GPT is an agentic AI assistant for data work. You point it at databases, CSVs, or knowledge bases, ask questions in plain English, and it writes SQL or Python, executes the code in a sandboxed environment, and returns charts, dashboards, or HTML reports. It also packages domain workflows into reusable “skills” that teams can share.
The interesting bit
The project treats data analysis as an agentic workflow rather than a single prompt: the system plans tasks, breaks them into steps, calls tools, and iterates. Execution happens in isolated sandboxes, which matters when you’re letting an LLM generate and run arbitrary SQL and Python against live data sources.
Key highlights
- One-line installer via curl for macOS/Linux, plus PyPI package (
dbgpt-app) with an interactive first-run wizard - Supports OpenAI, DeepSeek, Kimi/Moonshot, MiniMax, and many local models via vLLM or llama.cpp
- Multi-source: relational databases, CSV/Excel, documents, and knowledge bases in one workspace
- Reusable “skills” system for packaging domain-specific analysis workflows
- Sandboxed code execution with controlled tool use and reproducible artifacts
- Text2SQL fine-tuning support for a dozen LLM families (via separate DB-GPT-Hub project)
Caveats
- The README is enthusiastic about “next-generation AI + Data products” but light on concrete architecture details or performance benchmarks
- Default install pulls in a broad stack (FastAPI, ChromaDB, RAG parsing, multiple LLM providers) — resource footprint is unclear
- Windows support exists but the quick-start installer is macOS/Linux only
Verdict
Worth a look if you want a self-hostable alternative to ChatGPT’s Code Interpreter that can persistently connect to your actual databases and data warehouses. Probably overkill if you just need occasional SQL help; the sweet spot is teams building repeatable analysis workflows with safety guardrails.