Claude Artifacts, but you own the stack and the bill
A Next.js app that turns one prompt into a runnable mini-app using Llama 3.1 405B, because vendor lock-in is a choice.

What it does
Llama Coder takes a text prompt and generates a small, runnable application — think calculators, dashboards, mini-games — then renders it live in a browser sandbox. It’s a self-hosted clone of Anthropic’s Claude Artifacts, swapping Claude for Meta’s Llama 3.1 405B running on Together AI.
The interesting bit
The project isn’t novel AI research; it’s glue code with good taste. The real work is the integration: wiring a 405B parameter model through Together AI’s inference API, dropping the generated code into CodeSandbox’s Sandpack for instant execution, and wrapping it all in a Next.js app router setup. The boring part — observability via Helicone and analytics via Plausible — suggests someone actually plans to run this in production, not just demo it.
Key highlights
- Uses Llama 3.1 405B (Meta’s largest open weights model) via Together AI
- Sandpack provides the live code execution environment without self-hosting a sandbox
- Next.js app router + Tailwind for the UI layer
- Requires three external services to run: Together AI, CodeSandbox, and a PostgreSQL database (Neon suggested)
- ~6,900 GitHub stars, suggesting either genuine utility or good timing on the “open source Artifacts” narrative
Caveats
- README is thin: no architecture diagram, no explanation of prompt engineering or safety filtering on generated code
- Running locally requires three separate API keys/services; not a quick
docker run - No mention of rate limits, costs, or how 405B inference pricing compares to just using Claude
Verdict
Worth a look if you want to self-host an Artifacts-like experience or study a clean LLM-to-sandbox integration. Skip it if you just need the user-facing feature — Claude’s original is already there, and likely cheaper at low volume.