← all repositories
ItsPi3141/alpaca-electron

ChatGPT's UI, running on your CPU, in your basement

An Electron wrapper that turns llama.cpp into a point-and-click desktop app for local Alpaca models.

1.3k stars JavaScript Chat AssistantsInference · Serving
alpaca-electron
Velocity · 7d
+1.1
★ / day
Trend
steady
star history

What it does

Alpaca Electron is a desktop chat app that wraps llama.cpp in an Electron shell. You download a quantized model, point the app at the file path, and start typing. It runs entirely on CPU—no GPU required, no cloud calls, no terminal commands.

The interesting bit

The project doesn’t hide its inspiration: the UI is deliberately “borrowed” from a well-known chat interface (the README includes a :trollface: emoji). The actual work is in the packaging—prebuilt llama.cpp binaries, an installer, and enough path-handling glue that non-technical users can avoid the command line entirely.

Key highlights

  • Ships with llama.cpp backend; supports Alpaca, Vicuna, and other 4-bit quantized .bin models
  • CPU-only inference; AVX2 preferred, falls back to slower AVX on older chips
  • One installer, no external dependencies claimed
  • Docker Compose setup included for containerized running
  • Context memory implemented; chat history, web search, and Stable Diffusion integration are on the to-do list

Caveats

  • Windows only for now. The README states plainly that “the new llama.cpp binaries that support GGUF have not been built for other platforms yet.” MacOS and Linux builds exist but are untested or community-provided.
  • No GPU acceleration yet (cuBLAS/OpenBLAS are planned, not shipped).
  • Model download links are deliberately not provided; you’re on your own for sourcing quantized weights.

Verdict

Worth a look if you want the simplest possible on-ramp to local LLM inference and don’t mind the Windows-first reality. Skip it if you need GPU speed, cross-platform reliability, or a polished feature set—this is explicitly a wrapper, not a research tool.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.