cocktailpeanut/dalai
A tool for running quantized LLaMA and Alpaca models locally on your computer.

Velocity · 7d
+11
★ / day
Trend
→steady
star history
Dalai provides a simple way to run LLaMA and Alpaca language models on local machines. It is powered by llama.cpp and alpaca.cpp for model quantization and efficient inference. The project includes a hackable web application, a JavaScript API, and a Socket.io API for integration. It supports multiple model sizes from 7B to 65B parameters across Linux, Mac, and Windows operating systems.