Atome-FE/llama-node
A Node.js library for running large language model inference locally on CPU using Rust-based backends.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
llama-node provides a Node.js binding for running LLM inference using llama-rs, llama.cpp, and rwkv.cpp as backends. It supports models including llama, alpaca, gpt4all, vicuna, and rwkv, running entirely locally on laptop CPU without cloud dependencies. The project is built with N-API and Rust (NAPI-RS) for performance.