RightNow-AI/picolm
A C-based LLM inference engine that runs 1B-parameter models on $10 hardware with 256MB RAM.

Velocity · 7d
+15
★ / day
Trend
→steady
star history
PicoLM is an ultra-lightweight inference runtime written in pure C with zero dependencies, enabling language model inference on embedded devices like the LicheeRV Nano. It supports GGUF model files and quantization to achieve a ~80KB binary size and ~45MB runtime RAM footprint. The project serves as the local brain for the PicoClaw AI assistant framework, forming a fully offline AI agent solution without requiring cloud APIs or internet connectivity.