← all repositories

KolosalAI/Kolosal

Local LLM inference and training application that runs 100% offline on edge devices with minimal resource requirements.

Kolosal
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

Kolosal AI is an open-source desktop application for running large language models entirely offline on personal devices. Built on the Genta Personal Engine (leveraging llama.cpp), it supports any CPU with AVX2 instructions and AMD/NVIDIA GPUs. The application compiles to approximately 20 MB and targets edge devices like Raspberry Pi, enabling on-premise AI solutions without cloud dependencies. It supports popular open models including Mistral, LLaMA, Qwen, DeepSeek, and Phi variants, and includes capabilities for custom dataset generation and model training alongside inference.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.