SciSharp/LLamaSharp
A C#/.NET library for efficient local LLM inference using llama.cpp with CPU/GPU acceleration.

Velocity (7d): +3.3 ★/day · Trend: steady
LLamaSharp is a cross-platform library for running LLaMA and LLaVA models on local devices, with efficient inference on both CPU and GPU backends. It provides higher-level APIs and built-in RAG support for deploying large language models in .NET applications. The library integrates with Microsoft Semantic Kernel and kernel-memory, and supports multiple backends, including CUDA and Vulkan.