← all repositories

SciSharp/LLamaSharp

A C#/.NET library for efficient local LLM inference using llama.cpp with CPU/GPU acceleration.

LLamaSharp
Velocity · 7d
+3.3
★ / day
Trend
steady
star history

LLamaSharp is a cross-platform library that enables running LLaMA and LLaVA models on local devices with efficient inference on both CPU and GPU backends. It provides higher-level APIs and built-in RAG support for deploying large language models in .NET applications. The library integrates with Microsoft Semantic Kernel and kernel-memory, and supports multiple backends including CUDA and Vulkan.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.