ggml-org/llama.vscode
A VS Code extension that brings local LLM-powered code completion, chat, and agentic coding to the editor.

This extension provides LLM-assisted code and text completion directly within VS Code, using models run locally through llama.cpp. It includes an integrated Llama Agent for autonomous coding tasks, supports MCP (Model Context Protocol) tools from installed servers, and offers model management with Hugging Face integration for downloading models. Other features include inline suggestions with keyboard shortcuts, configurable context windows, and environment-based model grouping.
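
To make the idea of a configurable context window concrete, here is a minimal sketch of how an inline-completion request could be assembled from the text around the cursor. It is not the extension's actual code: the function `buildInfillRequest`, its parameters, and the `InfillRequest` type are hypothetical names chosen for illustration. The field names `input_prefix`, `input_suffix`, and `n_predict` are assumed to match the fill-in-the-middle (`/infill`) request of the llama.cpp server.

```typescript
// Hypothetical request shape. Field names are assumed to follow the
// llama.cpp server's /infill (fill-in-the-middle) request body.
interface InfillRequest {
  input_prefix: string;
  input_suffix: string;
  n_predict: number;
}

// Split the document at the cursor and keep at most `maxPrefixLines` lines
// before it and `maxSuffixLines` lines after it. These two limits stand in
// for a "configurable context window"; the names are illustrative only.
function buildInfillRequest(
  text: string,
  cursorOffset: number,
  maxPrefixLines: number,
  maxSuffixLines: number,
  nPredict: number = 128
): InfillRequest {
  const before = text.slice(0, cursorOffset).split("\n");
  const after = text.slice(cursorOffset).split("\n");

  // Keep only the last `maxPrefixLines` lines of the prefix.
  const prefixStart = Math.max(0, before.length - maxPrefixLines);
  const prefix = before.slice(prefixStart).join("\n");

  // Keep only the first `maxSuffixLines` lines of the suffix.
  const suffix = after.slice(0, Math.max(0, maxSuffixLines)).join("\n");

  return { input_prefix: prefix, input_suffix: suffix, n_predict: nPredict };
}
```

In a setup like this, the resulting object would be serialized as JSON and sent to the locally running llama.cpp server, and the returned text would be offered as an inline suggestion. Trimming by lines rather than by tokens is a simplification made here to keep the sketch short.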