← all repositories

serge-chat/serge

A self-hosted web chat interface for running language models locally using llama.cpp with FastAPI, LangChain, and Redis.

serge
Velocity · 7d
+4.9
★ / day
Trend
steady
star history

Serge is a fully dockerized chat web application for interacting with Alpaca and similar LLM models through llama.cpp. It provides a SvelteKit frontend with a FastAPI backend that uses LangChain to orchestrate calls to llama.cpp python bindings. Redis stores chat history and generation parameters. The system runs entirely locally with no API keys required.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.