serge-chat/serge
A self-hosted web chat interface for running language models locally using llama.cpp with FastAPI, LangChain, and Redis.

Velocity (7d): +4.9 ★/day · Trend: steady
Serge is a fully dockerized chat web application for interacting with Alpaca and similar LLMs through llama.cpp. It pairs a SvelteKit frontend with a FastAPI backend, which uses LangChain to orchestrate calls to the llama.cpp Python bindings. Redis stores chat history and generation parameters. Everything runs locally, so no API keys are required.
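
To make the storage role concrete, here is a minimal sketch of how chat history and generation parameters could be kept under per-chat keys. It is an illustration under stated assumptions, not Serge's actual schema: the key layout (`chat:<id>:params`, `chat:<id>:messages`), the `GenerationParams` fields, and every function name are hypothetical. `InMemoryStore` is a stand-in that mimics a small subset of Redis-style commands (`set`, `get`, `rpush`, `lrange`) so the example runs without a server.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class GenerationParams:
    # Hypothetical field names and defaults, chosen only for illustration.
    temperature: float = 0.1
    top_k: int = 50
    top_p: float = 0.95
    max_tokens: int = 256


class InMemoryStore:
    """Stand-in for a Redis client: string keys mapped to strings or lists."""

    def __init__(self):
        self._data = {}

    def set(self, key, value):
        self._data[key] = value

    def get(self, key):
        return self._data.get(key)

    def rpush(self, key, value):
        # Append to the list stored at key, creating it if needed.
        self._data.setdefault(key, []).append(value)

    def lrange(self, key, start, end):
        # Redis-style inclusive end; only -1 ("last element") and
        # non-negative indices are handled in this sketch.
        items = self._data.get(key, [])
        stop = len(items) if end == -1 else end + 1
        return items[start:stop]


def create_chat(store, chat_id, params):
    """Save the generation parameters for a new chat as a JSON string."""
    store.set(f"chat:{chat_id}:params", json.dumps(asdict(params)))


def append_message(store, chat_id, role, content):
    """Append one message to the chat's history list."""
    message = {"role": role, "content": content}
    store.rpush(f"chat:{chat_id}:messages", json.dumps(message))


def load_chat(store, chat_id):
    """Return (params, messages); unknown chats yield defaults and no messages."""
    raw = store.get(f"chat:{chat_id}:params")
    params = GenerationParams(**json.loads(raw)) if raw else GenerationParams()
    key = f"chat:{chat_id}:messages"
    messages = [json.loads(m) for m in store.lrange(key, 0, -1)]
    return params, messages
```

Keeping parameters and messages under separate keys lets a backend append to the history without rewriting the parameters. With a real Redis client the same three functions would need only the store object swapped, assuming the client exposes equivalent commands.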