jonfairbanks/local-rag
A self-hosted retrieval-augmented generation (RAG) application that ingests local files, GitHub repositories, and websites for use with local Ollama models.

Local RAG provides a browser-based interface for ingesting documents from local files, GitHub repositories, and websites into a vector store. Chat models are served by Ollama, and embeddings come from either HuggingFace or Ollama models. Both run locally, so all data and model inference stay on-premises. The pipeline streams responses via LlamaIndex and preserves chat history and source citations, without sending sensitive data to third parties.
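
To make the ingest, retrieve, and cite flow described above concrete, here is a minimal sketch in plain Python. It is not the project's code and does not use LlamaIndex or Ollama: the bag-of-words `embed` function stands in for a real embedding model, and `ToyVectorStore`, `build_prompt`, and the sample file names are all hypothetical names chosen for illustration.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding: word counts (an assumed stand-in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToyVectorStore:
    """Hypothetical in-memory store: keeps (source, text, vector) per ingested chunk."""

    def __init__(self):
        self.entries = []

    def ingest(self, source, text):
        self.entries.append((source, text, embed(text)))

    def retrieve(self, query, top_k=2):
        """Return up to top_k (source, text) pairs with non-zero similarity, best first."""
        query_vec = embed(query)
        scored = [(cosine(query_vec, vec), source, text)
                  for source, text, vec in self.entries]
        scored.sort(key=lambda item: item[0], reverse=True)
        return [(source, text) for score, source, text in scored[:top_k] if score > 0]


def build_prompt(query, hits, history):
    """Assemble a prompt that carries chat history and source-tagged context."""
    lines = ["Chat history:"]
    lines += [f"{role}: {message}" for role, message in history]
    lines.append("Context:")
    lines += [f"[{source}] {text}" for source, text in hits]
    lines.append(f"Question: {query}")
    return "\n".join(lines)


store = ToyVectorStore()
store.ingest("notes.txt", "ollama serves chat models locally")
store.ingest("README.md", "the vector store holds document embeddings")
store.ingest("site/index.html", "the browser interface shows chat history")

history = [("user", "hello"), ("assistant", "hi, ask me about your documents")]
query = "which models does ollama serve"
hits = store.retrieve(query)
prompt = build_prompt(query, hits, history)
print(prompt)
```

In a real deployment the prompt would be sent to a locally served chat model and the reply streamed back to the browser. The `[source]` tags in the context are what allow an answer to cite the file, repository, or page it drew from.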