← all repositories

weiwill88/Local_Pdf_Chat_RAG

A pure-Python RAG framework combining FAISS semantic search with BM25 keyword retrieval for document-based question answering.

Local_Pdf_Chat_RAG
Velocity · 7d
+1.9
★ / day
Trend
steady
star history

This project implements a complete retrieval-augmented generation pipeline in native Python for educational purposes. It handles document loading and chunking from multiple formats (PDF, DOCX, TXT, etc.), stores embeddings in FAISS for vector search, and combines results with BM25 keyword retrieval. The system supports local models via Ollama or cloud inference via SiliconFlow API, includes cross-encoder reranking for result quality, and optionally integrates web search via SerpAPI for real-time information augmentation.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.