← all repositories

upstash/wikipedia-semantic-search

A semantic search system indexing Wikipedia articles with vector embeddings and serving a RAG chatbot powered by Llama 3.

470 stars TypeScript RAG · SearchLLMOps · Eval
wikipedia-semantic-search
Velocity · 7d
+0.7
★ / day
Trend
steady
star history

The project implements a semantic search engine over Wikipedia by embedding articles using the BGE-M3 multilingual model and indexing 144+ million vectors in Upstash Vector database. It provides cross-lingual search capabilities and integrates with a RAG chatbot built on the Upstash RAG Chat SDK, which leverages Meta’s Llama 3 model to generate context-aware responses from retrieved Wikipedia content.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.