← all repositories

OpenBMB/VisRAG

A parsing-free RAG system that leverages vision-language models for visual document retrieval and multi-image reasoning.

962 stars Python RAG · SearchLanguage Models
VisRAG
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

VisRAG 2.0 is a retrieval-augmented generation system designed for visual documents that operates without traditional text parsing. It uses vision-language models to directly retrieve and reason over visual content including images and documents. The system includes specialized retrieval models (VisRAG-Ret) and generation models (EVisRAG), enabling evidence-guided multi-image reasoning for visual question answering and document understanding tasks.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.