
Filimoa/open-parse

A Python library that visually parses complex document layouts to produce semantically coherent chunks optimized for RAG systems.

3.2k stars · Python · Data Tooling · RAG · Search
Velocity (7d): +3.9 ★ / day · Trend: steady

Open-Parse is a document parsing library designed to handle complex files for AI applications. It uses visual analysis to identify document structure (headings, tables, images, and markdown), then groups related content into chunks that can be retrieved in RAG pipelines. It aims to be a practical middle ground between basic text splitting and heavyweight ML layout parsers.
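To make the idea of grouping related content concrete, here is a minimal Python sketch of structure-aware chunking. It is not Open-Parse's API or its actual algorithm: the `Element` and `Chunk` types and the `group_by_heading` function are hypothetical names invented for illustration, and the input assumes a layout step has already labeled each piece of the page.

```python
from dataclasses import dataclass, field


@dataclass
class Element:
    """A hypothetical layout element, already labeled by a prior parsing step."""
    kind: str  # "heading", "text", or "table"
    text: str


@dataclass
class Chunk:
    """A hypothetical chunk: a heading plus the content that follows it."""
    elements: list = field(default_factory=list)

    @property
    def text(self) -> str:
        return "\n".join(e.text for e in self.elements)


def group_by_heading(elements):
    """Start a new chunk at every heading; attach following content to it."""
    chunks = []
    current = None
    for el in elements:
        # A heading (or the very first element) opens a new chunk.
        if el.kind == "heading" or current is None:
            current = Chunk()
            chunks.append(current)
        current.elements.append(el)
    return chunks


elements = [
    Element("heading", "Revenue"),
    Element("text", "Revenue grew in Q3."),
    Element("table", "| Quarter | Revenue |"),
    Element("heading", "Risks"),
    Element("text", "Supply constraints remain."),
]

chunks = group_by_heading(elements)
for chunk in chunks:
    print(chunk.text)
    print("---")
```

Unlike a fixed-size text splitter, this keeps the table in the same chunk as the heading and paragraph that introduce it, which is the kind of coherence the description above refers to.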
