DocumindHQ/documind
Open-source platform for extracting structured data from PDFs using LLM-based document analysis.

Velocity (7d): +2.6 ★/day · Trend: steady
Documind is a document processing tool that uses AI to extract structured information from unstructured documents such as PDFs. It converts each document to Markdown, then outputs structured JSON that follows either a user-defined schema or an auto-generated one. The extraction pipeline supports multiple LLM backends, including OpenAI models, Llama3.2-vision, and LLaVA.
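To make the schema-driven step concrete, here is a minimal Python sketch of the general idea: describe the fields you want, build an extraction prompt from the Markdown text, and check the model's JSON reply against the schema. This is not Documind's actual API. The schema layout, `build_prompt`, and `validate` are hypothetical names chosen for illustration, and the LLM call itself is left out.

```python
import json

# Hypothetical schema layout (an assumption, not Documind's documented format):
# one entry per field to extract.
SCHEMA = [
    {"name": "invoice_number", "type": "string", "description": "Invoice identifier"},
    {"name": "total", "type": "number", "description": "Total amount due"},
    {"name": "paid", "type": "boolean", "description": "Whether the invoice is paid"},
]

# Maps schema type names to the Python types json.loads produces.
PY_TYPES = {"string": str, "number": (int, float), "boolean": bool}


def build_prompt(markdown: str, schema: list) -> str:
    """Build an extraction prompt from Markdown text and a field schema."""
    fields = "\n".join(
        f'- {f["name"]} ({f["type"]}): {f["description"]}' for f in schema
    )
    return (
        "Extract the following fields from the document and reply with JSON only.\n"
        f"Fields:\n{fields}\n\nDocument:\n{markdown}"
    )


def validate(raw_json: str, schema: list) -> dict:
    """Parse a model reply and keep only schema fields with the expected types.

    Missing fields become None; fields not in the schema are dropped.
    """
    data = json.loads(raw_json)
    if not isinstance(data, dict):
        raise TypeError("expected a JSON object")
    result = {}
    for field in schema:
        name, kind = field["name"], field["type"]
        value = data.get(name)
        if value is not None:
            # bool is a subclass of int in Python, so reject it for non-boolean fields.
            wrong_bool = isinstance(value, bool) and kind != "boolean"
            if wrong_bool or not isinstance(value, PY_TYPES[kind]):
                raise TypeError(f"field {name!r} is not of type {kind}")
        result[name] = value
    return result
```

In use, the string returned by `build_prompt` would be sent to whichever model backend is configured, and the reply passed through `validate` before being stored. Validating after the fact is a deliberate choice in this sketch: model output is not guaranteed to match the requested shape, so type checks catch malformed replies early.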