← all repositories

DocumindHQ/documind

Open-source platform for extracting structured data from PDFs using LLM-based document analysis.

1.5k stars JavaScript Data ToolingRAG · Search
documind
Velocity · 7d
+2.6
★ / day
Trend
steady
star history

Documind is a document processing tool that leverages AI to extract structured information from unstructured documents like PDFs. It converts documents to Markdown format and outputs structured JSON based on customizable schemas or auto-generated ones. The platform supports multiple LLMs including OpenAI, Llama3.2-vision, and LLaVA for the extraction pipeline.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.