← all repositories

MarkPDFdown/markpdfdown

A PDF-to-Markdown converter that uses multimodal LLM visual recognition to extract and format text, tables, formulas, and diagrams from PDF documents.

1.8k stars Python Data ToolingLanguage Models
markpdfdown
Velocity · 7d
+3.9
★ / day
Trend
steady
star history

MarkPDFDown transforms PDF documents into clean Markdown using multimodal AI models through LiteLLM. It leverages vision-capable LLMs to accurately extract text while preserving formatting including headings, lists, tables, and mathematical formulas. The tool supports both OpenAI and OpenRouter providers, offers CLI and pipe-based usage modes, and includes a desktop application for GUI-based conversions.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.