← all repositories

Dicklesworthstone/llm_aided_ocr

A system that improves scanned document OCR accuracy by applying LLM-based error correction and markdown formatting.

2.9k stars Python Domain AppsData Tooling
llm_aided_ocr
Velocity · 7d
+2.8
★ / day
Trend
steady
star history

This project enhances Tesseract OCR output by using large language models to correct errors, restructure text into clean markdown, and assess output quality. It converts scanned PDFs to images, runs OCR, then applies LLMs (local or API-based including OpenAI and Anthropic) for correction and formatting. The system supports both local GPU-accelerated LLMs and cloud providers, with smart chunking for efficient processing.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.