← all repositories

breezedeus/Pix2Text

Open-source Python tool using small ML models to recognize layouts, tables, math formulas, and text from images and output Markdown.

3.1k stars Jupyter Notebook Computer VisionData Tooling
Pix2Text
Velocity · 7d
+2.3
★ / day
Trend
steady
star history

Pix2Text is a document understanding pipeline that combines layout analysis, table detection, math formula recognition (LaTeX), and OCR into a unified image-to-markdown conversion tool. It uses small PyTorch models (MFD/MFR versions 1.5) to process documents and extract structured text across 80+ languages, positioning itself as a free alternative to Mathpix.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.