breezedeus/Pix2Text
Open-source Python tool using small ML models to recognize layouts, tables, math formulas, and text from images and output Markdown.

Velocity · 7d
+2.3
★ / day
Trend
→steady
star history
Pix2Text is a document understanding pipeline that combines layout analysis, table detection, math formula recognition (LaTeX), and OCR into a unified image-to-markdown conversion tool. It uses small PyTorch models (MFD/MFR versions 1.5) to process documents and extract structured text across 80+ languages, positioning itself as a free alternative to Mathpix.