alephpi/Texo
A 20M-parameter open-source LaTeX OCR model using vision-encoder-decoder architecture that converts images of mathematical formulas into LaTeX code.

Texo is a minimalist deep learning model trained on consumer-grade GPUs that performs Optical Character Recognition specifically for mathematical formulas and LaTeX expressions. It employs a vision-encoder-decoder architecture built with PyTorch and Transformers, achieving state-of-the-art results while remaining lightweight. The repository includes the full training pipeline, model weights, and a web-based inference demo that runs entirely in the browser.