
LinXueyuanStdio/LaTeX_OCR_PRO

A deep learning system that recognizes mathematical formulas from images and converts them to LaTeX code.

1.3k stars · Jupyter Notebook · Computer Vision
Velocity (7d): +0.5 ★/day · Trend: steady

This project implements an OCR system for mathematical formulas, using a Seq2Seq architecture with an attention mechanism and beam search decoding. CNN layers extract visual features from the input image, and LSTM/GRU layers then decode those features into LaTeX code. It supports handwritten, printed, and Chinese-mixed mathematical formulas. The system is built on TensorFlow and uses LaTeX abstract syntax tree structures to represent its output.
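To make the beam search step concrete, here is a minimal sketch in plain Python. It is not the repository's code: `beam_search`, `toy_step`, and the probability table are hypothetical names and made-up values used only for illustration. In a real system the step function would presumably be one step of the attention decoder, returning next-token probabilities conditioned on the image features.

```python
import math
from typing import Callable, Dict, List, Tuple

Hypothesis = Tuple[List[str], float]  # (tokens so far, summed log-probability)


def beam_search(
    step_fn: Callable[[List[str]], Dict[str, float]],
    start_token: str,
    end_token: str,
    beam_width: int = 3,
    max_len: int = 20,
) -> Hypothesis:
    """Return the highest-scoring sequence found with a fixed-width beam.

    step_fn is a hypothetical stand-in for a decoder step: given the tokens
    emitted so far, it returns a dict of next-token probabilities.
    """
    beams: List[Hypothesis] = [([start_token], 0.0)]
    finished: List[Hypothesis] = []

    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates: List[Hypothesis] = []
        for tokens, score in beams:
            for tok, prob in step_fn(tokens).items():
                if prob > 0.0:
                    candidates.append((tokens + [tok], score + math.log(prob)))
        if not candidates:
            break

        # Keep only the beam_width best expansions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_width]:
            if tokens[-1] == end_token:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break

    # Fall back to unfinished hypotheses if nothing reached end_token.
    pool = finished or beams
    return max(pool, key=lambda c: c[1])


# Made-up next-token probabilities, keyed by the prefix emitted so far.
TOY_TABLE: Dict[Tuple[str, ...], Dict[str, float]] = {
    ("<s>",): {"x": 0.6, "y": 0.4},
    ("<s>", "x"): {"^2": 0.5, "_1": 0.5},
    ("<s>", "y"): {"^2": 0.9, "_1": 0.1},
}


def toy_step(tokens: List[str]) -> Dict[str, float]:
    # Any prefix not in the table simply ends the sequence.
    return TOY_TABLE.get(tuple(tokens), {"</s>": 1.0})


if __name__ == "__main__":
    print(beam_search(toy_step, "<s>", "</s>", beam_width=2))
    print(beam_search(toy_step, "<s>", "</s>", beam_width=1))
```

With `beam_width=1` the search is greedy: it commits to `x` (probability 0.6) and ends with a sequence of total probability 0.3. With `beam_width=2` it keeps `y` alive long enough to find `y ^2`, whose total probability is 0.36. This sketch ranks hypotheses by raw summed log-probability, which favours shorter outputs; practical decoders often add length normalisation.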
