← all repositories

microsoft/table-transformer

A deep learning model for extracting tables from PDFs and images using object detection.

2.9k stars Python Computer VisionData Tooling
table-transformer
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

Table Transformer (TATR) is a deep learning-based object detection model for extracting tables from unstructured documents. It uses a DETR-style architecture to detect and parse table structures from images and PDFs. The repository includes the PubTables-1M dataset with over 1 million tables, the GriTS evaluation metric for table structure recognition, and pre-trained models for table detection, structure recognition, and functional analysis.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.