AlibabaResearch/AdvancedLiterateMachinery
Alibaba Research project developing deep learning models and benchmarks for document understanding, OCR, and multimodal text reading tasks.

Advanced Literate Machinery (ALM) is an Alibaba Research project focused on teaching machines to read text from images and documents using deep learning and multimodal approaches. The project releases OCR models like Platypus (ECCV 2024) and benchmarks like CC-OCR for evaluating Large Multimodal Models on text reading tasks. It encompasses scene text detection, document parsing, multilingual OCR, and key information extraction capabilities.