Media-Smart/vedastr
A PyTorch-based scene text recognition toolbox implementing CRNN, Transformer, and attention-based models for OCR tasks.

This repository provides a modular PyTorch framework for scene text recognition, supporting popular architectures including CRNN, TPS-ResNet-BiLSTM-Attention, and Transformer models. It includes training pipelines on MJSynth and SynthText datasets, with evaluation on standard benchmarks like IIIT5K, SVT, IC03, IC13, and IC15. The toolbox enables rapid implementation and customization of text recognition systems through a composable module design.