← all repositories

Media-Smart/vedastr

A PyTorch-based scene text recognition toolbox implementing CRNN, Transformer, and attention-based models for OCR tasks.

534 stars Python Computer VisionML Frameworks
vedastr
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

This repository provides a modular PyTorch framework for scene text recognition, supporting popular architectures including CRNN, TPS-ResNet-BiLSTM-Attention, and Transformer models. It includes training pipelines on MJSynth and SynthText datasets, with evaluation on standard benchmarks like IIIT5K, SVT, IC03, IC13, and IC15. The toolbox enables rapid implementation and customization of text recognition systems through a composable module design.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.