← all repositories
JaidedAI/EasyOCR

OCR that actually reads the room — 80+ languages, one import

A PyTorch-backed OCR library that treats Chinese street signs and Arabic documents as first-class citizens, not afterthoughts.

29.6k stars Python Computer Vision
EasyOCR
Velocity · 7d
+13
★ / day
Trend
steady
star history

What it does EasyOCR is a Python library that extracts text from images using a two-stage pipeline: CRAFT for detecting text regions, then a CRNN (ResNet/VGG + LSTM + CTC) for recognizing characters. It supports 80+ languages across Latin, Chinese, Arabic, Devanagari, Cyrillic, and other scripts. You initialize a Reader with your language codes, pass it an image file, URL, numpy array, or raw bytes, and get back bounding boxes with text and confidence scores.

The interesting bit The project is essentially a well-curated integration layer — it doesn’t invent new models, it makes existing research (CRAFT, deep-text-recognition-benchmark) accessible through a single pip install. The roadmap explicitly admits this: “we are not trying to be geniuses here.” The planned swappable detection/recognition architecture (detection='DB', recognition='Transformer') suggests they’re building plumbing, not monuments.

Key highlights

  • 80+ languages with mixed-script support; English pairs with anything, similar scripts usually pair with each other
  • Automatic model weight downloads on first use, with manual override via model hub
  • CPU fallback via gpu=False for memory-constrained or GPU-less environments
  • CLI interface included: easyocr -l ch_sim en -f image.jpg
  • Custom model training supported for both detection and recognition pipelines

Caveats

  • Model loading is slow; the README warns this “needs to run only once” because it takes noticeable time
  • Not all language combinations work together — compatibility depends on shared character sets
  • Issues older than 6 months get auto-closed due to “limited resources”
  • Handwritten text support is on the roadmap but not yet implemented

Verdict Grab this if you need multilingual OCR without wrestling with model training or CUDA configurations. Skip it if you need handwritten text recognition today, or if you want fine-grained control over the underlying detection/recognition architectures before the planned restructure lands.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.