← all repositories
PatrickLib/captcha_recognize

Captcha solver skips the tedious slicing step

A TensorFlow pipeline that reads whole captcha images end-to-end instead of segmenting characters first.

570 stars Python Computer VisionML Frameworks
captcha_recognize
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

What it does Trains a neural network to recognize text in captcha images without chopping them into individual characters first. Feed it 128×48 pixel images named like LABEL_*.jpg, run through tfrecord conversion, training, and evaluation scripts. Supports single GPU or multi-GPU training.

The interesting bit The 99.7% accuracy claim comes from a specific generator (lepture/captcha) with 50k samples and 20k steps. Switch generators to Gregwar/CaptchaBundle and accuracy drops to 52.1% even with 100k samples and 200k steps — a useful reminder that “captcha” covers a lot of visual ground.

Key highlights

  • End-to-end recognition: no character segmentation preprocessing
  • Python 2.7 + TensorFlow 1.1 + Anaconda2 4.3.1 (pinned, dated stack)
  • Multi-GPU training script included
  • Eval script produces accuracy numbers; recognize script prints predicted strings
  • Works with custom datasets or default generated captchas

Caveats

  • Python 2.7 and TensorFlow 1.1 are firmly in legacy territory
  • Accuracy varies wildly by captcha generator; the 99.7% figure is not universal
  • README lacks detail on model architecture or why segmentation-free works here

Verdict Worth a look if you’re researching captcha-breaking techniques or need a baseline end-to-end OCR pipeline to adapt. Skip it if you need modern dependencies or production-ready code — this is a 2017-era research snapshot.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.