Multiple approaches to automated CAPTCHA solving using computer vision and deep learning. Built as a research project exploring different ML strategies for visual recognition challenges.
Automated browser interaction with TensorFlow-based CAPTCHA detection:
captchacrack.py— Headless Chrome + TF frozen graph for real-time CAPTCHA solvingcaptchacrack2.py— Improved version with better image preprocessinganatomia-captcha.py— CAPTCHA anatomy analysis and segmentation
Custom Convolutional Neural Network for character-level CAPTCHA recognition:
captcha_model.py— CNN architecture definitioncaptcha_train.py— Training pipeline with multi-GPU supportcaptcha_eval.py— Model evaluationcaptcha_gen_default.py— Synthetic CAPTCHA generation for training datacaptcha_recognize.py— Inference script
Segment-then-classify approach:
extract_single_letters_from_captchas.py— Image segmentation to isolate individual charactershelpers.py— Preprocessing utilities (resize, threshold, contour detection)
┌─────────────┐ ┌──────────────────┐ ┌───────────────┐
│ CAPTCHA │────▶│ Preprocessing │────▶│ Recognition │
│ Image │ │ - Threshold │ │ - CNN model │
│ │ │ - Segmentation │ │ - TF detect │
└─────────────┘ │ - Contours │ │ - Letter OCR │
└──────────────────┘ └───────────────┘
tensorflow>=1.13
opencv-python
selenium
beautifulsoup4
pillow
numpy
Disclaimer: This project was built for educational and research purposes to understand ML-based visual recognition. Use responsibly.
2019–2020