# sml2h3/ddddocr

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/sml2h3-ddddocr).**

14,339 stars · 2,296 forks · Python · MIT

## Links

- GitHub: https://github.com/sml2h3/ddddocr
- Homepage: https://ddddocr.com
- awesome-repositories: https://awesome-repositories.com/repository/sml2h3-ddddocr.md

## Topics

`captcha` `ddddocr` `ocr`

## Description

ddddocr is a Python library for recognizing text and detecting objects in images, aimed mainly at CAPTCHAs. Its character recognition handles alphanumeric, Chinese, and special characters, and its object detection returns bounding box coordinates for targets within an image.
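A minimal usage sketch of the two capabilities described above. The class name `DdddOcr`, the `det=True` flag, and the `classification` and `detection` methods are taken from the project's README and may differ between versions, so treat them as assumptions. The `box_center` helper and the `[x1, y1, x2, y2]` box layout it expects are illustrative only.

```python
from typing import List, Sequence, Tuple


def recognize_text(image_bytes: bytes) -> str:
    """Run OCR on raw image bytes (assumes the ddddocr package is installed)."""
    import ddddocr  # imported lazily so box_center works without the package

    ocr = ddddocr.DdddOcr()  # default mode: character recognition
    return ocr.classification(image_bytes)


def detect_boxes(image_bytes: bytes) -> List[Sequence[int]]:
    """Return bounding boxes, assumed to be [x1, y1, x2, y2] per target."""
    import ddddocr

    detector = ddddocr.DdddOcr(det=True)  # detection mode
    return detector.detection(image_bytes)


def box_center(box: Sequence[int]) -> Tuple[float, float]:
    """Hypothetical helper: centre point of an [x1, y1, x2, y2] box."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2, (y1 + y2) / 2)
```

With an image file read in binary mode, `recognize_text(data)` would return the recognized string, and applying `box_center` to each entry from `detect_boxes(data)` would give one point per detected target.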

For slider CAPTCHAs, the library locates the gap left by the missing piece using either edge matching or image comparison. It can also filter an image by color before recognition to reduce noise from complex backgrounds, and it lets users restrict OCR output to a specific character subset to improve accuracy on targeted inputs.
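The image comparison idea can be illustrated with a small, dependency-free sketch. This is not ddddocr's implementation: the function `find_gap_x`, its `threshold` and `min_pixels` parameters, and the synthetic images are all hypothetical, chosen only to show how comparing a background with a gap against the intact background reveals the gap's horizontal position.

```python
from typing import List

Gray = List[List[int]]  # rows of 0-255 grayscale pixel values


def find_gap_x(with_gap: Gray, full: Gray,
               threshold: int = 40, min_pixels: int = 3) -> int:
    """Return the leftmost column where the two images clearly differ, or -1."""
    height = len(full)
    width = len(full[0])
    for x in range(width):
        # Count pixels in this column whose brightness changed noticeably.
        changed = sum(
            1 for y in range(height)
            if abs(with_gap[y][x] - full[y][x]) > threshold
        )
        if changed >= min_pixels:
            return x
    return -1


# Synthetic 6x10 background with a darker 4x3 "gap" starting at column 5.
full = [[200] * 10 for _ in range(6)]
with_gap = [row[:] for row in full]
for y in range(1, 5):
    for x in range(5, 8):
        with_gap[y][x] = 60

gap_x = find_gap_x(with_gap, full)
print(gap_x)
```

Requiring several changed pixels per column, rather than one, is a simple guard against isolated noise; real CAPTCHA images would likely need more robust preprocessing.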

For extensibility, ddddocr supports loading user-trained ONNX models with custom character sets, enabling recognition of specialized or proprietary CAPTCHA types. GPU acceleration offloads model inference to a GPU device, which speeds up batch or high-volume processing. The library also includes a REST API server that exposes recognition, detection, and slider-matching functions through HTTP endpoints for remote access.
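A hedged sketch of how the custom-model and GPU options might be wired together. The constructor parameters `import_onnx_path`, `charsets_path`, `use_gpu`, `device_id`, `det`, and `ocr` follow the project's README as of this writing and should be verified against the installed version. The helpers `build_ocr_kwargs` and `make_ocr` are hypothetical, as are the file names in the usage note.

```python
from typing import Any, Dict, Optional


def build_ocr_kwargs(
    onnx_path: Optional[str] = None,
    charsets_path: Optional[str] = None,
    use_gpu: bool = False,
    device_id: int = 0,
) -> Dict[str, Any]:
    """Hypothetical helper: assemble constructor arguments for DdddOcr."""
    kwargs: Dict[str, Any] = {}
    if use_gpu:
        kwargs.update(use_gpu=True, device_id=device_id)
    if onnx_path or charsets_path:
        if not (onnx_path and charsets_path):
            raise ValueError(
                "a custom model needs both an ONNX file and a charset file"
            )
        # Assumed from the README: the built-in models are switched off
        # when a user-trained model is loaded.
        kwargs.update(det=False, ocr=False,
                      import_onnx_path=onnx_path,
                      charsets_path=charsets_path)
    return kwargs


def make_ocr(**options: Any):
    """Create the recognizer (requires the ddddocr package to be installed)."""
    import ddddocr  # lazy import keeps build_ocr_kwargs usable on its own

    return ddddocr.DdddOcr(**build_ocr_kwargs(**options))
```

For example, `make_ocr(onnx_path="model.onnx", charsets_path="charsets.json", use_gpu=True)` would request a custom model on GPU device 0. Keeping argument assembly separate from object creation lets the configuration be checked without loading any model.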

## Tags

### Part of an Awesome List

- [Text Recognition](https://awesome-repositories.com/f/awesome-lists/ai/text-recognition.md) — Identifies alphanumeric, Chinese, and special characters in images and returns recognized text. ([source](https://cdn.jsdelivr.net/gh/sml2h3/ddddocr@master/README.md))
- [Optical Character Recognitions](https://awesome-repositories.com/f/awesome-lists/more/text-extraction-and-ocr/optical-character-recognitions.md) — Extracts alphanumeric, Chinese, and special characters from images using a pre-trained OCR model. ([source](https://github.com/sml2h3/ddddocr))
- [Slider CAPTCHA Solvers](https://awesome-repositories.com/f/awesome-lists/security/captcha-solving/slider-captcha-solvers.md) — Implements slider CAPTCHA gap matching using edge detection and image comparison algorithms. ([source](https://cdn.jsdelivr.net/gh/sml2h3/ddddocr@master/README.md))

### Artificial Intelligence & ML

- [Object Detection](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/object-detection-tracking/object-detection.md) — Locates target objects within images and returns their bounding box coordinates. ([source](https://cdn.jsdelivr.net/gh/sml2h3/ddddocr@master/README.md))
- [Custom Model Training](https://awesome-repositories.com/f/artificial-intelligence-ml/custom-model-training.md) — Provides custom model loading for ONNX-based recognition of proprietary CAPTCHAs. ([source](https://cdn.jsdelivr.net/gh/sml2h3/ddddocr@master/README.md))
- [GPU-Accelerated Inference](https://awesome-repositories.com/f/artificial-intelligence-ml/gpu-accelerated-inference.md) — Offloads model inference to a GPU device to speed up batch or high-volume recognition tasks. ([source](https://cdn.jsdelivr.net/gh/sml2h3/ddddocr@master/README.md))
- [ONNX Model Loaders](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/fine-tuning-and-customization/model-customization/ocr-model-customizers/onnx-model-loaders.md) — Supports loading user-trained ONNX models with custom character sets for specialized CAPTCHA recognition. ([source](https://github.com/sml2h3/ddddocr))

### Development Tools & Productivity

- [OCR REST API Servers](https://awesome-repositories.com/f/development-tools-productivity/rest-api-integrations/ocr-rest-api-servers.md) — Ships a built-in REST API server that exposes OCR, detection, and slider-matching functions over HTTP. ([source](https://cdn.jsdelivr.net/gh/sml2h3/ddddocr@master/README.md))