# chineseocr/chineseocr

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/chineseocr-chineseocr).**

6,113 stars · 1,710 forks · Python · MIT

## Links

- GitHub: https://github.com/chineseocr/chineseocr
- awesome-repositories: https://awesome-repositories.com/repository/chineseocr-chineseocr.md

## Topics

`chinese-ocr` `chinese-text-detect` `darknet-text-detect` `idcard` `ocr` `opencv-dnn` `trainticket` `yolo3`

## Description

chineseocr is an end-to-end deep learning pipeline for detecting and recognizing Chinese and English text in images. The project combines text region detection using YOLOv3 with sequence-based recognition via Convolutional Recurrent Neural Networks (CRNN) and dense OCR models, forming a complete optical character recognition workflow.
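The detect-then-recognize flow described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual API: `run_ocr`, `crop`, and the `detect` and `recognize` callables are hypothetical names, and the image is modelled as a plain list of pixel rows so the sketch needs no dependencies.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels
Image = List[List[int]]          # grayscale image as a list of pixel rows


def crop(image: Image, box: Box) -> Image:
    """Cut the region covered by `box` out of the image."""
    x_min, y_min, x_max, y_max = box
    return [row[x_min:x_max] for row in image[y_min:y_max]]


def run_ocr(
    image: Image,
    detect: Callable[[Image], Sequence[Box]],   # hypothetical text detector
    recognize: Callable[[Image], str],          # hypothetical text recognizer
) -> List[Tuple[Box, str]]:
    """Detect text boxes, then recognize each cropped region."""
    # Assumed reading order: top to bottom, then left to right.
    boxes = sorted(detect(image), key=lambda b: (b[1], b[0]))
    return [(box, recognize(crop(image, box))) for box in boxes]
```

In a real deployment `detect` would wrap the YOLOv3 text detector and `recognize` would wrap the CRNN or dense OCR model; keeping them as plain callables makes either stage easy to swap.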

The pipeline includes orientation detection to handle text rotated at 0, 90, 180, or 270 degrees before recognition, and supports structured field extraction from identity cards and train tickets. A multi-framework model converter enables trained models to be converted between darknet, Keras, and PyTorch formats for cross-framework deployment.
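As an illustration of the orientation step, the sketch below undoes a detected rotation before recognition. It assumes the orientation model reports how far the content has been turned clockwise from upright; that convention, and the names `rotate_ccw` and `rotate_upright`, are assumptions made for this example rather than details taken from the project.

```python
from typing import List

Image = List[List[int]]  # grayscale image as a list of pixel rows


def rotate_ccw(image: Image) -> Image:
    """Rotate the image 90 degrees counter-clockwise."""
    # Columns become rows; the rightmost column ends up as the top row.
    return [list(column) for column in zip(*image)][::-1]


def rotate_upright(image: Image, detected_angle: int) -> Image:
    """Undo a clockwise rotation of 0, 90, 180, or 270 degrees."""
    if detected_angle not in (0, 90, 180, 270):
        raise ValueError(f"unsupported angle: {detected_angle}")
    # One counter-clockwise quarter turn per 90 degrees of detected rotation.
    for _ in range(detected_angle // 90):
        image = rotate_ccw(image)
    return image
```

Restricting the classifier to four angles keeps the correction lossless, since quarter turns only rearrange pixels and never interpolate them.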

Text regions can be detected with darknet, OpenCV DNN, or Keras backends, and variable-length Chinese and English text is then recognized within those regions. The documentation covers the model conversion tools and the configuration of the end-to-end pipeline.
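CRNN recognizers commonly handle variable-length text by emitting one label per image column and decoding the sequence with CTC. Assuming this project's CRNN follows that convention, a greedy decoder could look like the sketch below. The function name `ctc_greedy_decode`, the choice of index 0 as the blank label, and the toy alphabet in the usage note are assumptions made for illustration.

```python
from itertools import groupby
from typing import Sequence


def ctc_greedy_decode(frame_labels: Sequence[int], alphabet: str, blank: int = 0) -> str:
    """Turn per-frame label indices (already arg-maxed) into text.

    Assumes label 0 is the CTC blank and label i maps to alphabet[i - 1].
    """
    # Step 1: merge runs of identical consecutive labels.
    collapsed = [label for label, _ in groupby(frame_labels)]
    # Step 2: drop blanks; a blank between two equal labels keeps both.
    return "".join(alphabet[label - 1] for label in collapsed if label != blank)
```

For example, with the toy alphabet `"中文o"`, the frames `[0, 1, 1, 0, 2, 2, 2, 0, 0, 3, 3]` collapse to the three characters of that alphabet in order, while `[1, 0, 1]` yields the first character twice because the blank separates the repeats.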

## Tags

### Artificial Intelligence & ML

- [Chinese OCR Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/chinese-ocr-tools.md) — Provides optical character recognition specifically for Chinese scripts using deep learning models.
- [End-to-End Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/chinese-ocr-tools/end-to-end-pipelines.md) — Provides an end-to-end pipeline that detects text regions and recognizes Chinese and English characters.
- [Text Detectors in Images](https://awesome-repositories.com/f/artificial-intelligence-ml/chinese-text-recognition/text-detectors-in-images.md) — Locates and bounds text regions in images containing Chinese and English characters using YOLOv3 models and the darknet framework.
- [CRNN Recognizers](https://awesome-repositories.com/f/artificial-intelligence-ml/cloud-based-image-and-text-analysis/embedded-text-recognizers/crnn-recognizers.md) — Provides a sequence-based recognizer that reads variable-length Chinese and English text using CRNN models.
- [Dense OCR Models](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition/dense-ocr-models.md) — Ships dense OCR models for high-accuracy text extraction from document images.
- [YOLO-Based Detectors](https://awesome-repositories.com/f/artificial-intelligence-ml/text-detection-algorithms/yolo-based-detectors.md) — Uses YOLOv3 object detection architecture to locate text regions in images with bounding boxes.
- [Cross-Framework Model Conversion](https://awesome-repositories.com/f/artificial-intelligence-ml/cross-framework-model-conversion.md) — Converts trained models between darknet, Keras, and PyTorch formats for cross-framework deployment.
- [Model Format Converters](https://awesome-repositories.com/f/artificial-intelligence-ml/model-format-converters.md) — Converts trained deep learning models between darknet, Keras, and PyTorch formats for cross-framework deployment.
- [Model Format Converters](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-networks/model-format-converters.md) — Converts trained text detection and recognition models between darknet, Keras, and PyTorch formats.

### Part of an Awesome List

- [Deep Learning Detectors](https://awesome-repositories.com/f/awesome-lists/ai/text-detection/deep-learning-detectors.md) — Locates text regions in images using darknet, OpenCV DNN, or Keras deep learning models. ([source](https://cdn.jsdelivr.net/gh/chineseocr/chineseocr@app/README.md))
- [CRNN-Based Recognizers](https://awesome-repositories.com/f/awesome-lists/ai/text-recognition/crnn-based-recognizers.md) — Provides a CRNN-based recognizer for variable-length Chinese and English text sequences.
- [Variable-Length Text Recognizers](https://awesome-repositories.com/f/awesome-lists/ai/text-recognition/crnn-based-recognizers/variable-length-text-recognizers.md) — Recognizes variable-length Chinese and English text from detected regions using CRNN or dense OCR models. ([source](https://cdn.jsdelivr.net/gh/chineseocr/chineseocr@app/README.md))

### Content Management & Publishing

- [Text Orientation Detection](https://awesome-repositories.com/f/content-management-publishing/content-processing-transformation/document-processing-conversion/document-processing-tools/intelligent-extraction-frameworks/text-orientation-detection.md) — Detects text orientation at 0, 90, 180, or 270 degrees using deep learning models before recognition.

### Data & Databases

- [Document](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-transformation/data-parsing-extraction/field-extractors/document.md) — Pulls specific fields from identity cards and train tickets using OCR and layout analysis.
- [Document Field Extractors](https://awesome-repositories.com/f/data-databases/field-value-extraction/document-field-extractors.md) — Extracts structured fields from identity cards and train tickets using trained models.
- [Document Field Extractors](https://awesome-repositories.com/f/data-databases/structured-data-extraction/document-field-extractors.md) — Extracts structured fields from identity cards and train tickets using trained models. ([source](https://cdn.jsdelivr.net/gh/chineseocr/chineseocr@app/README.md))
