# rednote-hilab/dots.ocr

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/rednote-hilab-dots-ocr).**

7,695 stars · 707 forks · Python · mit

## Links

- GitHub: https://github.com/rednote-hilab/dots.ocr
- awesome-repositories: https://awesome-repositories.com/repository/rednote-hilab-dots-ocr.md

## Description

dots.ocr is a suite of software utilities for document layout analysis, multilingual optical character recognition, and scene text digitization. It functions as an engine for extracting digital text and structured layout data from images and PDFs across various human scripts.

The project includes a specialized transformer for converting charts, diagrams, and chemical formulas from raster images into scalable vector graphics. It also provides a pipeline to transform extracted text and structural layout from documents and web screenshots into formatted Markdown files.

The system covers capabilities for identifying bounding boxes and categories of layout elements to produce structured JSON representations. It further includes tools for scene text detection within natural images and an evaluation framework for measuring text and table extraction accuracy against ground truth data.

## Tags

### Artificial Intelligence & ML

- [Document Analysis Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/document-analysis-tools.md) — Provides a comprehensive tool for detecting bounding boxes and layout categories to structure documents as JSON.
- [Bounding Box Detection](https://awesome-repositories.com/f/artificial-intelligence-ml/bounding-box-detection.md) — Provides bounding box detection to locate page elements and distinguish body text from headers and footers.
- [Multilingual OCR Systems](https://awesome-repositories.com/f/artificial-intelligence-ml/multilingual-ocr-systems.md) — Extracts digital text and structured layout data from images and PDFs across various human scripts.
- [Document Layout Analysis](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/document-layout-analysis.md) — Identifies bounding boxes and categories for layout elements within images and PDF files. ([source](https://github.com/rednote-hilab/dots.ocr/blob/master/README.md))
- [Multilingual Text Recognition](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition/multilingual-text-recognition.md) — Implements a unified recognition engine to process diverse character sets and writing systems from images into digital text. ([source](https://github.com/rednote-hilab/dots.ocr/blob/master/README.md))
- [Scene Text Recognition](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition/multilingual-text-recognition/scene-text-recognition.md) — Locates and extracts digital text from natural images and complex environmental backgrounds.
- [OCR-Based Extraction](https://awesome-repositories.com/f/artificial-intelligence-ml/script-conversion/ocr-based-extraction.md) — Extracts digital text and structured layout data from images and PDFs across various human scripts. ([source](https://github.com/rednote-hilab/dots.ocr#readme))
- [Model Serving Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/model-serving-engines.md) — Implements high-performance model serving engines to provide scalable text extraction API endpoints.
- [Model Serving Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/model-serving-servers.md) — Serves machine learning models as scalable API endpoints using high-performance inference servers. ([source](https://github.com/rednote-hilab/dots.ocr#readme))

### Part of an Awesome List

- [Scene Text Detection Systems](https://awesome-repositories.com/f/awesome-lists/ai/text-recognition/scene-text-detection-systems.md) — Finds and digitizes text embedded within natural scene images and web screenshots.
- [Text Detection Models](https://awesome-repositories.com/f/awesome-lists/ai/text-detection-models.md) — Locates and digitizes text embedded within natural scene images and complex backgrounds. ([source](https://github.com/rednote-hilab/dots.ocr/blob/master/README.md))
- [OCR Accuracy Evaluators](https://awesome-repositories.com/f/awesome-lists/more/text-extraction-and-ocr/ocr-accuracy-evaluators.md) — Measures text and table extraction accuracy against ground truth images to refine OCR performance.

### Business & Productivity Software

- [Document Digitization Tools](https://awesome-repositories.com/f/business-productivity-software/document-digitization-tools.md) — Converts images and PDFs containing various human scripts into digital text while preserving document structure.

### Content Management & Publishing

- [Automated Generation](https://awesome-repositories.com/f/content-management-publishing/content-management-systems/content-architecture-modeling/documentation-tooling/authoring-editing/markdown-documentation/automated-generation.md) — Transforms extracted text and structural layout from documents and web screenshots into formatted Markdown files. ([source](https://github.com/rednote-hilab/dots.ocr#readme))
- [Document Layout Analyzers](https://awesome-repositories.com/f/content-management-publishing/content-processing-transformation/document-processing-conversion/document-processing/data-extraction-analysis/document-layout-analyzers.md) — Identifies bounding boxes and categories of layout elements to produce structured JSON representations of documents. ([source](https://github.com/rednote-hilab/dots.ocr#readme))
- [Markdown Converters](https://awesome-repositories.com/f/content-management-publishing/content-processing-transformation/document-processing-conversion/document-processing/format-specific-parsers/markdown-converters.md) — Transforms extracted text and structural layout from images and documents into formatted Markdown files.
- [Visual-to-Markdown Pipelines](https://awesome-repositories.com/f/content-management-publishing/markdown-documentation/visual-to-markdown-pipelines.md) — Transforms scanned documents and web screenshots into formatted Markdown files for editing and archival.

### Data & Databases

- [Layout-Aware Extraction](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction/layout-aware-extraction.md) — Maps spatial relationships between text elements and document structures during the extraction process.
- [Visual Layout Parsing](https://awesome-repositories.com/f/data-databases/document-parsing-engines/web-document-parsing/visual-layout-parsing.md) — Extracts structured text and layout information specifically from screenshots of web pages. ([source](https://github.com/rednote-hilab/dots.ocr/blob/master/README.md))

### Development Tools & Productivity

- [Layout-Preserving Export](https://awesome-repositories.com/f/development-tools-productivity/markdown-documentation/layout-preserving-export.md) — Generates formatted Markdown files from extracted document text while excluding page headers and footers. ([source](https://github.com/rednote-hilab/dots.ocr/blob/master/README.md))

### Graphics & Multimedia

- [Raster-to-Vector Conversions](https://awesome-repositories.com/f/graphics-multimedia/raster-to-vector-conversions.md) — Transforms rasterized charts, diagrams, and chemical formulas into scalable vector graphics. ([source](https://github.com/rednote-hilab/dots.ocr#readme))
- [Vector Graphics Export](https://awesome-repositories.com/f/graphics-multimedia/vector-graphics-export.md) — Converts technical graphics like charts and chemical formulas from images into scalable vector graphics.
- [Raster-to-SVG Transformers](https://awesome-repositories.com/f/graphics-multimedia/vector-graphics-export/raster-to-svg-transformers.md) — Converts charts, diagrams, and chemical formulas from raster images into scalable vector graphics code.
