Utilities and libraries for extracting text from images and documents, including language-specific recognition support.
Distinguishing note: this category is specifically for OCR-related functionality under the Graphics & Multimedia umbrella.
One GitHub repository is listed under Graphics & Multimedia · Optical Character Recognition Tools.
OCRmyPDF is a command-line tool designed to transform scanned documents into searchable, selectable PDF files. It functions as a document processing pipeline that adds a hidden text layer to image-based files while simultaneously optimizing the document's file size and image quality. By preserving the original visual fidelity of the input, it ensures that digitized documents remain accessible to screen readers and search engines. The project distinguishes itself through a modular architecture that supports custom plugins and the integration of external recognition engines, allowing users to t
Supports multi-language document processing by allowing the configuration and installation of specific recognition language packs.
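As a rough illustration of the multi-language support described above, the following Python sketch assembles an OCRmyPDF command line and runs it only when the tool is installed. The helper names `build_ocrmypdf_command` and `run_ocr` and the file names are hypothetical, chosen for this example. It assumes the `-l` option accepts language codes joined with `+` (for example `eng+deu`), and that the matching recognition language packs have already been installed separately.

```python
import shutil
import subprocess
from typing import List, Sequence


def build_ocrmypdf_command(
    input_pdf: str, output_pdf: str, languages: Sequence[str]
) -> List[str]:
    """Build an argument list for the ocrmypdf CLI (hypothetical helper)."""
    if not languages:
        raise ValueError("at least one language code is required")
    # Assumption: multiple languages are passed to -l joined with '+',
    # e.g. 'eng+deu' for English plus German.
    return ["ocrmypdf", "-l", "+".join(languages), input_pdf, output_pdf]


def run_ocr(input_pdf: str, output_pdf: str, languages: Sequence[str]) -> bool:
    """Run OCR if ocrmypdf is on PATH; return True only on a zero exit code."""
    if shutil.which("ocrmypdf") is None:
        # Tool not installed: skip instead of raising.
        return False
    cmd = build_ocrmypdf_command(input_pdf, output_pdf, languages)
    result = subprocess.run(cmd, check=False)
    return result.returncode == 0


if __name__ == "__main__":
    # Placeholder file names; replace with real paths before running.
    print(build_ocrmypdf_command("scan.pdf", "scan_ocr.pdf", ["eng", "deu"]))
```

Building the argument list separately from running it keeps the command easy to inspect or log before any file is touched. If a requested language pack is missing, the tool is expected to exit with an error, which `run_ocr` reports as `False`.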