1 repo
Tools for configuring and optimizing document layout analysis and segmentation modes to improve OCR accuracy.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Page Segmentation Optimizers. Refine with filters or upvote what's useful.
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d