awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
OCR Training Datasets · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesOCR Training Datasets

Community-maintained datasets specifically for improving optical character recognition accuracy.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · OCR Training Datasets. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Artificial Intelligence & Machine Learning
  4. Machine Learning Datasets
  5. OCR Training Datasets

Awesome OCR Training Datasets GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHubView on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    C++hacktoberfestlstmmachine-learning