Automated processes for identifying and structuring information from raw data.
Distinguishing note: Focuses on data extraction rather than general machine learning.
1 GitHub repository in Artificial Intelligence & ML · Metadata Extraction.
Paperless-ngx is a self-hosted document management server designed to transform physical paperwork into a searchable, organized digital archive. It functions as a private platform for storing, indexing, and retrieving documents, providing users with full control over their data on local infrastructure or private cloud servers. The system distinguishes itself through an automated workflow engine that categorizes, tags, and routes incoming files using content analysis and metadata extraction. To maintain responsiveness during resource-intensive tasks like optical character recognition, it utili
Automatically identifies and categorizes key information from scanned files.
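To make the idea of identifying and categorizing key information concrete, here is a minimal sketch of rule-based metadata extraction from text that has already been through OCR. It is not Paperless-ngx code and does not use its API; the `TAG_RULES` table, the `extract_metadata` function, and the sample document are all hypothetical, and the matching is deliberately naive (plain substring checks and ISO-formatted dates only).

```python
import re
from datetime import date

# Hypothetical keyword rules: tag name -> phrases that trigger the tag.
# Matching is a plain case-insensitive substring check, so keep phrases specific.
TAG_RULES = {
    "invoice": ("invoice", "amount due"),
    "tax": ("tax return", "tax assessment"),
}

# Only ISO dates (YYYY-MM-DD) are recognized in this sketch.
ISO_DATE = re.compile(r"\b(\d{4})-(\d{2})-(\d{2})\b")


def extract_metadata(text):
    """Return the first valid ISO date and the matching tags for a document."""
    lowered = text.lower()
    tags = sorted(
        tag
        for tag, phrases in TAG_RULES.items()
        if any(phrase in lowered for phrase in phrases)
    )

    doc_date = None
    match = ISO_DATE.search(text)
    if match:
        try:
            doc_date = date(*map(int, match.groups()))
        except ValueError:
            # Looked like a date but is not a real one (e.g. month 13).
            doc_date = None

    return {"date": doc_date, "tags": tags}


sample = "INVOICE #1042\nIssued 2024-03-15\nAmount due: 120.00 EUR"
print(extract_metadata(sample))
```

A real document manager would combine rules like these with OCR output, learned classifiers, and user-defined matching patterns; the sketch only shows the shape of the step that turns raw text into structured fields.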