# paperless-ngx/paperless-ngx

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/paperless-ngx-paperless-ngx).**

36,738 stars · 2,335 forks · Python · gpl-3.0

## Links

- GitHub: https://github.com/paperless-ngx/paperless-ngx
- Homepage: http://docs.paperless-ngx.com/
- awesome-repositories: https://awesome-repositories.com/repository/paperless-ngx-paperless-ngx.md

## Topics

`angular` `archiving` `django` `dms` `document-management` `document-management-system` `hacktoberfest` `machine-learning` `ocr` `optical-character-recognition` `pdf`

## Description

Paperless-ngx is a self-hosted document management server designed to transform physical paperwork into a searchable, organized digital archive. It functions as a private platform for storing, indexing, and retrieving documents, providing users with full control over their data on local infrastructure or private cloud servers.

The system distinguishes itself through an automated workflow engine that categorizes, tags, and routes incoming files using content analysis and metadata extraction. To maintain responsiveness during resource-intensive tasks like optical character recognition, it utilizes an asynchronous task queue. The platform also features a dedicated search engine for rapid retrieval across large archives and stores documents in a structured, portable directory hierarchy on disk.

Beyond core storage, the project acts as a central integration hub by exposing all system functionality through a comprehensive interface. This allows for automated document workflows, event-driven ingestion from monitored directories, and connectivity with a wide range of community-developed mobile applications, desktop clients, and automation scripts.

## Tags

### Content Management & Publishing

- [Document Management Systems](https://awesome-repositories.com/f/content-management-publishing/document-management-systems.md) — Acts as a centralized platform for storing, indexing, and retrieving documents using automated metadata extraction.
- [Personal Archiving](https://awesome-repositories.com/f/content-management-publishing/personal-archiving.md) — Builds a searchable digital repository for physical paperwork through automation.
- [Self-Hosted Archiving Solutions](https://awesome-repositories.com/f/content-management-publishing/self-hosted-archiving-solutions.md) — Provides a private software solution for managing document archives on local infrastructure or private cloud servers.

### Artificial Intelligence & ML

- [Metadata Extraction](https://awesome-repositories.com/f/artificial-intelligence-ml/metadata-extraction.md) — Automatically identifies and categorizes key information from scanned files.

### Business & Productivity Software

- [Workflow Automation Engines](https://awesome-repositories.com/f/business-productivity-software/workflow-automation-engines.md) — Provides an automated pipeline for categorizing, tagging, and routing digital files based on content analysis.

### DevOps & Infrastructure

- [Task Queues](https://awesome-repositories.com/f/devops-infrastructure/task-queues.md) — Handles resource-intensive document parsing and OCR tasks asynchronously to maintain interface responsiveness.
- [Event Pipelines](https://awesome-repositories.com/f/devops-infrastructure/event-pipelines.md) — Monitors directories for new files to automatically trigger ingestion workflows.

### Data & Databases

- [Search Engines](https://awesome-repositories.com/f/data-databases/search-engines.md) — Maintains an inverted index of document contents and metadata for rapid retrieval.
- [File Storage Systems](https://awesome-repositories.com/f/data-databases/file-storage-systems.md) — Organizes documents into a structured directory hierarchy on disk for portability and backup.
- [Relational Databases](https://awesome-repositories.com/f/data-databases/relational-databases.md) — Stores document relationships, tags, and metadata to ensure data integrity and filtering.

### Security & Cryptography

- [Data Sovereignty](https://awesome-repositories.com/f/security-cryptography/data-sovereignty.md) — Ensures full control over data storage and privacy by hosting records on private infrastructure.

### Software Engineering & Architecture

- [Workflow Automation](https://awesome-repositories.com/f/software-engineering-architecture/workflow-automation.md) — Integrates document management into larger systems using APIs and webhooks.

### Web Development

- [API Orchestration](https://awesome-repositories.com/f/web-development/api-orchestration.md) — Exposes system functionality via a RESTful interface for mobile apps and automation.
- [Integration APIs](https://awesome-repositories.com/f/web-development/integration-apis.md) — Provides comprehensive programmatic access to document data for external mobile apps, desktop clients, and automation scripts.

### Mobile Development

- [Cross-Platform Clients](https://awesome-repositories.com/f/mobile-development/cross-platform-clients.md) — Offers native clients for both macOS and iOS to support offline access and scanning. ([source](https://github.com/paperless-ngx/paperless-ngx/wiki/Related-Projects))
- [Mobile Clients](https://awesome-repositories.com/f/mobile-development/mobile-clients.md) — Provides native mobile applications for managing documents on the go. ([source](https://github.com/paperless-ngx/paperless-ngx/wiki/Related-Projects))

### User Interface & Experience

- [Remote Access](https://awesome-repositories.com/f/user-interface-experience/remote-access.md) — Enables interaction with a centralized archive from various devices and locations.
