Command line utilities and local applications for splitting, merging, and manipulating PDF document files efficiently.
Stirling-PDF is a web-based PDF management suite used for editing, merging, splitting, and converting PDF documents. It functions as a self-hosted document manager, providing a centralized interface for users to manipulate files on a private server. The system features a workflow automation engine that allows for the creation of processing pipelines to handle large volumes of documents without writing custom code. It also includes an optical character recognition tool to convert scanned PDFs into searchable and editable text. Access is managed through single sign-on integration and OIDC compatibility, which supports secure authentication and the maintenance of audit logs for compliance. The application is delivered as a container-based deployment and exposes its functions through a REST API for external software integration.
Stirling-PDF is a self-hosted, privacy-focused suite that provides a graphical interface for merging and splitting documents, while also supporting batch processing and local execution via container deployment.
Pdfcraft is a containerized service for self-managed PDF processing, editing, and conversion. It provides a toolkit for document manipulation, a multi-format converter, and OCR software to transform scanned documents into searchable and editable text. The project features a visual, node-based workflow editor that allows users to build automated pipelines by chaining together various PDF conversion and optimization operations. The service covers a broad range of capabilities, including document management for merging and splitting files, format conversion between PDFs and office documents or images, and security tools for encryption and metadata removal. It also includes utilities for content editing, interactive form creation, and file optimization.
This is a self-hosted, containerized PDF suite that provides the requested splitting, merging, and local processing capabilities through a visual workflow interface.
omni-tools is a browser-based utility suite that provides client-side tools for manipulating PDFs, media files, and data formats. It functions as a collection of web-based processors and calculation engines that execute directly within the browser without requiring server-side processing. The suite includes a client-side PDF editor for merging, splitting, and reorganizing document structures, and a web-based media processor for resizing, trimming, and converting image and video files. It also features a data format converter that transforms structured information between JSON, CSV, and XML formats using schema-based mapping. The project further provides technical calculation utilities for date and time analysis, electrical property computations, and mathematical operations. Additional capabilities include text formatting tools for modifying casing and shuffling list items.
This browser-based utility suite provides local, client-side PDF splitting and merging capabilities, though it lacks a command-line interface as it is designed specifically as a web-based graphical tool.
Pdfarranger is a PDF page organizer, document editor, image converter, and booklet generator. It provides a visual drag-and-drop interface to reorder, merge, split, and delete pages within PDF documents. The application includes specialized tools for creating booklet printing layouts and converting image files into PDF pages or exporting PDF pages as PNG and JPEG images. It allows for the modification of document metadata while preserving internal outlines and hyperlinks. The software covers a range of structural manipulations, including page rotation, resizing, cropping, and overlaying. It also supports content extraction to the system clipboard and text search within loaded documents. The interface supports language localization and custom keyboard shortcut mapping via configuration files.
This is a desktop application that provides a visual interface for splitting, merging, and reordering PDF pages locally, though it lacks a native command-line interface for batch processing.
Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-based interface for interactive editing and a programmatic, API-first architecture that allows for the automation of document workflows through standard HTTP requests. The project distinguishes itself through its focus on private, infrastructure-agnostic deployment and granular security. It supports role-based access control and stateless session authentication, ensuring that sensitive operations remain protected within a user-controlled environment. By offering a unified interface for sequential file transformations, it enables users to chain multiple processing tasks into single, automated pipelines while maintaining full control over document integrity and security. The system covers a broad range of document manipulation capabilities, including optical character recognition, digital signature validation, and advanced layout operations like booklet imposition and page reorganization. It is built for flexible integration, supporting deployment across containerized environments, bare metal, or native desktop installations. Configuration is managed through environment variables, YAML files, or the web interface, allowing for consistent behavior across diverse infrastructure setups.
Stirling-PDF is a comprehensive, self-hosted document processing suite that provides both a graphical web interface and robust API support for local PDF splitting, merging, and advanced manipulation.
qpdf is a collection of specialized utility tools for the structural transformation, metadata inspection, file optimization, and cryptographic management of PDF documents. It provides a command line tool for transforming and inspecting internal PDF structures, a structural transformer for reorganizing pages and merging documents, and an encryption engine for managing passwords and restrictions. The project distinguishes itself through a technical approach to document manipulation, utilizing an object-based structural representation to modify files as a graph of unique objects. It includes a metadata inspector for analyzing cross-reference tables and internal object streams, as well as a dedicated optimization tool for compressing data streams and linearizing documents to enable faster web viewing. The toolset covers broad capability areas including PDF content management for embedding attachments and overlaying visual layers, as well as security management for encryption and decryption. It also provides facilities for low-level metadata analysis and the registration of custom cryptographic providers to handle secure data operations.
This is a powerful command-line utility for structural PDF manipulation, including splitting and merging, that processes files locally; however, it lacks a built-in graphical user interface.
BentoPDF is a browser-based document toolkit designed for local-first PDF manipulation, conversion, and metadata management. By executing all file processing tasks directly within the browser sandbox, the application ensures that sensitive data remains on the user's device and is never uploaded to or stored on external servers. The platform distinguishes itself through a modular architecture that supports dynamic remote script loading and the integration of external processing engines. Users can extend the core functionality by connecting third-party libraries, which are executed as compiled binary modules within the browser runtime to maintain high performance while preserving data privacy. The toolkit provides a comprehensive suite of capabilities for document archival and content extraction. It supports transforming files into standardized archival formats, converting fonts to outlines, and extracting structured data such as text, markdown, or spreadsheets. Additionally, users can organize document structures by splitting files based on bookmarks and modifying internal metadata. The application maintains a consistent user experience by persisting configuration settings and interface preferences directly to the browser local storage. It is distributed as a web-based utility that requires no server-side dependencies for its operations.
BentoPDF is a browser-based utility that performs PDF manipulation entirely on the client side, fulfilling the requirement for local, privacy-focused document processing including splitting and metadata management.
ffmpeg.wasm is a browser-based multimedia processing engine that brings the capabilities of the FFmpeg library directly to the client environment. By utilizing WebAssembly, it enables audio and video transcoding, format conversion, and stream recording to occur entirely within the browser without requiring server-side infrastructure. The library distinguishes itself by executing resource-intensive media tasks in background threads, ensuring that the main user interface remains responsive during complex operations. It manages data through an isolated, in-memory virtual file system, allowing for efficient input and output handling without the need for direct disk access. This tool provides a comprehensive suite for local media manipulation, including real-time processing and the application of visual filters. It is distributed as a library that integrates into web applications to facilitate high-performance multimedia tasks locally.
This is a multimedia processing engine designed for audio and video transcoding rather than a dedicated tool for manipulating PDF documents.
PDFPatcher is a specialized suite of PDF utility tools designed for editing navigational bookmarks, modifying document structure, managing metadata, and processing pages. It provides a toolkit for altering PDF structures and properties without changing the original content stream. The project is distinguished by its focus on bookmark management, featuring bulk editing and the ability to generate clickable bookmarks from visual tables of contents using optical character recognition. It also includes capabilities for font optimization through substitution and embedding to ensure consistent character rendering across different readers. The software covers a broad range of document manipulation tasks, including merging and splitting files, extracting or reorganizing pages, and converting pages to images. It provides metadata administration tools for renaming files based on internal properties and security utilities to remove print and copy restrictions. Additional structural tools allow for the visualization of internal document node hierarchies and their export to XML. Command-line tools are provided for automating file system operations based on internal document metadata properties.
PDFPatcher is a comprehensive local utility that provides the requested PDF splitting, merging, and batch processing capabilities through a graphical interface, though its primary focus is on advanced bookmark and metadata management.
pdfcpu is a Go PDF processing library and command-line interface designed for programmatically manipulating, optimizing, and validating PDF files. It provides a toolkit for document content modification and structural management. The project distinguishes itself as an optimization tool and layout engine, capable of reducing file sizes and improving loading speeds by streamlining internal structures. It also functions as a security manager, providing password-based encryption, decryption, and digital signature verification. Its capability surface includes page management for merging, splitting, cropping, and rotating documents, as well as the extraction of embedded assets like images and fonts. It also supports branding automation through the management of visual overlays such as stamps and watermarks, and provides utilities for managing embedded file attachments and validating document integrity.
This is a command-line utility that provides robust local PDF splitting, merging, and manipulation capabilities, though it lacks a built-in graphical user interface.