# Local PDF Split and Merge Tools

> Search results for `split and merge PDF files locally` on awesome-repositories.com. 115 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/split-and-merge-pdf-files-locally

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/split-and-merge-pdf-files-locally).**

## Results

- [frooodle/stirling-pdf](https://awesome-repositories.com/repository/frooodle-stirling-pdf.md) (81,168 ⭐) — Stirling-PDF is a web-based PDF management suite used for editing, merging, splitting, and converting PDF documents. It functions as a self-hosted document manager, providing a centralized interface for users to manipulate files on a private server.

The system features a workflow automation engine that allows for the creation of processing pipelines to handle large volumes of documents without writing custom code. It also includes an optical character recognition tool to convert scanned PDFs into searchable and editable text.

Access is managed through single sign-on integration and OIDC comp
- [py-pdf/pypdf](https://awesome-repositories.com/repository/py-pdf-pypdf.md) (9,818 ⭐) — pypdf is a Python library for parsing, manipulating, and generating PDF documents. It provides high-level operations for document processing, such as merging multiple files into one or splitting a single document into smaller files.

The project includes specialized tools for managing interactive elements, including the creation and modification of annotations, hyperlinks, and form fields. It also supports advanced metadata management, allowing for the extraction and modification of standard document properties and XML-based XMP metadata.

Beyond basic structural changes, the library covers pa
- [files-community/files](https://awesome-repositories.com/repository/files-community-files.md) (44,008 ⭐) — Files is a graphical file manager designed to replace the default operating system explorer with a unified, highly configurable environment. It functions as an extensible storage aggregator, normalizing local, cloud, and remote network storage into a single, consistent interface. By hooking into the system shell, the application intercepts navigation requests to provide a seamless, integrated experience for managing diverse file systems.

The application distinguishes itself through a dual-pane productivity environment that facilitates efficient cross-directory operations and drag-and-drop wor
- [hakimel/reveal.js](https://awesome-repositories.com/repository/hakimel-reveal-js.md) (71,731 ⭐) — This project is a web-native presentation framework that renders slide decks from standard HTML or Markdown. It functions as a declarative slide engine, managing navigation, state persistence, and lifecycle events through a configuration-driven interface. By leveraging standard web technologies, it enables the creation of responsive, browser-based presentations that support complex layouts, nested transitions, and interactive content.

The framework distinguishes itself through a modular, plugin-based architecture that allows developers to extend core functionality using custom hooks and event
- [gotenberg/gotenberg](https://awesome-repositories.com/repository/gotenberg-gotenberg.md) (12,452 ⭐) — Gotenberg is a stateless, containerized service that provides a unified API for document conversion, manipulation, and web-to-PDF rendering. It functions as a centralized engine that abstracts complex document processing tasks, allowing users to interact with various rendering tools and libraries through standard HTTP requests.

The service distinguishes itself by utilizing headless browser automation to capture web content and by wrapping multiple specialized PDF engines into a single interface. It supports asynchronous task execution, offloading resource-intensive operations to background wo
- [qiuyannnn/local-file-organizer](https://awesome-repositories.com/repository/qiuyannnn-local-file-organizer.md) (3,132 ⭐) — Local-File-Organizer is a local-first file classification system that uses on-device machine learning models to categorize documents and media into structured directories. It functions as an automated file classifier and asset manager that leverages local inference to sort files based on content, meaning, and metadata.

The project emphasizes privacy by performing all data processing and analysis on the local device, eliminating the need to send sensitive files to external cloud services. It utilizes local models to analyze text and image content to generate descriptive filenames and thematic
- [home-assistant/home-assistant.io](https://awesome-repositories.com/repository/home-assistant-home-assistant-io.md) (9,466 ⭐) — Home Assistant is a local home automation platform and server that acts as an IoT device orchestrator. It integrates diverse smart home hardware by wrapping third-party APIs into a standardized logic layer and stores all system state and historical statistics on local hardware to eliminate cloud dependencies.

The system functions as a Matter IoT controller and an MQTT home automation bridge, allowing for local interoperability between different manufacturers. It features a state-based entity model and an internal event bus that decouple physical device logic from system automation.

The platf
- [iib0011/omni-tools](https://awesome-repositories.com/repository/iib0011-omni-tools.md) (9,710 ⭐) — omni-tools is a browser-based utility suite that provides client-side tools for manipulating PDFs, media files, and data formats. It functions as a collection of web-based processors and calculation engines that execute directly within the browser without requiring server-side processing.

The suite includes a client-side PDF editor for merging, splitting, and reorganizing document structures, and a web-based media processor for resizing, trimming, and converting image and video files. It also features a data format converter that transforms structured information between JSON, CSV, and XML fo
- [py-pdf/pypdf2](https://awesome-repositories.com/repository/py-pdf-pypdf2.md) (10,094 ⭐) — pypdf is a free and open-source pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files. It can also add custom data, viewing options, and passwords to PDF files. pypdf can retrieve text and metadata from PDFs as well.
- [immersive-translate/immersive-translate](https://awesome-repositories.com/repository/immersive-translate-immersive-translate.md) (17,917 ⭐) — Immersive Translate is a browser-based translation tool that integrates third-party translation engines and large language models to provide automated, real-time text conversion directly within the web interface. It functions as a browser extension that intercepts and modifies web content, injecting translated text nodes into the document object model to maintain original page layouts and styling.

The project distinguishes itself through its granular control over the translation process, allowing users to define site-specific rules, manage custom terminology glossaries, and customize translat
- [pdfcrafttool/pdfcraft](https://awesome-repositories.com/repository/pdfcrafttool-pdfcraft.md) (3,113 ⭐) — Pdfcraft is a containerized service for self-managed PDF processing, editing, and conversion. It provides a toolkit for document manipulation, a multi-format converter, and OCR software to transform scanned documents into searchable and editable text.

The project features a visual, node-based workflow editor that allows users to build automated pipelines by chaining together various PDF conversion and optimization operations.

The service covers a broad range of capabilities, including document management for merging and splitting files, format conversion between PDFs and office documents or
- [mapbox/geojson-merge](https://awesome-repositories.com/repository/mapbox-geojson-merge.md) (245 ⭐) — Merge multiple GeoJSON files into one FeatureCollection.
- [jaykali/maskphish](https://awesome-repositories.com/repository/jaykali-maskphish.md) (3,020 ⭐) — Maskphish is a comprehensive security toolkit that integrates capabilities for digital forensics, network vulnerability scanning, open-source intelligence, penetration testing, and social engineering. It functions as a multi-purpose framework for automating reconnaissance and executing security audits across diverse network environments.

The project features a specialized phishing and social engineering toolkit used for cloning websites, masking URLs, and deploying deceptive pages to capture user credentials. It also includes a remote access Trojan builder for generating platform-specific exe
- [diegomura/react-pdf](https://awesome-repositories.com/repository/diegomura-react-pdf.md) (16,403 ⭐) — This library provides a rendering engine that translates declarative component trees into PDF files. It enables the construction of complex document structures using a component-based architecture, allowing developers to define layout, styling, and content programmatically. The system operates across both browser and server-side environments, offering a unified interface for generating structured documents.

The project distinguishes itself through a sophisticated layout and rendering pipeline that handles the complexities of document creation. It utilizes a flexbox-based engine for element po
- [nathancahill/split](https://awesome-repositories.com/repository/nathancahill-split.md) (6,278 ⭐) — Unopinionated utilities for resizeable split views
- [anthropics/claude-code](https://awesome-repositories.com/repository/anthropics-claude-code.md) (132,728 ⭐) — Anthropic's terminal-native AI coding agent.
- [sngn/eslint-plugin-split-and-sort-imports](https://awesome-repositories.com/repository/sngn-eslint-plugin-split-and-sort-imports.md) (1 ⭐) — An eslint plugin that sorts imports and splits 'multiple' imports into single line imports.
- [pdfcpu/pdfcpu](https://awesome-repositories.com/repository/pdfcpu-pdfcpu.md) (8,675 ⭐) — pdfcpu is a Go PDF processing library and command-line interface designed for programmatically manipulating, optimizing, and validating PDF files. It provides a toolkit for document content modification and structural management.

The project distinguishes itself as an optimization tool and layout engine, capable of reducing file sizes and improving loading speeds by streamlining internal structures. It also functions as a security manager, providing password-based encryption, decryption, and digital signature verification.

Its capability surface includes page management for merging, splittin
- [anko/tap-merge](https://awesome-repositories.com/repository/anko-tap-merge.md) (13 ⭐) — library and CLI tool that merges multiple Test Anything Protocol streams
- [mstamy2/pypdf2](https://awesome-repositories.com/repository/mstamy2-pypdf2.md) (10,064 ⭐) — PyPDF2 is a pure Python library for reading, writing, and manipulating PDF files. It functions as a document manipulator, text extractor, and encryption tool, allowing users to process PDF files without relying on external C libraries or native binaries.

The library provides specialized tools for modifying document structures, such as merging multiple files into one, splitting documents into separate files, and transforming page layouts through cropping. It also includes capabilities for securing documents via passwords and encryption.

Additional capabilities include the extraction of writte
- [cockroachdb/cockroach](https://awesome-repositories.com/repository/cockroachdb-cockroach.md) (32,207 ⭐) — Cockroach is a distributed SQL database designed to scale horizontally across multiple nodes while maintaining strict ACID compliance and global data consistency. It functions as a relational database engine that automatically partitions data into ranges, rebalancing them across a cluster to accommodate growing storage and throughput requirements. By utilizing a distributed consensus protocol, the system ensures that all nodes agree on the order of operations, providing fault tolerance and continuous availability even in the event of hardware failures.

The system distinguishes itself through
- [wmjordan/pdfpatcher](https://awesome-repositories.com/repository/wmjordan-pdfpatcher.md) (12,486 ⭐) — PDFPatcher is a specialized suite of PDF utility tools designed for editing navigational bookmarks, modifying document structure, managing metadata, and processing pages. It provides a toolkit for altering PDF structures and properties without changing the original content stream.

The project is distinguished by its focus on bookmark management, featuring bulk editing and the ability to generate clickable bookmarks from visual tables of contents using optical character recognition. It also includes capabilities for font optimization through substitution and embedding to ensure consistent char
- [grncdr/merge-stream](https://awesome-repositories.com/repository/grncdr-merge-stream.md) (212 ⭐) — Merge (interleave) a bunch of streams.
- [amruthpillai/reactive-resume](https://awesome-repositories.com/repository/amruthpillai-reactive-resume.md) (38,613 ⭐) — This project is a web-based platform designed for creating, managing, and sharing professional resumes. It functions as a structured document builder that integrates artificial intelligence to assist with content generation, editing, and analysis. Users can maintain a collection of resumes, customize their visual presentation through various templates, and export them into multiple formats for job applications.

The platform distinguishes itself through its autonomous AI agent capabilities, which can perform research, suggest incremental edits, and apply data patches directly to documents. It
- [awesome-selfhosted/awesome-selfhosted](https://awesome-repositories.com/repository/awesome-selfhosted-awesome-selfhosted.md) (299,516 ⭐) — This project is a community-curated directory of open-source software designed for deployment in private server environments and home labs. It serves as a comprehensive resource for discovering independent, self-hosted alternatives to mainstream cloud services, enabling users to maintain full data ownership and control over their digital infrastructure.

The directory is structured through a hierarchical taxonomy that organizes a vast collection of applications into logical categories, ranging from media management and data analytics to private communication and team productivity tools. It dis
- [wikimedia/composer-merge-plugin](https://awesome-repositories.com/repository/wikimedia-composer-merge-plugin.md) (1,009 ⭐) — Merge one or more additional composer.json files at Composer runtime
- [chainlit/chainlit](https://awesome-repositories.com/repository/chainlit-chainlit.md) (12,213 ⭐) — Chainlit is a Python framework designed for building and deploying interactive, stateful conversational AI interfaces. It provides a backend-driven platform that connects language models and agent frameworks to a web-based chat frontend, managing the complexities of session state, message history, and real-time communication.

The framework distinguishes itself by offering a component-based UI builder that allows developers to inject interactive widgets, rich media, and data visualizations directly into the chat stream. It supports the visualization of complex agent workflows, enabling users t
- [mgillr/crdt-merge](https://awesome-repositories.com/repository/mgillr-crdt-merge.md) (3 ⭐) — Conflict-free merge for DataFrames, JSON, ML models & distributed agents — powered by CRDTs. The first merge library where every operation is mathematically guaranteed to converge.
- [gitbookio/gitbook](https://awesome-repositories.com/repository/gitbookio-gitbook.md) (28,902 ⭐) — Gitbook is a documentation-as-code platform designed for centralized technical knowledge management. It functions as a knowledge management system that synchronizes documentation files directly with version control repositories, allowing teams to maintain content alongside their source code.

The platform distinguishes itself through an integrated artificial intelligence layer that provides context-aware search assistance and automated content suggestions. By utilizing block-based content modeling, it enables the construction of structured, modular documentation that can be compiled into stati
- [unidoc/unioffice](https://awesome-repositories.com/repository/unidoc-unioffice.md) (4,809 ⭐) — unioffice is a comprehensive document processing suite that provides a PDF document processor, an Open XML document library, a document security toolkit, and a document content extractor. It is designed to programmatically create, read, and modify Word, Excel, and PowerPoint files, as well as generate and edit PDF documents.

The project is distinguished by its native language implementation of the Open XML standard, which removes native binary dependencies to simplify container deployments. It features advanced capabilities for digital document security, including hardware-based PDF signing,
- [bertrandg/angular-split](https://awesome-repositories.com/repository/bertrandg-angular-split.md) (930 ⭐) — 🍌 Angular UI library to split views and allow dragging to resize areas using CSS grid layout.
- [home-assistant/core](https://awesome-repositories.com/repository/home-assistant-core.md) (87,753 ⭐) — Home Assistant is a centralized home automation platform designed to orchestrate diverse internet-connected devices and services. It functions as a local-first control system that normalizes heterogeneous hardware protocols into a unified set of entities, attributes, and services. The core architecture relies on an event-driven state bus and a modular integration model, allowing the system to manage state changes and communicate across decoupled components through standardized interfaces.

The platform distinguishes itself through a highly flexible, declarative configuration framework that all
- [stirling-tools/stirling-pdf](https://awesome-repositories.com/repository/stirling-tools-stirling-pdf.md) (81,109 ⭐) — Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-based interface for interactive editing and a programmatic, API-first architecture that allows for the automation of document workflows through standard HTTP requests.

The project distinguishes itself through its focus on private, infrastructure-agnostic deployment and granular
- [infiniflow/ragflow](https://awesome-repositories.com/repository/infiniflow-ragflow.md) (82,922 ⭐) — This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasoning workflows. By integrating document intelligence with advanced retrieval pipelines, the platform enables the creation of grounded, verifiable responses supported by traceable citations.

The platform distinguishes itself through deep document understanding and sophisticated know
- [qpdf/qpdf](https://awesome-repositories.com/repository/qpdf-qpdf.md) (4,780 ⭐) — qpdf is a collection of specialized utility tools for the structural transformation, metadata inspection, file optimization, and cryptographic management of PDF documents. It provides a command line tool for transforming and inspecting internal PDF structures, a structural transformer for reorganizing pages and merging documents, and an encryption engine for managing passwords and restrictions.

The project distinguishes itself through a technical approach to document manipulation, utilizing an object-based structural representation to modify files as a graph of unique objects. It includes a m
- [mbattyani/cl-pdf](https://awesome-repositories.com/repository/mbattyani-cl-pdf.md) (126 ⭐) — CL-PDF is a cross-platform Common Lisp library for generating PDF files.
- [ecrmnn/collect.js](https://awesome-repositories.com/repository/ecrmnn-collect-js.md) (6,571 ⭐) — collect.js is a dependency-free JavaScript library that provides a fluent, chainable interface for manipulating arrays and objects. It mirrors the Laravel Collection API, offering a consistent set of methods for data transformation across JavaScript and Laravel backend environments. The library stores collection data as plain arrays internally and supports fluent method chaining, where each method returns a new collection instance.

The library distinguishes itself by closely replicating the Laravel Collection API in JavaScript, mapping each PHP method to an equivalent JavaScript implementatio
- [galkahana/pdf-writer](https://awesome-repositories.com/repository/galkahana-pdf-writer.md) (1,010 ⭐) — High performance library for creating, modiyfing and parsing PDF files in C++
- [pymupdf/pymupdf](https://awesome-repositories.com/repository/pymupdf-pymupdf.md) (9,086 ⭐) — PyMuPDF is a comprehensive PDF manipulation library and document analysis tool. It serves as a text extraction tool, OCR engine, and image converter, providing a programmatic interface to edit, merge, split, and optimize PDF and Office documents.

The project distinguishes itself through high-performance capabilities, including the use of C-bindings for low-level manipulation and parallelized page processing to accelerate workloads. It provides specialized conversion paths, such as transforming PDF content into Markdown for retrieval-augmented generation and large language model pipelines.

It
- [jakevdp/pythondatasciencehandbook](https://awesome-repositories.com/repository/jakevdp-pythondatasciencehandbook.md) (48,561 ⭐) — This project is an interactive data science environment that combines code execution, rich media visualization, and narrative documentation into a persistent, browser-based platform. It serves as a comprehensive educational resource for scientific computing, providing a framework for iterative data analysis and machine learning prototyping.

The environment is distinguished by its focus on high-performance numerical computing, utilizing vectorized array operations and memory-mapped data structures to handle large-scale computations efficiently. It features a unified estimator interface that st
- [xilopaint/alfred-pdf-tools](https://awesome-repositories.com/repository/xilopaint-alfred-pdf-tools.md) (152 ⭐) — Optimize, encrypt and manipulate PDF files.
- [maxogden/binary-split](https://awesome-repositories.com/repository/maxogden-binary-split.md) (79 ⭐) — a fast newline (or any delimiter) splitter stream - like require('split') but specific for binary data
- [howie6879/weekly](https://awesome-repositories.com/repository/howie6879-weekly.md) (991 ⭐) — Weekly is an automated workflow orchestrator and content curation platform designed to manage the lifecycle of technical information. It functions as a static site generator and newsletter publication system, enabling users to aggregate, organize, and distribute curated resources through structured markdown files and automated deployment pipelines.

The platform distinguishes itself through its modular pipeline processing, which connects discrete tasks such as web data extraction, generative AI workflows, and optical character recognition. By integrating browser-based scraping with database-ba
- [graphiteeditor/graphite](https://awesome-repositories.com/repository/graphiteeditor-graphite.md) (24,258 ⭐) — Graphite is a node-based visual design environment that integrates vector illustration, raster image processing, and motion graphics generation into a single platform. It utilizes a functional reactive pipeline and a data-flow execution model to propagate state changes through a graph of interconnected nodes, allowing users to construct complex, automated design workflows.

The platform distinguishes itself through a context-aware evaluation engine that injects runtime metadata—such as coordinate data and loop indices—directly into the node graph. This enables the creation of procedural geomet
- [pdfarranger/pdfarranger](https://awesome-repositories.com/repository/pdfarranger-pdfarranger.md) (5,194 ⭐) — Pdfarranger is a PDF page organizer, document editor, image converter, and booklet generator. It provides a visual drag-and-drop interface to reorder, merge, split, and delete pages within PDF documents.

The application includes specialized tools for creating booklet printing layouts and converting image files into PDF pages or exporting PDF pages as PNG and JPEG images. It allows for the modification of document metadata while preserving internal outlines and hyperlinks.

The software covers a range of structural manipulations, including page rotation, resizing, cropping, and overlaying. It
- [florinpop17/app-ideas](https://awesome-repositories.com/repository/florinpop17-app-ideas.md) (95,036 ⭐) — App-ideas is a development platform that integrates autonomous AI agents into local environments to orchestrate code review, automated fix application, and workflow management. It functions as a command-line interface that connects external AI assistants to your codebase, enabling iterative development cycles through plugin-based integration and natural language triggers.

The platform distinguishes itself through a robust static analysis engine that traverses syntax trees to enforce structural coding standards and identify violations. Users can define custom review rules, architectural prefer
- [pdf-archiver/pdf-archiver](https://awesome-repositories.com/repository/pdf-archiver-pdf-archiver.md) (336 ⭐) — A tool for tagging files and archiving tasks.
- [serhack/pdf-diff](https://awesome-repositories.com/repository/serhack-pdf-diff.md) (864 ⭐) — A tool for visualizing differences between two pdf files.
- [dapperlib/dapper](https://awesome-repositories.com/repository/dapperlib-dapper.md) (18,331 ⭐) — Dapper is a lightweight object-relational mapper for .NET that functions as a high-performance data access library. It operates by extending standard database connection interfaces, allowing developers to execute raw SQL queries while automating the mapping of database results to strongly-typed objects.

The library distinguishes itself through its use of runtime code generation, which creates high-performance instructions to map database rows to object properties with minimal overhead. It provides flexible data retrieval options, supporting both memory-buffered loading for speed and row-by-ro
- [l0o0/jasminum](https://awesome-repositories.com/repository/l0o0-jasminum.md) (7,030 ⭐) — Jasminum is a Zotero plugin designed for the management of Chinese bibliographic data. It serves as a metadata integration tool that automates the extraction of publication details from the China National Knowledge Infrastructure database and provides utilities for editing PDF outlines and bookmarks directly within the reference manager.

The project focuses on Chinese academic citation standards, providing specialized tools to format and parse personal names to meet specific regional requirements. It also manages the integration of language-specific translators and citation styles sourced fro
