PyPDF2 is a pure-Python library for reading, writing, and manipulating PDF files. It can restructure documents, extract text, and apply encryption, all without relying on external C libraries or native binaries.
The library provides tools for changing document structure: merging multiple files into one, splitting a document into separate files, and cropping pages to change their visible area. It can also protect documents with passwords and encryption.
PyPDF2 can also extract text and metadata from a document's pages and information dictionary. In addition, it supports editing document properties such as title and author, and working with annotations, including highlights.