Extract tables from PDF files
The main features of tabulapdf/tabula-java are: PDF Processing.
Open-source alternatives to tabulapdf/tabula-java include: py-pdf/pypdf — pypdf is a Python library for parsing, manipulating, and generating PDF documents. It provides high-level operations… zotero/zotero — Zotero is reference management software designed for collecting, organizing, and citing bibliographic research sources… dynamicreports/dynamicreports — Java reporting library for creating dynamic report designs at runtime. flyingsaucerproject/flyingsaucer — XML/XHTML and CSS 2.1 renderer in pure Java. librepdf/openpdf — OpenPDF is a Java library and document processor used for creating, editing, rendering, and encrypting PDF documents.… openhtmltopdf/openhtmltopdf — An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDF-BOX 3. With SVG image support. Now also with…
pypdf is a Python library for parsing, manipulating, and generating PDF documents. It provides high-level operations for document processing, such as merging multiple files into one or splitting a single document into smaller files. The project includes specialized tools for managing interactive elements, including the creation and modification of annotations, hyperlinks, and form fields. It also supports advanced metadata management, allowing for the extraction and modification of standard document properties and XML-based XMP metadata. Beyond basic structural changes, the library covers pa
Zotero is reference management software designed for collecting, organizing, and citing bibliographic research sources and digital documents for academic work. It functions as a web bibliographic collector, a citation generator, and a collaborative research platform. The system integrates tools for capturing metadata and archiving web pages into a centralized research library. It provides a specialized environment for reading and marking up PDF and EPUB files with highlights and notes linked directly to research sources. The software covers a broad range of capabilities including bibliograph
Java reporting library for creating dynamic report designs at runtime
XML/XHTML and CSS 2.1 renderer in pure Java