awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Format Conversion Toolkits · Awesome GitHub Repositories

4 repos

Awesome GitHub RepositoriesFormat Conversion Toolkits

Utilities for programmatic transformation between diverse file formats, including office suites, Markdown, and PDF.

Explore 4 awesome GitHub repositories matching content management & publishing · Format Conversion Toolkits. Refine with filters or upvote what's useful.

  1. Home
  2. Content Management & Publishing
  3. Content Processing and Transformation
  4. Document Processing and Conversion
  5. Document Processing Tools
  6. Format Conversion Toolkits

Awesome Format Conversion Toolkits GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • avelino/awesome-go

    avelino/awesome-go

    165,543GitHubView on GitHub↗

    This project serves as a comprehensive language ecosystem index, functioning as a centralized, community-curated directory for the Go programming language. It organizes a vast landscape of software components, libraries, and development tools into a structured, navigable hierarchy, enabling developers to efficiently di

    Goawesomeawesome-listgo
  • microsoft/markitdown

    microsoft/markitdown

    87,305GitHubView on GitHub↗

    This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine

    Pythonautogenautogen-extensionlangchain
  • Stirling-Tools/Stirling-PDF

    Stirling-Tools/Stirling-PDF

    74,357GitHubView on GitHub↗

    Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-

    TypeScriptdockerhacktoberfestjava
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHubView on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    C++hacktoberfestlstmmachine-learning

Explore sub-tags

  • Cloud Document ConversionCloud-based services that convert images and documents into searchable text formats through remote processing.
  • Document Conversion ToolkitsUtilities that transform diverse file formats into structured data through command-line or programmatic interfaces.
  • Office Document LibrariesSoftware libraries designed for creating and processing office document formats such as spreadsheets and presentations.
PDF Format ConvertersTools that convert PDF files to and from various other document formats.
  • PDF Generation ToolsUtilities that generate PDF documents by combining image data with hidden text layers for searchability.