awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Format-Specific Parsers · Awesome GitHub Repositories

5 repos

Awesome GitHub RepositoriesFormat-Specific Parsers

Libraries and utilities designed to interpret and manipulate specific file formats like PDF, Markdown, or office suites.

Explore 5 awesome GitHub repositories matching content management & publishing · Format-Specific Parsers. Refine with filters or upvote what's useful.

  1. Home
  2. Content Management & Publishing
  3. Content Processing and Transformation
  4. Document Processing and Conversion
  5. Document Processing
  6. Format-Specific Parsers

Awesome Format-Specific Parsers GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • microsoft/markitdown

    microsoft/markitdown

    87,305GitHubView on GitHub↗

    This project is an AI-powered document processing engine designed to transform diverse file formats into structured Markdown. By leveraging multimodal language models, it performs complex layout analysis and semantic text extraction, allowing for the conversion of both unstructured files and scanned images into machine

    Pythonautogenautogen-extensionlangchain
  • fffaraz/awesome-cpp

    fffaraz/awesome-cpp

    69,832GitHubView on GitHub↗

    This project is a comprehensive, curated directory of high-quality libraries, tools, and educational resources for C and C++ development. It serves as an ecosystem discovery index, helping developers navigate the vast landscape of third-party components, frameworks, and technical documentation available for the languag

    awesomeawesome-listc
  • adam-p/markdown-here

    adam-p/markdown-here

    60,151GitHubView on GitHub↗

    Markdown Here is a browser extension that enables rich text composition within web-based editors that lack native formatting support. By transforming plain text markdown syntax into rendered HTML, it allows users to draft professional emails and documents using standard markup, including headers, tables, and footnotes,

    JavaScript
  • marktext/marktext

    marktext/marktext

    53,968GitHubView on GitHub↗

    Marktext is a cross-platform desktop application designed for markdown document authoring and structured note-taking. It functions as a WYSIWYG text processor, providing a distraction-free interface that renders formatted content in real-time while hiding the underlying markup syntax. The application utilizes a multi-

    JavaScriptdark-modeeditorelectron
  • mozilla/pdf.js

    mozilla/pdf.js

    52,848GitHubView on GitHub↗

    This project is a portable document rendering engine designed to parse and display complex document layouts directly within standard web browser environments. It functions as a web-native viewer that enables the presentation of documents without requiring external software or browser plugins. The engine utilizes a can

    JavaScript

Explore sub-tags

  • JavaScript Document ParsersJavaScript-based utilities that interpret binary data to extract structured content from documents.
  • Markdown Abstract Syntax TreesParsers that transform input text into structured tree representations to facilitate efficient document transformation.
  • Markdown ConvertersUtilities that transform diverse document formats or rich text into Markdown syntax.
Office Document Parsers
Libraries designed for parsing and manipulating common office document formats such as spreadsheets and word processing files.
  • PDF LibrariesCross-platform libraries that provide functionality for parsing and manipulating PDF documents.