# quivrhq/megaparse

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/quivrhq-megaparse).**

7,389 stars · 423 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/quivrhq/megaparse
- Homepage: https://megaparse.com
- awesome-repositories: https://awesome-repositories.com/repository/quivrhq-megaparse.md

## Description

File parser optimised for LLM ingestion with no information loss 🧠 Parses PDF, DOCX, and PPTX files into a format that is ideal for LLMs.

## Tags

### Part of an Awesome List

- [Data Extraction And Generation](https://awesome-repositories.com/f/awesome-lists/ai/data-extraction-and-generation.md) — Universal parser for various document types.
- [Data Preprocessing](https://awesome-repositories.com/f/awesome-lists/data/data-preprocessing.md) — Universal parser designed to minimize information loss during document ingestion.
- [Document Parsing and Extraction](https://awesome-repositories.com/f/awesome-lists/data/document-parsing-and-extraction.md) — File parser optimized for high-fidelity LLM ingestion.
