# kreuzberg-dev/kreuzberg

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/kreuzberg-dev-kreuzberg).**

6,071 stars · 266 forks · HTML · mit

## Links

- GitHub: https://github.com/kreuzberg-dev/kreuzberg
- Homepage: https://kreuzberg.dev/
- awesome-repositories: https://awesome-repositories.com/repository/kreuzberg-dev-kreuzberg.md

## Topics

`document-intelligence` `elixir` `ffi` `golang` `java` `metadata-extraction` `node` `pdf-extraction` `pdfium` `php` `python` `rag` `ruby` `rust` `table-extraction` `tesseract` `text-extraction` `wasm`

## Tags

### Part of an Awesome List

- [RAG Frameworks](https://awesome-repositories.com/f/awesome-lists/ai/rag-frameworks.md) — Polyglot library for extracting text and metadata from diverse document formats.
- [Data Ingestion Pipelines](https://awesome-repositories.com/f/awesome-lists/data/data-ingestion-pipelines.md) — Polyglot library for document intelligence and extraction.
- [Document and File Processing](https://awesome-repositories.com/f/awesome-lists/data/document-and-file-processing.md) — Extracts content from various document types using a Rust core.
- [Productivity and Collaboration](https://awesome-repositories.com/f/awesome-lists/productivity/productivity-and-collaboration.md) — Extracts text, tables, and metadata from diverse document formats.
