awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Document Lifecycle and Retrieval · Awesome GitHub Repositories

2 repos

Infrastructure for fetching documents over the network, loading them asynchronously, and ingesting their content into downstream systems.

Two GitHub repositories are listed under Content Management & Publishing · Document Lifecycle and Retrieval.

Awesome Document Lifecycle and Retrieval GitHub Repositories

  • zylon-ai/private-gpt

    57,116 · GitHub

    This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov

    Python
  • mozilla/pdf.js

    52,848 · GitHub

    This project is a portable document rendering engine designed to parse and display complex document layouts directly within standard web browser environments. It functions as a web-native viewer that enables the presentation of documents without requiring external software or browser plugins. The engine utilizes a can

    JavaScript

Explore sub-tags

  • Asynchronous Data Fetching: Mechanisms that download specific document segments on demand to enable immediate viewing without full file retrieval.
  • Remote Document Fetchers: Services that retrieve external document files from network locations for conversion into accessible formats.
  • Text Ingestion Services: Systems that process raw text chunks to create searchable document representations for retrieval.
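
As a rough illustration of the Asynchronous Data Fetching sub-tag above: downloading document segments on demand is commonly done with HTTP range requests, where the client asks for an inclusive byte span through a `Range: bytes=start-end` header. The sketch below is not taken from either listed repository. The helper names `byte_ranges` and `range_header` and the chunk sizes are hypothetical, chosen only for illustration, and no network call is made.

```python
from typing import Dict, Iterator, Tuple


def byte_ranges(total_size: int, chunk_size: int) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (start, end) byte offsets that cover total_size bytes.

    Hypothetical helper: a real viewer would usually request only the
    segments it needs next rather than walking the whole file in order.
    """
    if total_size < 0 or chunk_size <= 0:
        raise ValueError("total_size must be >= 0 and chunk_size must be > 0")
    start = 0
    while start < total_size:
        # HTTP byte ranges are inclusive on both ends, hence the -1.
        end = min(start + chunk_size, total_size) - 1
        yield start, end
        start = end + 1


def range_header(start: int, end: int) -> Dict[str, str]:
    """Build the request header for one inclusive byte span."""
    return {"Range": f"bytes={start}-{end}"}


if __name__ == "__main__":
    # Assumed sizes for illustration: a 10-byte document fetched 4 bytes at a time.
    for start, end in byte_ranges(10, 4):
        print(range_header(start, end))
```

A client would attach each header to a GET request and expect a `206 Partial Content` response when the server supports ranges. Servers that ignore the header may return the full file with `200 OK`, so a caller should handle both cases.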