awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Web Archiving Utilities · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesWeb Archiving Utilities

Tools designed to ingest, capture, and store web content for offline research and reference.

Distinguishing note: Focuses on the ingestion and automated capture workflow for web content rather than general content management.

Explore 1 awesome GitHub repository matching content management & publishing · Web Archiving Utilities. Refine with filters or upvote what's useful.

  1. Home
  2. Content Management & Publishing
  3. Web Archiving Utilities

Awesome Web Archiving Utilities GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • ArchiveBox/ArchiveBox

    ArchiveBox/ArchiveBox

    26,876View on GitHub↗

    ArchiveBox is a self-hosted archiving tool designed for personal digital preservation and research data management. It functions as an automated web preservation engine that monitors URL inputs from bookmarks, browser history, or manual entries to capture and store permanent, offline copies of web content. By utilizing headless browser automation, the system renders dynamic web pages to ensure that captured snapshots, PDFs, and media assets remain accurate and accessible even if the original source disappears. The project distinguishes itself through a modular extractor pipeline and a task-qu

    Collect URLs from browser history, bookmarks, and manual inputs to trigger the automated process of capturing and saving web pages for future offline viewing and research.

    Pythonarchiveboxbackupsbookmark-archiver
    26,876View on GitHub↗