awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Web Archiving Tools · Awesome GitHub Repositories

2 repos


Systems for creating persistent visual records of interactive web content.

Distinguishing note: Focuses on visual persistence for historical or offline purposes.

Two GitHub repositories are listed under Content Management & Publishing · Web Archiving Tools.


Awesome Web Archiving Tools GitHub Repositories

  • niklasvh/html2canvas

    31,796 · View on GitHub

    This project is a browser-based rendering engine that captures visual snapshots of web page elements. It works as a DOM-to-canvas renderer, reconstructing the visual appearance of web content by interpreting the CSS box model and document structure directly in the client. All image generation happens locally, so no server-side processing or external rendering service is needed. By simulating browser layout logic and mapping geometric shapes and text properties to pixel-based drawing commands, it

    Creates persistent visual snapshots of interactive web elements for historical record keeping and offline viewing.

    TypeScript · dom · javascript · screenshot
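The idea described in the html2canvas entry, mapping box geometry and text properties to drawing commands, can be illustrated with a toy sketch. This is not html2canvas code (that project is written in TypeScript and handles far more of CSS); `Box` and `to_draw_commands` are hypothetical names invented for this illustration, and the command tuples only loosely imitate canvas calls.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Box:
    """Hypothetical, heavily simplified stand-in for a laid-out page element."""
    x: int                      # position relative to the parent box
    y: int
    width: int
    height: int
    background: str = "transparent"
    text: str = ""
    children: list[Box] = field(default_factory=list)


def to_draw_commands(box: Box, offset_x: int = 0, offset_y: int = 0) -> list[tuple]:
    """Walk the box tree depth-first and emit canvas-style drawing commands.

    Child coordinates are relative to their parent, so offsets accumulate
    on the way down the tree.
    """
    left, top = offset_x + box.x, offset_y + box.y
    commands: list[tuple] = []
    if box.background != "transparent":
        commands.append(("fill_rect", left, top, box.width, box.height, box.background))
    if box.text:
        commands.append(("fill_text", box.text, left, top))
    for child in box.children:
        commands.extend(to_draw_commands(child, left, top))
    return commands
```

Parents are emitted before their children, so a background is painted first and nested text lands on top of it, which is the ordering a painter-style renderer needs.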
  • ArchiveBox/ArchiveBox

    26,876 · View on GitHub

    ArchiveBox is a self-hosted archiving tool designed for personal digital preservation and research data management. It functions as an automated web preservation engine that monitors URL inputs from bookmarks, browser history, or manual entries to capture and store permanent, offline copies of web content. By utilizing headless browser automation, the system renders dynamic web pages to ensure that captured snapshots, PDFs, and media assets remain accurate and accessible even if the original source disappears. The project distinguishes itself through a modular extractor pipeline and a task-qu

    A local server application that captures and preserves web content into multiple portable formats for long-term offline access and research.

    Python · archivebox · backups · bookmark-archiver
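As a rough illustration of feeding a manual URL entry to ArchiveBox, the sketch below wraps its `archivebox add` command from Python. It assumes the `archivebox` CLI is installed and that `data_dir` is a collection folder already set up with `archivebox init`; the helper names `build_add_command` and `archive_url` are hypothetical and not part of ArchiveBox.

```python
import shutil
import subprocess


def build_add_command(url: str) -> list[str]:
    """Build the argv for `archivebox add <url>` (helper name is hypothetical)."""
    if not url.startswith(("http://", "https://")):
        raise ValueError(f"expected an http(s) URL, got: {url!r}")
    return ["archivebox", "add", url]


def archive_url(url: str, data_dir: str) -> subprocess.CompletedProcess:
    """Run `archivebox add` inside an existing ArchiveBox data directory.

    Assumption: `data_dir` was initialised beforehand with `archivebox init`.
    """
    if shutil.which("archivebox") is None:
        raise RuntimeError("the archivebox CLI was not found on PATH")
    # check=True raises CalledProcessError if ArchiveBox exits with an error.
    return subprocess.run(build_add_command(url), cwd=data_dir, check=True)
```

Keeping command construction separate from execution makes the validation easy to check without ArchiveBox installed, for example `build_add_command("https://example.com")`.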