Self-hosted bookmarking platforms that provide visual organization and tagging for your saved web links.
ArchiveBox is a self-hosted archiving tool designed for personal digital preservation and research data management. It functions as an automated web preservation engine that monitors URL inputs from bookmarks, browser history, or manual entries to capture and store permanent, offline copies of web content. By utilizing headless browser automation, the system renders dynamic web pages to ensure that captured snapshots, PDFs, and media assets remain accurate and accessible even if the original source disappears. The project distinguishes itself through a modular extractor pipeline and a task-queue-based processing model, which allow it to handle long-running ingestion jobs reliably and at scale. It organizes all captured data into a predictable, file-system-based directory structure, ensuring that archives remain portable and accessible without the need for a dedicated database engine. This architecture supports the generation of static, self-contained archives that can be hosted on any standard web server. To maintain high fidelity across diverse web environments, the system includes configuration-driven dependency management that coordinates the necessary browser binaries and command-line tools. The platform provides a comprehensive suite of command-line interfaces, configuration options, and core modules to support operational management and integration. Detailed documentation is available to guide users through installation, dependency maintenance, and the security considerations of managing archived web content.
ArchiveBox is a powerful self-hosted tool for preserving and archiving web content, though it focuses more on deep digital preservation and offline snapshots than on the visual organization and management features typical of a standard bookmark manager.
Linkwarden is a self-hosted bookmark manager and web archiving platform designed to preserve permanent copies of online content. It functions as a centralized repository where users can capture, store, and organize web pages to ensure they remain accessible even if the original source is removed. The platform distinguishes itself through its focus on collaborative knowledge management and multi-platform capture. It enables teams to curate shared collections, apply custom tags, and annotate saved resources within a unified workspace. Users can integrate the service into their daily workflows via browser extensions and mobile device sharing, allowing for the direct archiving of links from various environments. The system provides a comprehensive suite of organization and administrative tools, including folder-based grouping, role-based access control, and programmatic management through a secure API. It supports scalable storage and user seat management, ensuring that both individual researchers and teams can maintain structured, searchable libraries of web-based information.
Linkwarden is a self-hosted bookmark manager that provides visual organization, tagging, full-text search, browser extensions, and robust web archiving, making it a comprehensive solution for your requirements.
Karakeep is a self-hosted, open-source platform designed for personal knowledge management and web content archiving. It functions as a centralized repository where users can capture, organize, and preserve bookmarks, notes, and media files, ensuring long-term access to digital information even if original sources are removed or modified. The system distinguishes itself through its automated content processing and security-focused architecture. It utilizes headless browser crawling and optical character recognition to ingest and index web content, while a modular artificial intelligence pipeline automatically generates summaries and metadata for saved items. To maintain privacy and security, the platform supports single sign-on authentication and includes robust network controls, such as proxy-based crawling and request forgery prevention, to protect internal infrastructure during automated tasks. Beyond core archival capabilities, the platform provides extensive tools for library maintenance and data portability. Users can manage their collections through a command-line interface, synchronize content across devices, and integrate external data sources like RSS feeds. The system also facilitates collaboration through shared collections and public link generation, while offering a comprehensive programmatic interface that allows external applications to interact with stored data via webhooks and authenticated requests. The application is designed for containerized deployment, providing a unified environment for managing services, database migrations, and external storage backends.
Karakeep is a self-hosted platform that provides comprehensive bookmark management, including automated archiving, full-text indexing, tagging, and a robust API for external integrations.
ArchiveBox is a self-hosted web archiving system designed to capture and preserve permanent static copies of webpages, media, and PDFs on personal infrastructure. It functions as a digital content curator and personal web archive manager, allowing users to import URLs from bookmarks, RSS feeds, and browser history to create a centralized, searchable knowledge base. The project is distinguished by its ability to archive private, paywalled, or login-protected content using browser cookies and authenticated session persistence. It ensures long-term availability by saving pages in multiple concurrent formats, including HTML, PDF, and PNG, and can automatically mirror these local snapshots to external preservation services. The system includes capabilities for multimedia asset extraction, full-text archive indexing, and scheduled content mirroring. Users can manage their collections through a web-based interface, a command-line interface, or a remote API, with options to export the entire collection as a standalone static HTML site for offline browsing.
ArchiveBox is a self-hosted web archiving system that functions as a powerful bookmark manager by capturing, indexing, and organizing web content with full-text search and API support.
linkding is a self-hosted bookmark manager designed for saving, organizing, and retrieving web links. It functions as a centralized, private repository for personal link collections, featuring multi-user support and authentication to manage access and shared bookmarks. The project distinguishes itself through a webpage archive tool that prevents link rot by saving local HTML snapshots of bookmarked sites. It includes a programmable API for integrating third-party scripts and external applications, as well as a Netscape HTML importer to migrate bookmark libraries from other services. The system provides automated metadata extraction for page titles and descriptions, bulk editing tools for managing collections, and browser integration that enables adding links and searching archived content via the address bar. Cloud deployment templates are available to assist with installing the application on various infrastructure platforms.
linkding is a self-hosted bookmark manager that provides essential features like tagging, full-text search, browser extensions, and automated webpage archiving, making it a comprehensive solution for your requirements.
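The Netscape HTML format mentioned in the linkding entry is a plain HTML file in which each bookmark is an anchor element. The sketch below shows roughly what reading such a file involves, using only the Python standard library. It is not linkding's importer: the class and function names are hypothetical, and the TAGS attribute is a common extension that not every exporter writes.

```python
from html.parser import HTMLParser


class NetscapeBookmarkParser(HTMLParser):
    """Collect <A> entries from a Netscape-format bookmark export."""

    def __init__(self):
        super().__init__()
        self.bookmarks = []
        self._current = None  # bookmark whose <A> tag is currently open

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "a":
            attr = dict(attrs)
            raw_tags = attr.get("tags") or ""
            self._current = {
                "url": attr.get("href") or "",
                "title": "",
                # Assumption: tags are comma-separated in a TAGS attribute.
                "tags": [t.strip() for t in raw_tags.split(",") if t.strip()],
            }

    def handle_data(self, data):
        # Text between <A> and </A> is the bookmark title.
        if self._current is not None:
            self._current["title"] += data

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self._current["title"] = self._current["title"].strip()
            self.bookmarks.append(self._current)
            self._current = None


def parse_bookmarks(html_text):
    """Return a list of {'url', 'title', 'tags'} dicts."""
    parser = NetscapeBookmarkParser()
    parser.feed(html_text)
    parser.close()
    return parser.bookmarks


SAMPLE = """<!DOCTYPE NETSCAPE-Bookmark-file-1>
<TITLE>Bookmarks</TITLE>
<H1>Bookmarks</H1>
<DL><p>
    <DT><A HREF="https://example.com/" ADD_DATE="1700000000" TAGS="reference,docs">Example Domain</A>
    <DT><A HREF="https://example.org/">Untagged link</A>
</DL><p>
"""

if __name__ == "__main__":
    for entry in parse_bookmarks(SAMPLE):
        print(entry)
```

Because the format is ordinary HTML, the same export file can usually be moved between several of the managers listed here, which is what makes it useful for migration.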
Shiori is a self-hosted bookmark manager and webpage archiving tool. Written in Go, it functions as a backend service that allows users to save, organize, and search for web links while maintaining a private collection of online resources. The system ensures content availability by creating offline copies of saved pages, preventing data loss if the original source is removed. It is distributed as a containerized application to provide consistent installation and deployment across different operating systems. The software provides a dual-interface access model, featuring both a web-based management dashboard and a command-line interface. It supports the migration of existing link collections through external data import pipelines and persists all metadata and bookmarks using relational database engines.
Shiori is a self-hosted bookmark manager that provides the requested visual organization, tagging, full-text search, and webpage archiving capabilities in a containerized package.
Buku is a personal bookmark manager that provides a command line interface, a portable bookmark database, and a self-hosted server for organizing web links. It functions as a command line knowledge base for saving, tagging, and searching web resources. The system features a portable, mergeable database that supports AES-256 encryption and is designed for cross-device data synchronization. It includes a RESTful API and a self-hosted web interface, allowing users to manage their collection via a browser or programmatic requests. Capabilities include automatic metadata fetching to populate page titles and descriptions, link health auditing to detect dead links, and tag-based organization with logical filtering. The tool supports importing and exporting data in multiple formats, including HTML, Markdown, and XBEL. The project is written in Python and can be run within virtual environments to isolate dependencies.
Buku is a powerful, self-hostable bookmark manager that supports tagging, full-text search, archiving, and API access, though it prioritizes a command-line-first workflow over a visual-heavy interface.
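Tag-based organization with logical filtering, as described for Buku, comes down to set operations over each bookmark's tags. The following is a minimal sketch of that idea and not Buku's own code or query syntax. The function name, parameter names, and record layout are all assumptions made for illustration.

```python
def filter_by_tags(bookmarks, required=(), any_of=(), excluded=()):
    """Return bookmarks that carry every tag in `required`,
    at least one tag in `any_of` (if given), and no tag in `excluded`."""
    required, any_of, excluded = set(required), set(any_of), set(excluded)
    matches = []
    for bookmark in bookmarks:
        tags = set(bookmark["tags"])
        if not required <= tags:        # AND: all required tags present
            continue
        if any_of and not (any_of & tags):  # OR: at least one listed tag
            continue
        if excluded & tags:             # NOT: none of the excluded tags
            continue
        matches.append(bookmark)
    return matches


# Hypothetical records; a real database would supply these.
LIBRARY = [
    {"url": "https://example.com/a", "tags": ["python", "tutorial"]},
    {"url": "https://example.com/b", "tags": ["python", "news"]},
    {"url": "https://example.com/c", "tags": ["go", "tutorial"]},
]

if __name__ == "__main__":
    for hit in filter_by_tags(LIBRARY, required=["python"], excluded=["news"]):
        print(hit["url"])
```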
TagSpaces is an offline-first file tagging and organization platform that lets you manage local files with portable metadata stored directly in filenames or sidecar JSON files, eliminating the need for a central database. It functions as a full-text file search engine, a Kanban board file organizer, a local AI file assistant, an S3-compatible cloud file manager, and a web clipper and bookmark manager, all within a single application. The project distinguishes itself through a local-first architecture where all file operations, indexing, and AI processing run entirely on the device, with cloud storage treated as an optional remote mount point. It integrates with a locally running Ollama engine for on-device AI tasks such as automatic tagging, summarization, and image analysis, keeping all data private. A plugin-based file viewer system renders over 50 file formats, while metadata is stored in sidecar files or embedded in filenames, ensuring portability across devices and sync services. Beyond its core identity, TagSpaces offers a command-line interface for programmatic file operations and search indexing, supports S3-compatible object storage and WebDAV servers for remote file management, and provides a browser extension for capturing web pages, screenshots, and bookmarks as local files with automatic tagging. The application includes built-in viewers and editors for documents, images, audio, video, 3D models, and Markdown files, along with geo-tagging on interactive maps, Kanban board task management, and full-text search with fuzzy matching and saved queries. The application can be installed on Windows, macOS, and Linux, run in portable mode, or self-hosted as a static web app on personal servers or cloud platforms like Cloudflare Pages and AWS Amplify.
TagSpaces is a local-first file and bookmark management platform that supports visual organization, tagging, and web clipping, though it functions primarily as a file-based organizer rather than a dedicated bookmark-only database.
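Storing tags directly in filenames, as the TagSpaces entry describes, can be illustrated with a short sketch. It assumes a convention in which tags sit in a square-bracketed, space-separated group just before the file extension, for example `article[web todo].html`. That convention, and the helper names below, are assumptions for illustration, so check the TagSpaces documentation before relying on the exact syntax.

```python
import re
from pathlib import PurePath

# Assumed convention: "title[tag1 tag2].ext", with the tag group last in the stem.
TAG_GROUP = re.compile(r"\[([^\[\]]*)\]$")


def split_filename_tags(filename):
    """Split a bare filename into (title, tags, extension)."""
    path = PurePath(filename)
    stem, suffix = path.stem, path.suffix
    match = TAG_GROUP.search(stem)
    if not match:
        return stem, [], suffix
    tags = match.group(1).split()
    return stem[: match.start()].rstrip(), tags, suffix


def add_tag(filename, tag):
    """Return a new bare filename with `tag` added (no change if already present)."""
    title, tags, suffix = split_filename_tags(filename)
    if tag not in tags:
        tags.append(tag)
    return f"{title}[{' '.join(tags)}]{suffix}"


if __name__ == "__main__":
    print(split_filename_tags("article[web todo].html"))
    print(add_tag("notes.md", "inbox"))
```

The appeal of this approach is that the metadata travels with the file through any sync service or file copy, at the cost of renaming the file whenever its tags change.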
Markdownload is a browser extension that functions as a markdown web clipper, converting webpages and selected text into clean markdown files for offline storage and archiving. It operates as a content extractor that isolates the main document from the page while removing navigation elements and advertisements. The tool includes a template generator for injecting dynamic front-matter and metadata into documents via user-defined placeholders. It also serves as a local media downloader that saves remote images to the filesystem and updates links to reference those local files. Additionally, it acts as an integration tool to transfer captured web data and metadata directly into Obsidian vaults using custom URI schemes. The extension supports capturing content from all open browser tabs simultaneously and clipping specific highlighted text. Users can customize markdown styling for links and images, organize downloaded files into specific subfolders, and export media as formatted embeds or hyperlinks to the system clipboard.
Markdownload is a browser-based web clipper designed to extract and save content as local markdown files, rather than a centralized bookmark management application with a visual interface for organizing and searching saved links.
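The template mechanism described for Markdownload, where user-defined placeholders are filled with page metadata to produce front-matter, can be sketched as a simple substitution. The `{name}` placeholder syntax, the field names, and the function below are assumptions for illustration and are not taken from the extension's source.

```python
import re

# Assumed placeholder syntax: a word wrapped in braces, e.g. {title}.
PLACEHOLDER = re.compile(r"\{(\w+)\}")


def render_template(template, metadata):
    """Replace {name} placeholders with metadata values.
    Placeholders with no matching key are left untouched."""

    def substitute(match):
        key = match.group(1)
        return str(metadata[key]) if key in metadata else match.group(0)

    return PLACEHOLDER.sub(substitute, template)


FRONT_MATTER = """---
title: {title}
source: {url}
clipped: {date}
---
"""

# Hypothetical metadata for one clipped page.
PAGE = {"title": "Example Domain", "url": "https://example.com/", "date": "2024-01-15"}

if __name__ == "__main__":
    print(render_template(FRONT_MATTER, PAGE))
```

Leaving unknown placeholders in place, rather than raising an error, makes a mistyped field name visible in the output file instead of silently dropping it.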