These open-source platforms provide website traffic insights while ensuring user data remains private and untracked.
This project is an open-source, privacy-focused web analytics platform designed for high-throughput data ingestion and multi-tenant data management. It provides a cookie-less tracking engine that captures visitor interactions using ephemeral request metadata, ensuring comprehensive traffic visibility while maintaining strict privacy standards. The architecture utilizes an event-driven ingestion pipeline and aggregated metric storage to decouple data collection from processing, enabling efficient long-term retrieval and responsive dashboard performance. What distinguishes this platform is its emphasis on first-party data collection and proxy-based routing. By allowing tracking requests to be routed through a custom domain, the system effectively masks analytics traffic as internal requests, bypassing ad-blocking software and privacy filters that typically interfere with client-side scripts. This approach, combined with server-side event processing, ensures that site owners maintain accurate traffic data even when browser-based limitations are present. The platform offers a broad capability surface for managing complex organizational needs, including granular role-based access control, SAML-based single sign-on, and automated reporting workflows. Users can programmatically manage site configurations, integrate external data sources, and export raw event logs for deep analysis in third-party business intelligence tools. The system also supports advanced conversion funnel tracking, allowing teams to define and measure specific user journeys and revenue-generating actions across multiple websites from a centralized dashboard.
Umami is a self-hosted, privacy-focused web analytics platform designed to provide full control over infrastructure and user data. It captures website traffic and visitor behavior through anonymous tracking methods that avoid cookies, browser fingerprinting, and the storage of personally identifiable information. The platform distinguishes itself through a comprehensive suite of behavioral analysis tools, including session replays, heatmaps, and cohort-based retention reporting. It features a multi-tenant architecture that allows teams to manage multiple websites within a single, collaborative dashboard, supported by granular role-based access controls and the ability to share specific insights via public links. Beyond core traffic monitoring, the system includes a robust event tracking framework for capturing custom user interactions, conversion funnels, and marketing campaign attribution. It also provides diagnostic capabilities for web performance, allowing users to track core web vitals and troubleshoot data collection through detailed session logs and visitor activity searches. The software supports flexible deployment strategies, including containerized installations and source-code-based setups, and can be integrated into various environments via a standard API or pre-built plugins.
vibe-vibe is an LLM agent engineering framework and toolchain optimizer designed for orchestrating multi-agent systems. It serves as a comprehensive guide and methodology for transforming conceptual ideas into deployed applications through agentic software engineering. The project focuses on the orchestration of specialized AI agent roles with defined collaboration boundaries and iterative feedback loops. It provides frameworks for toolchain optimization, including the selection and evaluation of protocols that extend model capabilities and the design of standardized tool interfaces. The system covers a broad range of capabilities, including agent architecture design, prompt engineering workflows, and the management of the AI product development lifecycle. It also addresses technical implementation areas such as API integration, containerized deployment, vector-embedding memory, and security boundary design for agent systems. The project includes an AI software development course and a product development guide to facilitate the transition from traditional programming to AI-assisted engineering.
Fathom is a privacy-focused website analytics server written in Go. It monitors website traffic and page views without collecting personal data or using intrusive cookies, providing a self-hosted alternative for traffic monitoring. The system utilizes a Preact-based dashboard interface for visualizing traffic patterns and reports. Data is persisted in a SQL database analytics store, with support for MySQL, PostgreSQL, and SQLite. The project covers the collection of visitor data via lightweight tracking snippets and the management of that data through a pluggable storage layer. It includes mechanisms for filtering referrer spam by blocking known fraudulent domains.
Ungoogled Chromium is a desktop web browser derived from the open-source Chromium codebase, modified to remove all background communication with external services and proprietary dependencies. It functions as a privacy-focused distribution that ensures user data remains local by eliminating telemetry hooks and data collection integrations. The project distinguishes itself through extensive source-code pruning and domain-substitution patching, which replace hardcoded service URLs with non-functional placeholders to prevent unauthorized data transmission. It further hardens the browser runtime by stripping out non-essential binary components and applying binary-level instrumentation to disable automatic updates that would otherwise restore removed tracking features. Beyond these core privacy modifications, the browser provides a customizable environment where users can tailor behavior and search preferences through command-line configuration and custom overrides. This approach reduces the overall attack surface and removes software bloat, resulting in a minimalist distribution that prioritizes transparency and user control over browser functionality.
This project is a serverless service that generates dynamic, themeable visual summaries of software development activity. It functions as an automated metadata visualizer, transforming raw platform logs and repository metrics into resolution-independent vector graphics that can be embedded directly into markdown environments. The service distinguishes itself by offering highly configurable, query-parameter-driven rendering that allows users to customize the visual presentation of their coding patterns, language proficiency, and repository details. It supports both real-time generation via serverless functions and the creation of static image files through automated workflows, providing flexibility in how data is fetched and displayed. The platform aggregates disparate data points from multiple sources to provide comprehensive insights into development habits and project metadata. Users can deploy private instances of the service to maintain full control over caching strategies, authentication tokens, and rate limit management.
Matomo is a self-hosted web analytics platform designed to track user behavior and website performance while maintaining full data ownership. It functions as a comprehensive analytics suite that captures visitor interactions and processes raw tracking logs into structured metrics, providing organizations with a centralized system for monitoring traffic patterns and engagement. The platform distinguishes itself through a strong emphasis on privacy and modularity. It includes built-in tools to anonymize visitor information and manage user consent, ensuring compliance with global data protection standards. Its architecture is built on a plugin-based system, allowing users to extend core functionality through independent modules that integrate directly into the application lifecycle. Beyond core tracking, the software serves as a marketing tag manager and a business intelligence reporter. It enables the dynamic injection of third-party scripts and marketing tags based on configurable triggers, eliminating the need for manual source code modifications. Users can aggregate complex datasets into custom dashboards and automated summaries, while also importing or exporting data to maintain a unified view across external platforms. The system is managed through a web-based interface and supports role-based access control to restrict data visibility and permissions. It is designed for deployment on local infrastructure, utilizing relational database storage to organize high-volume analytics data for historical trend analysis.
PostHog is a comprehensive product analytics and feature management platform designed to capture, process, and visualize user behavior data. It provides a unified suite for tracking application events, managing feature rollouts, and monitoring system health through session recordings and error tracking. By leveraging a columnar-storage-optimized architecture, the platform enables high-performance aggregation and filtering across massive event datasets. What distinguishes PostHog is its integrated approach to data pipelines and application control. It features a robust event ingestion system that supports custom transformation logic through sandboxed scripting, allowing for real-time data manipulation before storage. The platform also includes a sophisticated feature flagging service that supports multivariate testing and dynamic configuration across web and mobile environments, alongside automated anomaly detection and alerting engines that monitor data streams for performance shifts. The platform covers a broad observability surface, including application performance monitoring, qualitative user feedback collection via targeted surveys, and detailed activity auditing. It provides extensive administrative controls, such as granular access management and secure proxy infrastructure, to ensure reliable data collection and compliance. Developers can interact with the platform through a documented API that supports authenticated access, rate limiting, and efficient result pagination.
Ackee is a self-hosted web analytics platform designed for tracking website traffic and visitor behavior. It functions as a privacy-first visitor tracker that allows for the collection of engagement metrics without relying on third-party cloud providers. The platform ensures data ownership through a self-hosted deployment model. It includes an analytics data API that provides a queryable interface for fetching detailed visitor data to create custom reports and external visualizations. The system covers web traffic analysis and privacy-focused user tracking. It supports the generation of tailored analytics reporting by translating site visitor patterns into structured data.
uBlock is a browser-based content blocker that functions as a declarative filtering engine to intercept network requests and modify web page content. It operates by parsing standardized filter lists into optimized data structures, allowing it to block network hosts, enforce security policies, and prevent unauthorized data transmission. The extension provides a comprehensive security layer that monitors outgoing traffic and disables intrusive browser features to enhance user privacy. What distinguishes this project is its granular control over filtering behavior through a dynamic rule orchestrator. Users can manage custom rules, apply site-specific overrides, and toggle filtering settings on a per-domain basis. The engine also employs advanced techniques such as CNAME uncloaking, IP address filtering, and response body modification to identify and neutralize trackers that attempt to bypass standard blocking methods. Furthermore, it supports enterprise-grade deployment, enabling organizations to enforce consistent security and filtering configurations across managed environments. The project covers a broad capability surface including cosmetic page modification, which uses CSS injection and sandboxed scriptlets to remove visual clutter and neutralize anti-blocking scripts. It also provides interactive tools for real-time network traffic inspection and manual element removal, ensuring users can debug and customize their browsing experience. The extension is designed to maintain high performance by synchronizing its initialization at startup, ensuring that all security rules are active before any network requests are processed.
CyberChef is a web-based application designed for performing complex data encoding, decoding, encryption, and analysis tasks. It provides a visual interface where users construct data transformation pipelines by chaining modular operations together, allowing raw input to be processed into a desired output format entirely within the local browser environment. The tool functions as a client-side cryptographic workbench, ensuring that all data processing logic remains local to the user's machine to maintain privacy and eliminate server-side overhead. By utilizing functional pipeline composition and state-driven synchronization, the application automatically updates its output as users modify their transformation sequences. To maintain responsiveness during intensive operations, the platform employs background thread processing and loads transformation modules on demand. The suite covers a broad range of capabilities for security incident investigation, cryptographic protocol analysis, and automated data parsing. Users can serialize their entire transformation state into a URL fragment, facilitating the sharing of complex workflows for security tool prototyping and technical research.
Rybbit is an open-source, self-hosted web analytics platform designed for comprehensive user behavior tracking and product engagement analysis. It provides a complete suite for monitoring visitor interactions, conversion funnels, and site performance, allowing organizations to maintain full ownership of their data and infrastructure. The platform distinguishes itself through a strong emphasis on privacy-compliant data collection and visual session replay capabilities. It supports advanced traffic routing through custom domains to bypass ad blockers and includes configurable masking tools to protect sensitive user information during session recordings. By linking anonymous activity to persistent user profiles, it enables accurate cross-device analysis and detailed cohort segmentation. Beyond core tracking, the system offers extensive tools for event-driven data pipelines, including custom event logging, e-commerce transaction monitoring, and automated error tracking. It features robust administrative controls, such as role-based access management, team collaboration workflows, and granular data retention policies. The platform is built for flexible deployment, utilizing containerized orchestration to simplify maintenance and updates in private server environments.
This project is a comprehensive software observability suite and application performance monitoring platform designed to track runtime errors, performance bottlenecks, and system health. It functions as a centralized diagnostic service that aggregates and categorizes exceptions, providing the infrastructure necessary to visualize complex execution paths across distributed systems and microservices. The platform distinguishes itself through a high-throughput distributed event ingestion pipeline and a columnar storage analytics engine that enables rapid aggregation of large-scale performance metrics. It utilizes runtime-level instrumentation hooks to capture execution data directly from the host environment and employs symbolication-based stack trace resolution to map minified code or raw memory addresses back to original source files. Furthermore, the system includes specialized capabilities for monitoring the operational performance of AI agents and ensuring sensitive data compliance through schema-driven scrubbing of incoming event payloads. Beyond core error tracking and tracing, the platform supports a wide range of programming languages and frameworks, allowing for consistent visibility across diverse software architectures. It integrates with external services to automate incident response workflows and provides a command-line interface for managing releases, debug symbols, and project configurations. The system also features a modular, plugin-based architecture that facilitates connectivity with third-party tools for issue tracking and alerting.
Faker is a Python library designed to generate realistic synthetic data for software testing, database prototyping, and privacy-preserving anonymization. It provides a comprehensive suite of tools to create diverse information types, including personal identities, financial records, geographic locations, and technical system metadata, allowing developers to populate environments with mock data that mimics real-world structures. The library is built on a modular provider architecture that supports dynamic method dispatch, enabling users to extend functionality by registering custom data generation logic. To ensure consistency across testing workflows, it features deterministic seeding for repeatable output and stateful uniqueness tracking to prevent duplicate entries within a session. Furthermore, the system is locale-aware, allowing for the generation of data that adheres to specific regional formats, languages, and cultural conventions. Beyond its core generation capabilities, the library includes utilities for integrating synthetic data into automated test suites, such as performance toggles for high-volume generation and fixture-based injection. It covers a broad spectrum of domains, ranging from business and media content to complex network and automotive identifiers, providing a flexible framework for simulating varied user environments and system requirements.
This project is an open-source desktop web browser built on the Gecko rendering engine. It is designed to prioritize user privacy and security, utilizing a multi-process architecture to isolate web content and maintain a secure sandbox environment for all browsing activities. The browser distinguishes itself through a highly modular interface engine that allows users to customize visual layouts and functional behaviors using style sheets and community-developed modifications. It supports advanced productivity workflows by enabling users to synchronize navigation state across multiple windows, organize tabs into distinct workspaces, and utilize split-view layouts for efficient multitasking. Beyond its core interface capabilities, the browser provides a comprehensive suite of security and privacy protections. This includes automated tracker blocking, encrypted domain name resolution, and strict enforcement of secure connection protocols to prevent unauthorized data collection and mitigate threats from malicious websites. The application also features an extensible architecture that supports third-party modules, allowing users to integrate specialized tools for enhanced navigation and media management.
This repository serves as a comprehensive research platform and toolkit for advancing machine learning, quantum computing, and large-scale scientific data analysis. It provides foundational frameworks for developing complex algorithmic systems, offering the necessary infrastructure for distributed training, computational graph execution, and high-performance model development. The project distinguishes itself by integrating specialized research domains with robust, privacy-preserving methodologies. It supports diverse scientific discovery through tools for quantum simulation, physics-informed neural modeling, and secure data aggregation. Beyond core machine learning, the platform facilitates advanced research in fields such as genomics, environmental forecasting, and clinical health diagnostics, enabling researchers to apply deep learning to complex, real-world datasets. The repository encompasses a broad capability surface, including automated research tooling, natural language processing, and machine perception. It provides infrastructure for monitoring model performance, benchmarking factuality, and ensuring responsible artificial intelligence through fairness and robustness evaluations. These tools are designed to support experimental workflows, from hypothesis generation and scientific code synthesis to the deployment of energy-efficient models on edge hardware.
Glance is a self-hosted web portal designed to aggregate real-time data, system monitoring, and content feeds into a single, customizable dashboard. It functions as a centralized interface for managing internal tools and network services, allowing users to secure their portal with password authentication and define the application's visual identity and layout through structured configuration files. The platform is built on a modular widget engine that treats individual dashboard components as isolated units. This architecture enables users to arrange widgets into custom pages and columns, with each component independently fetching and rendering data from external sources, service status checks, or system interfaces. The system supports a wide range of integrations, including containerized workload monitoring, server resource tracking, and various web-based content feeds. Beyond its core monitoring capabilities, the application includes productivity utilities such as task management and custom search interfaces. It provides flexibility for advanced users through custom API data visualization, which allows for the transformation of raw JSON responses into formatted widgets using custom templates. The system is designed to operate behind reverse proxies and can be configured to interact directly with host-level interfaces and container runtimes.
Infinite Scroll is a JavaScript library that automatically loads and appends the next page of content when a user scrolls near the bottom of a container, or optionally when they click a button. It provides scroll-based pagination that fetches and inserts subsequent pages of content, with URL-driven content loading that determines next page URLs from selectors, templates, or custom functions. The library integrates with browser history, updating the URL and history state as new pages load while preserving scroll position so each page state is bookmarkable and refreshable. It is compatible with Masonry, Isotope, and Packery layout libraries, automatically appending loaded items into visual grid layouts. Infinite Scroll also supports analytics tracking, sending pageview events to Google Analytics each time new content is loaded. Additional capabilities include loading JSON content from API endpoints, rendering embedded elements like CodePens and Tweets after new content is appended, and displaying status indicators for request, last page, and error states during page loading. The library offers instance management for accessing, reconfiguring, or destroying an Infinite Scroll instance from a DOM element after initialization.
Lighthouse is an automated diagnostic tool that evaluates web pages against industry standards for performance, accessibility, and search engine optimization. It functions as a programmatic analysis engine and a command-line utility, allowing developers to integrate comprehensive web quality checks directly into continuous integration pipelines and local development workflows. The project distinguishes itself through a modular architecture that utilizes artifact-based data collection to ensure consistent analysis across different environments. It supports a headless execution mode for automated testing and provides a plugin-driven framework, enabling developers to register custom audit logic and specialized reporting categories to meet unique project requirements. Beyond its core auditing capabilities, the tool detects underlying web frameworks and content management systems to provide tailored optimization recommendations. It generates structured, machine-readable reports and offers multiple interfaces, including a browser-integrated panel and a dedicated extension, to facilitate real-time feedback during the development process.