# Data Lineage Tracking Tools

> Search results for `track data lineage from source to dashboard` on awesome-repositories.com. 115 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/track-data-lineage-from-source-to-dashboard

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/track-data-lineage-from-source-to-dashboard).**

## Results

- [grafana/grafana](https://awesome-repositories.com/repository/grafana-grafana.md) (74,456 ⭐) — Grafana is an observability data platform designed to aggregate metrics, logs, and traces from diverse sources into a unified environment. It functions as a centralized interface for visualizing complex telemetry data, transforming raw streams into interactive dashboards that support real-time system health tracking and performance monitoring.

The platform distinguishes itself through a plugin-based modular architecture that integrates disparate databases, cloud services, and monitoring tools via a standardized data abstraction layer. This framework allows for the dynamic loading of external
- [joelgrus/data-science-from-scratch](https://awesome-repositories.com/repository/joelgrus-data-science-from-scratch.md) (9,636 ⭐) — This project is a collection of foundational machine learning algorithms and data science tools implemented in Python. It focuses on building the logic of these tools using basic programming primitives rather than relying on specialized libraries.

The implementation covers several core domains, including a linear algebra library for matrix and vector operations, a statistical analysis toolkit for probability and hypothesis testing, and a framework for map-reduce distributed processing. It also includes implementations for natural language processing, graph theory for network analysis, and var
- [amitshekhariitbhu/from-java-to-kotlin](https://awesome-repositories.com/repository/amitshekhariitbhu-from-java-to-kotlin.md) (6,324 ⭐) — From Java To Kotlin - Your Cheat Sheet For Java To Kotlin
- [dubinc/dub](https://awesome-repositories.com/repository/dubinc-dub.md) (23,722 ⭐) — This project is a comprehensive link management and marketing attribution platform designed for creating, tracking, and analyzing shortened URLs. It functions as a centralized hub for marketing analytics, providing tools to monitor link performance, visualize conversion funnels, and manage affiliate programs through a unified dashboard.

The platform distinguishes itself by integrating advanced attribution modeling and partner management directly into the link infrastructure. It supports complex marketing workflows, including automated commission calculations, fraud detection, and payout distr
- [fastshift/x-track](https://awesome-repositories.com/repository/fastshift-x-track.md) (6,250 ⭐) — X-Track is a firmware project for an embedded bicycle computer that combines GPS-based speed and ride metrics with offline map navigation. It functions as a GPS bicycle speedometer, displaying speed, distance, altitude, and other ride data on a handlebar-mounted screen, while also serving as an offline map viewer that renders locally stored map tiles without an internet connection.

The project distinguishes itself by including a firmware emulator that runs the embedded code on a PC, enabling development and testing without physical hardware. It also provides GPS-based clock calibration to aut
- [datahub-project/datahub](https://awesome-repositories.com/repository/datahub-project-datahub.md) (12,141 ⭐) — DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations.

The platform distinguishes itself through its focus on grounding artificial intelligence and autono
- [quarkusio/quarkus](https://awesome-repositories.com/repository/quarkusio-quarkus.md) (15,479 ⭐) — Quarkus is a Kubernetes-native Java framework designed for building high-performance, memory-efficient applications. It utilizes ahead-of-time native compilation to transform Java code into standalone, optimized binaries that eliminate the need for a virtual machine, enabling rapid startup and reduced memory consumption. By performing code augmentation during the build phase, it shifts heavy processing tasks away from runtime, ensuring that applications are optimized for cloud-native environments.

The framework distinguishes itself through a unified approach to reactive and imperative program
- [gitpod-io/gitpod](https://awesome-repositories.com/repository/gitpod-io-gitpod.md) (13,713 ⭐) — Gitpod is a cloud development environment platform and Docker-based workspace orchestrator. It provisions on-demand, containerized Linux workspaces that synchronize directly with remote version control repositories to eliminate manual local setup.

The platform functions as an infrastructure-as-code environment manager, using declarative configuration files to define base images, tool versions, and environment variables. This ensures team-wide consistency and reproducible workspace configurations across different projects.

It provides ephemeral coding workspaces for rapid prototyping, debuggi
- [niuiic/track.nvim](https://awesome-repositories.com/repository/niuiic-track-nvim.md) (26 ⭐) — Neovim plugin to track the thought process of reading source code.
- [f/prompts.chat](https://awesome-repositories.com/repository/f-prompts-chat.md) (163,814 ⭐) — This platform serves as a centralized management system for organizing, refining, and versioning AI instructions and agent skills. It functions as a repository that enables users to store, categorize, and retrieve structured prompts, ensuring consistent performance across various artificial intelligence models. By integrating with the Model Context Protocol, the system allows external AI assistants and development environments to discover and access these instruction libraries directly.

The platform distinguishes itself through its focus on prompt engineering and automated refinement, utilizi
- [rougier/from-python-to-numpy](https://awesome-repositories.com/repository/rougier-from-python-to-numpy.md) (2,138 ⭐) — Copyright (c) 2017 Nicolas P. Rougier License: Creative Commons Attribution 4.0 International (CC BY-NC-SA 4.0). Website: http://www.labri.fr/perso/nrougier/from-python-to-numpy
- [cube-js/cube](https://awesome-repositories.com/repository/cube-js-cube.md) (20,251 ⭐) — Cube is a semantic data layer that provides a unified framework for defining business metrics, dimensions, and relationships across diverse data sources. By acting as a headless business intelligence engine, it transforms raw data into a governed model that can be queried via SQL, REST, and GraphQL interfaces. This architecture ensures consistent data definitions and logic across all downstream analytical applications and reporting tools.

The platform distinguishes itself through its integrated conversational AI capabilities, which allow users to explore data using natural language. It orches
- [casdoor/casdoor](https://awesome-repositories.com/repository/casdoor-casdoor.md) (13,814 ⭐) — Casdoor is a centralized identity and access management platform that functions as an OAuth 2.0 authorization server. It provides a comprehensive suite of services for managing user identities, authentication sessions, and access policies across both web and machine-to-machine applications. Built with a decoupled frontend-backend architecture in Go, the platform supports high-concurrency environments and offers a web-based management interface for administrative tasks.

The platform distinguishes itself through its extensive support for federated identity management, allowing integration with
- [mlflow/mlflow](https://awesome-repositories.com/repository/mlflow-mlflow.md) (26,554 ⭐)
- [cockroachdb/cockroach](https://awesome-repositories.com/repository/cockroachdb-cockroach.md) (32,207 ⭐) — Cockroach is a distributed SQL database designed to scale horizontally across multiple nodes while maintaining strict ACID compliance and global data consistency. It functions as a relational database engine that automatically partitions data into ranges, rebalancing them across a cluster to accommodate growing storage and throughput requirements. By utilizing a distributed consensus protocol, the system ensures that all nodes agree on the order of operations, providing fault tolerance and continuous availability even in the event of hardware failures.

The system distinguishes itself through
- [jialeli1/from-voxel-to-point](https://awesome-repositories.com/repository/jialeli1-from-voxel-to-point.md) (0 ⭐) — This is a reproduced repo of "From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder" (FromVoxelToPoint) and "Anchor-free 3D Single Stage Detector with Mask-Guided Attention for Point Cloud" (MGAF-3DSSD) in ACM MM 2021.
- [dbt-labs/dbt-core](https://awesome-repositories.com/repository/dbt-labs-dbt-core.md) (13,051 ⭐) — dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history.

The project distinguishes itself through an adapter-based d
- [danthareja/contribute-to-open-source](https://awesome-repositories.com/repository/danthareja-contribute-to-open-source.md) (1,495 ⭐) — The goal of this project is to empower you to contribute code to open source projects on GitHub by teaching you the mechanics of the process in an interactive experience.
- [amruthpillai/reactive-resume](https://awesome-repositories.com/repository/amruthpillai-reactive-resume.md) (38,613 ⭐) — This project is a web-based platform designed for creating, managing, and sharing professional resumes. It functions as a structured document builder that integrates artificial intelligence to assist with content generation, editing, and analysis. Users can maintain a collection of resumes, customize their visual presentation through various templates, and export them into multiple formats for job applications.

The platform distinguishes itself through its autonomous AI agent capabilities, which can perform research, suggest incremental edits, and apply data patches directly to documents. It
- [keen/dashboards](https://awesome-repositories.com/repository/keen-dashboards.md) (11,038 ⭐) — This project is a collection of responsive CSS Grid dashboard templates and a data visualization UI kit. It provides a set of HTML layouts designed for building analytics interfaces and monitoring views for KPIs and business metrics that adapt to different screen sizes.

The toolkit is library-agnostic, allowing the connection of static HTML templates to any external data source or third-party charting library without requiring custom adapter code. It uses a template-driven approach to separate the visual structure of the dashboard from the underlying data.

The capabilities cover the assembly
- [plausible/analytics](https://awesome-repositories.com/repository/plausible-analytics.md) (24,245 ⭐) — This project is an open-source, privacy-focused web analytics platform designed for high-throughput data ingestion and multi-tenant data management. It provides a cookie-less tracking engine that captures visitor interactions using ephemeral request metadata, ensuring comprehensive traffic visibility while maintaining strict privacy standards. The architecture utilizes an event-driven ingestion pipeline and aggregated metric storage to decouple data collection from processing, enabling efficient long-term retrieval and responsive dashboard performance.

What distinguishes this platform is its
- [growinggit/github-chinese-top-charts](https://awesome-repositories.com/repository/growinggit-github-chinese-top-charts.md) (108,509 ⭐) — This project functions as a curated software directory and developer resource index, providing a centralized platform for discovering and evaluating high-quality open-source repositories. It serves as an aggregator that monitors trending software and educational resources, organizing them by technical domain and programming language to assist developers in identifying tools for their specific technical challenges.

The directory distinguishes itself through a community-driven curation workflow, where repository lists are validated and updated based on collective developer consensus. This infor
- [tracksapp/tracks](https://awesome-repositories.com/repository/tracksapp-tracks.md) (1,235 ⭐) — Tracks is a GTD™ web application, built with Ruby on Rails
- [duckdb/duckdb](https://awesome-repositories.com/repository/duckdb-duckdb.md) (38,805 ⭐) — DuckDB is an in-process analytical database engine designed to run directly within an application process. As a zero-dependency, embedded system, it provides enterprise-grade SQL data processing capabilities without the overhead of managing a dedicated database server. It is built to handle complex analytical and aggregation tasks by storing and retrieving information in columns, allowing for high-performance relational data manipulation.

The engine distinguishes itself through a columnar vectorized execution model that maximizes CPU cache efficiency during query operations. It employs adapti
- [activitywatch/activitywatch](https://awesome-repositories.com/repository/activitywatch-activitywatch.md) (17,912 ⭐) — ActivityWatch is an open-source, privacy-focused time tracking platform that automatically records computer usage patterns to provide a comprehensive history of personal digital activity. It functions as a background service that monitors system metrics, application focus, and user input to build a detailed log of how time is allocated across various tasks and projects.

The platform distinguishes itself through a local-first architecture that stores all activity data as structured files on the user's machine, ensuring that sensitive usage history remains private and accessible offline. It sup
- [ivaskuu/dashboard](https://awesome-repositories.com/repository/ivaskuu-dashboard.md) (947 ⭐) — Dashboard concept made with Flutter.
- [openpanel-dev/openpanel](https://awesome-repositories.com/repository/openpanel-dev-openpanel.md) (5,349 ⭐) — OpenPanel is a self-hosted product analytics platform designed for tracking user behavior and visualizing product metrics on private infrastructure. It provides a comprehensive system for collecting events across web, mobile, and server environments while ensuring complete ownership of data.

The platform distinguishes itself through a privacy-first approach, utilizing cookieless event tracking and regional data residency to simplify regulatory compliance. It integrates large language models via the Model Context Protocol, enabling users to query behavioral data and analyze trends using natura
- [easychen/lean-side-bussiness](https://awesome-repositories.com/repository/easychen-lean-side-bussiness.md) (11,984 ⭐) — This project is a comprehensive software entrepreneurship curriculum and solopreneurship business playbook designed for developers. It provides a strategic framework for building, validating, and monetizing side businesses using lean startup methodology and a systematic product development approach.

The project distinguishes itself by offering specific guides for digital monetization and career anti-fragility, helping software engineers transition from employment to self-employment. It focuses on turning technical skills into scalable digital assets, paid communities, and independent software
- [maquannene/track](https://awesome-repositories.com/repository/maquannene-track.md) (268 ⭐) — Track is a thread safe cache write by Swift. Composed of DiskCache and MemoryCache which support LRU.
- [helix-editor/helix](https://awesome-repositories.com/repository/helix-editor-helix.md) (44,911 ⭐) — Helix is a terminal-based modal text editor designed for efficient code manipulation and navigation. It centers on a selection-first editing model, where operations are performed on active ranges rather than individual cursor positions, allowing for precise control over text and code structures.

The editor distinguishes itself through deep integration with structural parsing and language intelligence. By utilizing an incremental parsing library, it builds concrete syntax trees that enable advanced features like structural code navigation, intelligent indentation, and syntax-aware text object
- [stirling-tools/stirling-pdf](https://awesome-repositories.com/repository/stirling-tools-stirling-pdf.md) (81,109 ⭐) — Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-based interface for interactive editing and a programmatic, API-first architecture that allows for the automation of document workflows through standard HTTP requests.

The project distinguishes itself through its focus on private, infrastructure-agnostic deployment and granular
- [eduardolundgren/tracking.js](https://awesome-repositories.com/repository/eduardolundgren-tracking-js.md) (9,472 ⭐) — tracking.js is a browser computer vision library written in JavaScript for performing real-time image analysis and object tracking directly within a web browser. It functions as a real-time object tracker, a color tracking tool, and a face detection utility.

The library enables the detection and monitoring of specific color ranges, human faces, and known visual patterns across consecutive video frames. It extracts visual features and descriptors from images to identify distinct landmarks for matching and tracking.

The project covers broad computer vision capabilities, including the ability t
- [davila7/claude-code-templates](https://awesome-repositories.com/repository/davila7-claude-code-templates.md) (20,933 ⭐) — Claude Code Templates is a comprehensive framework for orchestrating specialized AI agents and automating development workflows within local environments. It provides a structured system for defining, configuring, and deploying AI personas that handle specific technical tasks, ranging from backend architecture and frontend implementation to security auditing and infrastructure management.

The project distinguishes itself through a configuration-driven approach that allows teams to standardize development environments and share reusable agent definitions across projects. It includes a robust C
- [tomoki-takahashi-oisix/dbt-column-lineage](https://awesome-repositories.com/repository/tomoki-takahashi-oisix-dbt-column-lineage.md) (14 ⭐) — This is a tool to visualize the column level lineage of dbt models. It uses the manifest.json and catalog.json files generated by dbt to create a graph of the lineage of the models. It is a web application that uses a FastAPI backend and a Next.js frontend.
- [umami-software/umami](https://awesome-repositories.com/repository/umami-software-umami.md) (37,285 ⭐) — Umami is a self-hosted, privacy-focused web analytics platform designed to provide full control over infrastructure and user data. It captures website traffic and visitor behavior through anonymous tracking methods that avoid cookies, browser fingerprinting, and the storage of personally identifiable information.

The platform distinguishes itself through a comprehensive suite of behavioral analysis tools, including session replays, heatmaps, and cohort-based retention reporting. It features a multi-tenant architecture that allows teams to manage multiple websites within a single, collaborativ
- [bloxstraplabs/bloxstrap](https://awesome-repositories.com/repository/bloxstraplabs-bloxstrap.md) (3,034 ⭐) — Bloxstrap is a custom game bootstrapper and configuration tool for Roblox. It replaces the standard launcher to enable advanced startup configurations, inject internal engine flags, and manage a specialized installation directory.

The project provides a client mod manager that allows users to override local assets, such as sounds, textures, and fonts, and ensures these customizations persist across game updates. It includes a configuration utility to unlock hidden graphics settings and engine parameters, alongside a server tracker that identifies the geographic location of active game servers
- [zce/dashboard](https://awesome-repositories.com/repository/zce-dashboard.md) (559 ⭐) — A dashboard scaffolding based on Vue.js 3.0 created by Vite.
- [pinpoint-apm/pinpoint](https://awesome-repositories.com/repository/pinpoint-apm-pinpoint.md) (13,830 ⭐) — Pinpoint is a distributed application performance management tool designed to trace requests and monitor metrics across large-scale distributed architectures. It functions as a request tracer, topology mapper, and JVM application monitor, providing a backend capable of collecting and visualizing trace data from OpenTelemetry compatible sources.

The system distinguishes itself through a combination of bytecode-based instrumentation via a Java agent and topology-based visualization that renders live maps of service interconnections. It captures execution flow across asynchronous boundaries, suc
- [go-gitea/gitea](https://awesome-repositories.com/repository/go-gitea-gitea.md) (56,328 ⭐) — Gitea is a self-hosted service designed for managing version control repositories, project issue tracking, and software artifact distribution. It provides a collaborative platform that enables teams to host their own source code, manage development tasks through integrated project boards, and store container images or language-specific packages within a unified environment.

The platform distinguishes itself through a built-in automation engine that executes continuous integration and delivery pipelines directly triggered by repository events. It utilizes a background task queue to manage asyn
- [danielbayerlein/dashboard](https://awesome-repositories.com/repository/danielbayerlein-dashboard.md) (1,337 ⭐) — 📺 Create your own team dashboard with custom widgets. Built with Next.js, React, styled-components and polished.
- [dagster-io/dagster](https://awesome-repositories.com/repository/dagster-io-dagster.md) (14,974 ⭐) — Dagster is a data orchestration platform designed to manage the entire lifecycle of data assets through declarative modeling and version-controlled code. It functions as a workflow engine that treats data assets as first-class primitives, allowing teams to define, schedule, and monitor complex pipelines while maintaining clear visibility into lineage, dependencies, and data quality.

The platform distinguishes itself by using a code-as-configuration framework that enables standard software engineering practices, such as unit testing and local mocking, to be applied directly to data workflows.
- [signoz/signoz](https://awesome-repositories.com/repository/signoz-signoz.md) (27,355 ⭐) — SigNoz is a full-stack observability platform designed to collect, store, and visualize metrics, logs, and distributed traces in a unified environment. It leverages OpenTelemetry-based data collection to ingest telemetry from diverse sources using vendor-neutral protocols, ensuring interoperability across complex microservices architectures. The platform utilizes a high-performance columnar storage engine to enable rapid aggregation and filtering, providing a centralized backend for monitoring application health and performance.

What distinguishes the platform is its focus on automated instru
- [grafana/tempo](https://awesome-repositories.com/repository/grafana-tempo.md) (5,079 ⭐) — Grafana Tempo is a high-scale distributed tracing backend and columnar trace database. It serves as an observability data store that persists and queries spans and traces using OpenTelemetry standards, allowing for the analysis of request flows across microservices.

The system distinguishes itself by using an object-store based backend with columnar Parquet storage. This architecture enables efficient attribute searching and large-scale data retrieval through dedicated attribute columnization and block-based data partitioning. It includes a specialized TraceQL query engine for filtering trace
- [gokumohandas/made-with-ml](https://awesome-repositories.com/repository/gokumohandas-made-with-ml.md) (48,343 ⭐) — Made-With-ML is an automated documentation generator and developer experience platform designed to transform source code into structured, searchable reference websites. It functions as a codebase intelligence tool that parses implementation details to provide clear explanations of logic and data requirements.

The system distinguishes itself by leveraging language-level type annotations and structured code comments to generate interface specifications. By utilizing static analysis to extract metadata, it automates the transformation of docstrings into web-ready documentation, ensuring that tec
- [gaomingqi/track-anything](https://awesome-repositories.com/repository/gaomingqi-track-anything.md) (6,936 ⭐) — Track-Anything is an AI-driven video object segmentation and tracking system. It utilizes the Segment Anything Model to isolate and mask multiple objects across video frames, providing tools for automated mask propagation and background-filling inpainting.

The system distinguishes itself through a multi-object segmentation pipeline that can follow several distinct targets simultaneously. It includes a video inpainting utility to remove tracked objects and replace them with synthesized background content, as well as temporal mask refinement to correct tracking drift.

The project covers broad
- [aperezhortal/flake8-dashboard](https://awesome-repositories.com/repository/aperezhortal-flake8-dashboard.md) (16 ⭐) — flake8-dashboard
- [cortexproject/cortex](https://awesome-repositories.com/repository/cortexproject-cortex.md) (5,751 ⭐) — Cortex is an open-source, horizontally scalable metrics platform that ingests, stores, and queries Prometheus-compatible time-series data with multi-tenant isolation. It accepts metrics via Prometheus remote write and OpenTelemetry, executes PromQL queries against both recent and historical data, and provides a Prometheus-compatible alerting and recording rule engine with an integrated Alertmanager. The system is built as a set of independently scalable microservices that use hash-ring-based sharding, gossip-based cluster membership, and tenant-aware object storage to distribute workloads acro
- [fingerprintjs/fingerprintjs](https://awesome-repositories.com/repository/fingerprintjs-fingerprintjs.md) (27,334 ⭐) — Fingerprint is a visitor identification and fraud detection platform that generates persistent, unique identifiers by analyzing browser and device attributes. By extracting technical signals from the client environment, it enables reliable user tracking across sessions without relying on traditional cookies.

The platform distinguishes itself through its focus on high-accuracy identification and security-first architecture. It employs edge-side proxying to bypass ad-blockers and privacy restrictions, ensuring consistent data collection. To maintain data integrity, it uses cryptographic payload
- [appsilon/semantic.dashboard](https://awesome-repositories.com/repository/appsilon-semantic-dashboard.md) (256 ⭐) — semantic.dashboard
- [jaegertracing/jaeger](https://awesome-repositories.com/repository/jaegertracing-jaeger.md) (22,890 ⭐) — Jaeger is a distributed tracing platform used for collecting, storing, and visualizing request flows across microservices. It identifies performance bottlenecks and errors by tracking requests as they move through multiple service boundaries. The system includes telemetry collectors, a multi-tenant backend, and a trace visualizer.

The platform provides a multi-tenant tracing infrastructure that isolates data and queries by tenant to support shared environments. It supports standardized telemetry ingestion via the OpenTelemetry Protocol over gRPC and HTTP. To manage storage costs and overhead,
