# apache/incubator-devlake

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/apache-incubator-devlake).**

2,940 stars · 684 forks · Go · apache-2.0

## Links

- GitHub: https://github.com/apache/incubator-devlake
- Homepage: https://devlake.apache.org/
- awesome-repositories: https://awesome-repositories.com/repository/apache-incubator-devlake.md

## Topics

`dashboard-friendly` `data` `data-analysis` `data-engineering` `data-integration` `data-transfers` `devops` `domain-layer` `dora` `etl` `golang` `hacktoberfest` `integration` `jira` `open-source` `user-friendly`

## Description

DevLake is a DevOps data platform and analytics tool that orchestrates data pipelines to ingest, transform, and sync metadata from external development tools into a unified database. It collects and normalizes data from source control, CI/CD pipelines, and issue trackers into a standardized schema to enable consistent software delivery analytics.

The platform distinguishes itself by transforming tool-specific data into a common domain model, allowing for the calculation of engineering metrics via SQL. It provides specialized frameworks for measuring DORA metrics, analyzing engineering throughput, and tracking open source community engagement and contributor health.
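
The common-domain-model idea can be illustrated with a minimal sketch. It assumes two hypothetical tool payloads and an illustrative `pull_requests` table; the field names, status values, and schema are invented for this example and are not DevLake's own, and SQLite stands in for the relational database.

```python
import sqlite3

# Hypothetical raw records, shaped loosely like two different tools' payloads.
GITHUB_PRS = [
    {"number": 1, "state": "closed", "merged_at": "2024-01-03T10:00:00Z"},
    {"number": 2, "state": "open", "merged_at": None},
]
GITLAB_MRS = [
    {"iid": 7, "state": "merged", "merged_at": "2024-01-05T09:30:00Z"},
]


def normalize_github(pr):
    """Map a GitHub-style record onto the illustrative common schema."""
    status = "MERGED" if pr["merged_at"] else pr["state"].upper()
    return (f"github:{pr['number']}", "github", status, pr["merged_at"])


def normalize_gitlab(mr):
    """Map a GitLab-style record onto the same illustrative schema."""
    return (f"gitlab:{mr['iid']}", "gitlab", mr["state"].upper(), mr["merged_at"])


def build_database():
    """Load both sources into one table (assumed schema, not DevLake's)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE pull_requests ("
        "id TEXT PRIMARY KEY, source TEXT, status TEXT, merged_date TEXT)"
    )
    rows = [normalize_github(p) for p in GITHUB_PRS]
    rows += [normalize_gitlab(m) for m in GITLAB_MRS]
    conn.executemany("INSERT INTO pull_requests VALUES (?, ?, ?, ?)", rows)
    return conn


def merged_count(conn):
    """A tool-agnostic metric: one SQL query covers every source."""
    query = "SELECT COUNT(*) FROM pull_requests WHERE status = 'MERGED'"
    (count,) = conn.execute(query).fetchone()
    return count
```

Because both sources land in the same table with the same status vocabulary, the metric query does not need to know which tool a row came from.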

Capabilities include plugin-based data ingestion, incremental synchronization that fetches only new or changed records to reduce API load, and custom engineering dashboards. Pipeline orchestration automates the movement of data from diverse external sources into a centralized relational database.
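
Incremental synchronization can be sketched as a saved cursor that limits each run to records changed since the previous one. The function names, the `updated_at` field, and the in-memory store below are assumptions made for illustration, not DevLake's implementation.

```python
from datetime import datetime, timezone


def fetch_updated_since(records, since):
    """Stand-in for an API call that accepts an 'updated since' filter."""
    return [r for r in records if since is None or r["updated_at"] > since]


def incremental_sync(remote_records, store, state):
    """Upsert only records changed after the saved cursor, then advance it.

    Returns the number of records fetched in this run.
    """
    changed = fetch_updated_since(remote_records, state.get("cursor"))
    for record in changed:
        store[record["id"]] = record  # insert or overwrite by primary key
    if changed:
        state["cursor"] = max(r["updated_at"] for r in changed)
    return len(changed)


def day(n):
    """Helper for building UTC timestamps in the usage example."""
    return datetime(2024, 1, n, tzinfo=timezone.utc)


if __name__ == "__main__":
    remote = [
        {"id": 1, "updated_at": day(1), "title": "first"},
        {"id": 2, "updated_at": day(2), "title": "second"},
    ]
    store, state = {}, {}
    print(incremental_sync(remote, store, state))  # full sync on the first run
    remote[0] = {"id": 1, "updated_at": day(3), "title": "first (edited)"}
    print(incremental_sync(remote, store, state))  # only the edited record
```

The strict `>` comparison keeps the example short, but it can miss a record that shares the cursor's exact timestamp; a real collector would typically re-read a small overlap window and rely on the upsert to absorb duplicates.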

DevLake deploys as a cloud-native application, with Helm charts provided for Kubernetes environments.

## Tags

### Data & Databases

- [Data Ingestion Pipelines](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-ingestion-pipelines.md) — Provides a comprehensive framework for extracting, transforming, and loading DevOps metadata from diverse sources.
- [Data Standardization](https://awesome-repositories.com/f/data-databases/data-governance-modeling/data-standardization.md) — Transforms and normalizes disparate data formats from various development tools into consistent structures. ([source](https://devlake.apache.org/docs/Overview/Introduction))
- [Incremental Syncing](https://awesome-repositories.com/f/data-databases/data-modification-apis/incremental-syncing.md) — Implements incremental synchronization to fetch only new or changed records, reducing API load. ([source](https://devlake.apache.org/docs/Overview/SupportedDataSources))
- [Unified Schema Syncs](https://awesome-repositories.com/f/data-databases/data-modification-apis/incremental-syncing/unified-schema-syncs.md) — Aggregates data from multiple external APIs into a consistent, unified schema for a full lifecycle view. ([source](https://cdn.jsdelivr.net/gh/apache/incubator-devlake@main/README.md))
- [Data Normalization Layers](https://awesome-repositories.com/f/data-databases/data-normalization-layers.md) — Provides a transformation layer that converts tool-specific data into a common internal schema. ([source](https://devlake.apache.org/docs/Overview/Architecture))
- [Data Pipeline Configurations](https://awesome-repositories.com/f/data-databases/data-pipeline-configurations.md) — Implements a configuration system for defining connection scopes and transformation rules to automate external data collection. ([source](https://devlake.apache.org/docs/Overview/Architecture))
- [Data Pipeline Orchestration](https://awesome-repositories.com/f/data-databases/data-pipeline-orchestration.md) — Sequences data collection and enrichment tasks through a configurable workflow to automate data movement.
- [Plugin-Based Ingestion](https://awesome-repositories.com/f/data-databases/database-metadata-discovery/database-metadata-ingestion/plugin-based-ingestion.md) — Uses a modular plugin architecture to ingest metadata from diverse external development tools. ([source](https://devlake.apache.org/docs/Overview/KeyConcepts))
- [DevOps Metadata Ingestors](https://awesome-repositories.com/f/data-databases/database-metadata-discovery/database-metadata-ingestion/plugin-based-ingestion/devops-metadata-ingestors.md) — Provides specialized plugins for the automated gathering of metadata from source control and CI/CD tools. ([source](https://devlake.apache.org/docs/Overview/Architecture))
- [DevOps Metric Collection](https://awesome-repositories.com/f/data-databases/devops-metric-collection.md) — Provides a centralized system for gathering activity metrics from issue trackers, source control, and CI/CD pipelines. ([source](https://devlake.apache.org/docs/Overview/Roadmap))
- [External Data Connectors](https://awesome-repositories.com/f/data-databases/external-data-connectors.md) — Implements a flexible framework for linking external tools and managing the collection and transformation of incoming data. ([source](https://devlake.apache.org/))
- [SQL Metric Definitions](https://awesome-repositories.com/f/data-databases/sql-metric-definitions.md) — Calculates engineering performance indicators by executing SQL queries against a normalized relational database.
- [Connection Management](https://awesome-repositories.com/f/data-databases/connection-management.md) — Provides a dedicated interface for managing links and credentials between external tools and the system. ([source](https://devlake.apache.org/docs/Overview/Roadmap))
- [Data Processing Configurations](https://awesome-repositories.com/f/data-databases/data-processing-configurations.md) — Offers a guided process for setting up ingestion parameters to automate how data is gathered from various sources. ([source](https://cdn.jsdelivr.net/gh/apache/incubator-devlake@main/README.md))
- [Logical Project Grouping](https://awesome-repositories.com/f/data-databases/data-project-management/logical-project-grouping.md) — Organizes data scopes from multiple tools into single logical projects for unified performance analysis. ([source](https://devlake.apache.org/docs/Overview/KeyConcepts))
- [Data Querying](https://awesome-repositories.com/f/data-databases/data-querying.md) — Provides an interface for querying normalized DevOps data collections using SQL. ([source](https://devlake.apache.org/docs/Overview/Introduction))
- [Data Schema Management](https://awesome-repositories.com/f/data-databases/data-schema-management.md) — Provides tools for modifying the domain layer schema and transforming data models to fit specific organizational needs. ([source](https://devlake.apache.org/docs/Overview/Roadmap))
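
Plugin-based ingestion and pipeline orchestration, as described in the tags above, can be sketched as a registry of plugins whose subtasks run in a fixed order. The plugin name, the collect/extract/convert split, and the plan format are hypothetical choices for this example and should not be read as DevLake's actual plugin interface.

```python
from typing import Callable, Dict, List

# Registry mapping a plugin name to its ordered subtasks (names are hypothetical).
PLUGINS: Dict[str, List[Callable[[dict], None]]] = {}


def register(name, subtasks):
    """Make a plugin available to the pipeline runner."""
    PLUGINS[name] = list(subtasks)


def collect_issues(ctx):
    """Fetch raw API responses (hard-coded here instead of a network call)."""
    ctx["raw"] = [{"key": "PROJ-1", "fields": {"status": "Done"}}]


def extract_issues(ctx):
    """Flatten raw responses into tool-specific rows."""
    ctx["tool"] = [
        {"key": r["key"], "status": r["fields"]["status"]} for r in ctx["raw"]
    ]


def convert_issues(ctx):
    """Convert tool-specific rows into the shared, tool-agnostic shape."""
    ctx["domain"] = [
        {"id": f"tracker:{t['key']}", "status": t["status"].upper()}
        for t in ctx["tool"]
    ]


register("example_tracker", [collect_issues, extract_issues, convert_issues])


def run_pipeline(plan):
    """Run each plugin named in the plan, executing its subtasks in order."""
    results = {}
    for step in plan:
        ctx = {"options": step.get("options", {})}
        for subtask in PLUGINS[step["plugin"]]:
            subtask(ctx)
        results[step["plugin"]] = ctx["domain"]
    return results
```

Adding a new data source in this sketch means registering another list of subtasks; the runner itself does not change.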

### DevOps & Infrastructure

- [Software Delivery Analytics](https://awesome-repositories.com/f/devops-infrastructure/software-delivery-analytics.md) — Provides a comprehensive platform for measuring DORA metrics and community health by aggregating activity across the software development lifecycle.
- [Delivery Performance Tracking](https://awesome-repositories.com/f/devops-infrastructure/automated-software-delivery/delivery-performance-tracking.md) — Measures cycle time and deployment frequency to identify bottlenecks in the release process.
- [DevOps Data Platforms](https://awesome-repositories.com/f/devops-infrastructure/devops-data-platforms.md) — Collects and normalizes data from source control, CI/CD, and issue trackers into a standardized schema.
- [Webhook](https://awesome-repositories.com/f/devops-infrastructure/event-ingestion/webhook.md) — Receives real-time data pushes from external services via HTTP endpoints when native polling is unavailable.
- [Plugin Development Kits](https://awesome-repositories.com/f/devops-infrastructure/release-automation/plugin-extensibility/plugin-development-kits.md) — Provides a software development kit for building custom integration plugins while managing external API rate limits. ([source](https://devlake.apache.org/docs/Overview/Architecture))
- [Webhook Integrations](https://awesome-repositories.com/f/devops-infrastructure/webhook-integrations.md) — Implements a capability for receiving real-time data pushes from external tools via HTTP webhooks. ([source](https://devlake.apache.org/docs/Overview/Introduction))
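
The webhook path can be sketched as a handler that validates a pushed JSON payload and stores a normalized record. The payload fields (`id`, `result`, `finished_at`), the status codes chosen, and the list-based store are assumptions for illustration; DevLake's real webhook payloads are defined in its own documentation.

```python
import json

# Fields this illustrative handler requires in every pushed payload.
REQUIRED_FIELDS = ("id", "result", "finished_at")


def handle_deployment_webhook(body: bytes, store: list):
    """Validate a pushed payload and append a normalized record to `store`.

    Returns an (HTTP-like status code, response dict) pair.
    """
    try:
        payload = json.loads(body)
    except ValueError:  # covers malformed JSON and undecodable bytes
        return 400, {"error": "invalid JSON"}
    if not isinstance(payload, dict):
        return 400, {"error": "expected a JSON object"}
    missing = [field for field in REQUIRED_FIELDS if field not in payload]
    if missing:
        return 422, {"error": "missing fields: " + ", ".join(missing)}
    store.append(
        {
            "id": f"webhook:{payload['id']}",
            "result": str(payload["result"]).upper(),
            "finished_date": payload["finished_at"],
        }
    )
    return 200, {"stored": 1}
```

Keeping the handler a pure function of the request body makes it easy to mount behind any HTTP framework and to test without a running server.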

### Web Development

- [Domain-Model Normalizations](https://awesome-repositories.com/f/web-development/external-api-integrations/schema-normalizations/domain-model-normalizations.md) — Transforms tool-specific data into a standardized common schema to allow cross-platform metric calculation.

### Networking & Communication

- [Schema-Standardizing Transformations](https://awesome-repositories.com/f/networking-communication/api-content-transformations/schema-standardizing-transformations.md) — Provides a set of custom rules for converting raw API responses into a standardized layer for accurate metric computation. ([source](https://devlake.apache.org/docs/Overview/KeyConcepts))

### Software Engineering & Architecture

- [Normalization Engines](https://awesome-repositories.com/f/software-engineering-architecture/data-formats/normalization-engines.md) — Normalizes diverse input formats from multiple software development tools into a unified structure for querying.
- [DORA Metrics](https://awesome-repositories.com/f/software-engineering-architecture/dora-metrics.md) — Implements a framework for tracking DORA metrics to evaluate software release efficiency. ([source](https://devlake.apache.org/livedemo/EngineeringLeads))
- [Engineering Throughput Analytics](https://awesome-repositories.com/f/software-engineering-architecture/engineering-throughput-analytics.md) — Provides tools for measuring cycle time and throughput across teams to identify bottlenecks and track velocity. ([source](https://devlake.apache.org/livedemo/EngineeringLeads))
- [Development Velocity Metrics](https://awesome-repositories.com/f/software-engineering-architecture/performance-metrics/development-velocity-metrics.md) — Tracks engineering performance and delivery velocity by analyzing data from version control and CI/CD tools.
- [Trend Visualizations](https://awesome-repositories.com/f/software-engineering-architecture/bug-pattern-mining/trend-visualizations.md) — Provides visualizations of bug patterns and resolution timelines to analyze quality issues. ([source](https://devlake.apache.org/livedemo/EngineeringLeads))
- [Community Engagement Analytics](https://awesome-repositories.com/f/software-engineering-architecture/community-engagement-analytics.md) — Aggregates activity data periodically to evaluate trends and growth in community contributions over time. ([source](https://devlake.apache.org/livedemo/OSSMaintainers))
- [Open Source Community Analytics](https://awesome-repositories.com/f/software-engineering-architecture/open-source-community-analytics.md) — Gathers contributor and engagement metrics to evaluate the health and growth of community-driven projects.
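
Three of the DORA metrics mentioned above (deployment frequency, change failure rate, and lead time for changes) can be sketched over plain records. The record fields and the exact definitions used here, such as counting only successful deployments and treating an incident link as a failure, are simplifying assumptions rather than DevLake's calculations.

```python
from datetime import datetime
from statistics import median


def deployment_frequency(deployments, days):
    """Successful deployments per day over a window of `days` days."""
    successes = [d for d in deployments if d["result"] == "SUCCESS"]
    return len(successes) / days


def change_failure_rate(deployments):
    """Share of deployments that were linked to an incident."""
    if not deployments:
        return 0.0
    failed = sum(1 for d in deployments if d["caused_incident"])
    return failed / len(deployments)


def median_lead_time_hours(changes):
    """Median time from commit to deployment, in hours."""
    return median(
        (c["deployed_at"] - c["committed_at"]).total_seconds() / 3600
        for c in changes
    )


if __name__ == "__main__":
    deployments = [
        {"result": "SUCCESS", "caused_incident": False},
        {"result": "SUCCESS", "caused_incident": True},
        {"result": "FAILURE", "caused_incident": False},
    ]
    changes = [
        {
            "committed_at": datetime(2024, 1, 1, 8),
            "deployed_at": datetime(2024, 1, 1, 12),
        },
    ]
    print(deployment_frequency(deployments, days=7))
    print(change_failure_rate(deployments))
    print(median_lead_time_hours(changes))
```

The median is used for lead time because a few long-running changes would otherwise dominate an average.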

### System Administration & Monitoring

- [Data Ingestion Plugins](https://awesome-repositories.com/f/system-administration-monitoring/data-ingestion-plugins.md) — Uses a modular plugin system to fetch raw data from diverse external APIs and toolchains.
- [Engagement Analytics](https://awesome-repositories.com/f/system-administration-monitoring/engagement-analytics.md) — Tracks engagement and contributor health metrics to understand user interaction with open source projects. ([source](https://devlake.apache.org/livedemo/OSSMaintainers))
- [Metric Dashboards](https://awesome-repositories.com/f/system-administration-monitoring/metric-dashboards.md) — Ships prebuilt dashboards for tracking delivery process performance indicators and development progress. ([source](https://devlake.apache.org/))
- [Custom Metric Dashboards](https://awesome-repositories.com/f/system-administration-monitoring/metrics-visualizers/custom-metric-dashboards.md) — Enables creation of user-defined SQL visualizations to monitor development KPIs and team goals.

### Development Tools & Productivity

- [Git Repository Analysis](https://awesome-repositories.com/f/development-tools-productivity/git-repository-analysis.md) — Tracks component and file-level metrics to identify high-churn areas and codebase complexity within Git repositories. ([source](https://devlake.apache.org/livedemo/EngineeringLeads))
- [REST APIs](https://awesome-repositories.com/f/development-tools-productivity/rest-apis.md) — Provides a standard RESTful API enabling programmatic retrieval and modification of system data for automation. ([source](https://devlake.apache.org/docs/Overview/References))
