# datadog/datadog-agent

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/datadog-datadog-agent).**

3,519 stars · 1,376 forks · Go · apache-2.0

## Links

- GitHub: https://github.com/DataDog/datadog-agent
- Homepage: https://docs.datadoghq.com/
- awesome-repositories: https://awesome-repositories.com/repository/datadog-datadog-agent.md

## Topics

`apm-agent` `apm-instrumentation` `datadog` `distributed-tracing` `go` `logging` `metrics` `monitoring` `observability` `open-telemetry` `otel` `profiling` `tracing`

## Description

The Datadog Agent is an infrastructure monitoring agent and host telemetry collector. It functions as a background process that gathers system metrics and application health data to send to a centralized monitoring platform.

The project operates as a plugin-based metric collector, using a modular system of independent check scripts to gather data from various third-party services and applications. It serves as a remote telemetry transmitter, providing a pipeline to stream infrastructure and system information to a remote analysis and alerting backend.

Its capabilities cover application performance monitoring, host resource tracking, and infrastructure performance monitoring. The agent collects low-level system telemetry from the operating system kernel and filesystem while aggregating application-level performance data to identify service degradation.

## Tags

### System Administration & Monitoring

- [System Telemetry Collectors](https://awesome-repositories.com/f/system-administration-monitoring/system-telemetry-collectors.md) — Gathers performance metrics from kernel interfaces and system files for monitoring purposes.
- [Application Performance Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/application-performance-monitoring.md) — Tracks runtime errors and performance bottlenecks in distributed software to ensure stability.
- [Modular Check Collectors](https://awesome-repositories.com/f/system-administration-monitoring/global-metric-registries/custom-metric-collectors/modular-check-collectors.md) — Implements a modular system using independent check scripts to gather data from various third-party services.
- [System Metrics Collection](https://awesome-repositories.com/f/system-administration-monitoring/logging/system-metrics-collection.md) — Gathers native system-level performance metrics from the operating system kernel and filesystem.
- [Monitoring Agents](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-agents.md) — Acts as a lightweight software agent on remote hosts to collect and push system health metrics to a central server.
- [System Resource Tracking](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/observability-platforms/metric-performance-monitors/model-performance-tracking/system-resource-tracking.md) — Monitors server-level hardware and network resource utilization including CPU, memory, and disk.
- [Application Diagnostic Data](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/observability-platforms/operational-health-alerting/health-monitoring-endpoints/application-diagnostic-data.md) — Collects stability and performance diagnostic telemetry from running applications to identify bottlenecks.
- [Application Health Monitors](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/observability-platforms/operational-health-alerting/health-monitoring-endpoints/application-health-monitors.md) — Tracks the operational health and performance metrics of running software applications. ([source](https://cdn.jsdelivr.net/gh/datadog/datadog-agent@main/README.md))
- [Telemetry Collection and Aggregation](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/observability-platforms/telemetry-collection-aggregation.md) — Provides the transport and streaming infrastructure to send telemetry data to a remote platform for analysis. ([source](https://cdn.jsdelivr.net/gh/datadog/datadog-agent@main/README.md))
- [Infrastructure Health Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/real-time-monitoring-systems/infrastructure-health-monitoring.md) — Provides real-time observation of server hardware and system resource stability to ensure availability.
- [Remote Telemetry Dashboards](https://awesome-repositories.com/f/system-administration-monitoring/remote-telemetry-dashboards.md) — Provides the telemetry stream that populates remote monitoring dashboards for system health and performance visualization.
- [System Health Monitors](https://awesome-repositories.com/f/system-administration-monitoring/system-health-monitors.md) — Tracks real-time system metrics, hardware status, and resource usage of the host operating system. ([source](https://cdn.jsdelivr.net/gh/datadog/datadog-agent@main/README.md))
- [Telemetry Transmission Pipelines](https://awesome-repositories.com/f/system-administration-monitoring/telemetry-transmission-pipelines.md) — Implements a secure pipeline for streaming collected infrastructure telemetry to a remote analysis and alerting backend.
- [Metric Aggregators](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/observability-platforms/metric-performance-monitors/server-metrics/metric-aggregators.md) — Collects individual metrics over the network and summarizes them for centralized backend delivery.
- [Periodic Metric Polling](https://awesome-repositories.com/f/system-administration-monitoring/periodic-metric-polling.md) — Triggers metric collection tasks at defined intervals using a central scheduler to ensure consistent time-series data.

### Operating Systems & Systems Programming

- [Kernel-Level Operations](https://awesome-repositories.com/f/operating-systems-systems-programming/kernel-core-internals/operating-system-kernels/kernel-level-operations.md) — Interacts directly with the operating system kernel and filesystem to extract low-level hardware and resource metrics.

### Software Engineering & Architecture

- [Metric Collection Plugins](https://awesome-repositories.com/f/software-engineering-architecture/software-architecture/architectural-patterns/plugin-module-systems/modular-plugin-architectures/plugin-based-architectures/plugin-based-architectures/metric-collection-plugins.md) — Uses a modular system of independent check scripts to gather specific data from third-party services and applications.
- [Process Isolation Architectures](https://awesome-repositories.com/f/software-engineering-architecture/process-isolation-architectures.md) — Runs separate processes for the core agent and specialized collectors to isolate failures and manage resources.

### DevOps & Infrastructure

- [Auto-Discovery Mechanisms](https://awesome-repositories.com/f/devops-infrastructure/service-discovery-and-configuration-platforms/auto-discovery-mechanisms.md) — Automatically identifies running services and monitors by scanning system processes and reading local configuration files.
