2 repos
Automated workflows designed to detect service disruptions and restore stability through predefined incident response actions.
Explore 2 awesome GitHub repositories matching system administration & monitoring · Automated Incident Response Workflows.
Grafana is an observability data platform designed to aggregate metrics, logs, and traces from diverse sources into a unified environment. It functions as a centralized interface for visualizing complex telemetry data, transforming raw streams into interactive dashboards that support real-time system health tracking…
Traefik is a cloud-native edge router and API gateway designed to manage service communication and traffic flow across distributed infrastructure. It functions as a dynamic service proxy that automatically discovers backend services and configures routing rules in real time, eliminating the need for manual restarts or…