Keep is an open-source AIOps alert management platform that aggregates, deduplicates, and orchestrates the lifecycle of alerts from multiple monitoring tools. It functions as a multi-provider integration hub to centralize the flow of data between observability, ticketing, and communication tools.
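To make the deduplication idea concrete, here is a minimal sketch of fingerprint-based deduplication. It is an illustration only and does not reproduce Keep's internal logic: the `FINGERPRINT_FIELDS` tuple, the alert field names, and the `fingerprint` and `deduplicate` helpers are all assumptions made for this example.

```python
from __future__ import annotations

import hashlib
import json
from typing import Iterable

# Hypothetical choice of fields that identify "the same" alert.
# This is an assumption for illustration, not Keep's actual rule.
FINGERPRINT_FIELDS = ("source", "name", "service")


def fingerprint(alert: dict) -> str:
    """Hash the identifying fields so repeated firings map to one key."""
    identity = {field: alert.get(field) for field in FINGERPRINT_FIELDS}
    payload = json.dumps(identity, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def deduplicate(alerts: Iterable[dict]) -> list[dict]:
    """Keep the most recent alert per fingerprint and count its occurrences."""
    latest: dict[str, dict] = {}
    for alert in alerts:
        key = fingerprint(alert)
        count = latest[key]["count"] + 1 if key in latest else 1
        # Later alerts overwrite earlier ones with the same fingerprint.
        latest[key] = {**alert, "fingerprint": key, "count": count}
    return list(latest.values())


if __name__ == "__main__":
    incoming = [
        {"source": "prometheus", "name": "HighCPU", "service": "api", "status": "firing"},
        {"source": "prometheus", "name": "HighCPU", "service": "api", "status": "resolved"},
        {"source": "datadog", "name": "DiskFull", "service": "db", "status": "firing"},
    ]
    for alert in deduplicate(incoming):
        print(alert["name"], alert["status"], alert["count"])
```

Hashing a fixed set of identifying fields means two events from the same source about the same condition collapse into one record, while the mutable fields (such as status) reflect the latest event.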
The platform distinguishes itself through incident workflow automation and AI-powered enrichment. It uses a declarative workflow engine to execute multi-step operational sequences and integrates large language models to summarize event data and correlate technical logs for faster incident resolution.
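As a rough illustration of what a declarative workflow engine does, the sketch below describes a workflow as data and runs its steps in order against an alert. Everything here is hypothetical: the `WORKFLOW` structure, the `ACTIONS` registry, and the `set_severity` and `notify` actions are invented for the example, and the real definition format and provider calls are not described in this overview.

```python
from __future__ import annotations

from typing import Any, Callable


def set_severity(context: dict, params: dict) -> None:
    """Hypothetical action: overwrite the alert's severity."""
    context["alert"]["severity"] = params["value"]


def notify(context: dict, params: dict) -> None:
    """Hypothetical action: render a message instead of calling a chat tool."""
    context["messages"].append(params["template"].format(**context["alert"]))


# Registry mapping action names used in workflow definitions to callables.
ACTIONS: dict[str, Callable[[dict, dict], Any]] = {
    "set_severity": set_severity,
    "notify": notify,
}

# The workflow is pure data: what to do, not how to do it.
WORKFLOW = {
    "id": "escalate-high-cpu",
    "steps": [
        {"name": "raise-severity", "action": "set_severity",
         "params": {"value": "critical"}},
        {"name": "tell-oncall", "action": "notify",
         "params": {"template": "{name} on {service} is {severity}"}},
    ],
}


def run_workflow(workflow: dict, alert: dict) -> dict:
    """Execute each step in order, sharing one context between steps."""
    context = {"alert": dict(alert), "messages": []}
    for step in workflow["steps"]:
        action = ACTIONS[step["action"]]
        action(context, step["params"])
    return context


if __name__ == "__main__":
    outcome = run_workflow(
        WORKFLOW, {"name": "HighCPU", "service": "api", "severity": "warning"}
    )
    print(outcome["messages"])
```

Because the steps are data rather than code, a workflow can be stored, reviewed, and changed without touching the engine; the engine only needs to know how to look up and invoke each named action.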
The system also supports unified alert routing and bi-directional state synchronization with external platforms. It includes a containerized observability stack for telemetry and uses role-based access control and database-backed authentication to restrict access.
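Role-based access control can be pictured as a lookup from a user's role to the set of permissions that role grants. The following sketch assumes hypothetical role names (`admin`, `operator`, `viewer`) and permission strings; none of them are taken from Keep's configuration.

```python
from __future__ import annotations

# Hypothetical roles and permissions, chosen only for illustration.
ROLE_PERMISSIONS: dict[str, frozenset[str]] = {
    "admin": frozenset({"alerts:read", "alerts:write", "users:manage"}),
    "operator": frozenset({"alerts:read", "alerts:write"}),
    "viewer": frozenset({"alerts:read"}),
}


def is_allowed(role: str, permission: str) -> bool:
    """Return True if the role grants the permission; unknown roles get nothing."""
    return permission in ROLE_PERMISSIONS.get(role, frozenset())


if __name__ == "__main__":
    print(is_allowed("operator", "alerts:write"))
    print(is_allowed("viewer", "alerts:write"))
```

Denying by default for unknown roles keeps a misconfigured or missing role from accidentally granting access.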
The platform is deployed as a set of containerized services, including frontend, backend, and websocket layers.