# Open Source Incident Alerting Platforms

> Search results for `open-source alternative to PagerDuty for incident alerts` on awesome-repositories.com. 116 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/open-source-alternative-to-pagerduty-for-incident-alerts

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/open-source-alternative-to-pagerduty-for-incident-alerts).**

## Results

- [cachethq/cachet](https://awesome-repositories.com/repository/cachethq-cachet.md) (14,932 ⭐) — Cachet is a self-hosted, open-source status page system designed to communicate service uptime, incident history, and infrastructure performance to end users. It provides a centralized dashboard for managing the operational lifecycle of system components, tracking service disruptions, and scheduling maintenance windows.

The platform distinguishes itself through a comprehensive RESTful API that enables programmatic status page management and automated incident reporting. It supports deep integration with external monitoring tools, allowing for the synchronization of performance metrics and the automated triggering of status updates. Administrators can standardize communication using reusable incident templates and maintain system integrity through event-driven webhook notifications that include payload signing for authenticity.

Beyond core reporting, the system offers extensive customization options for the public-facing interface, including branding, layout adjustments, and custom asset injection. It manages administrative access through team-based permissions and protects service availability using request throttling and token-based authentication. The platform also includes built-in telemetry for usage reporting and tools for visualizing quantitative performance data over time.

The software is built using a model-view-controller pattern and relies on a relational database for state persistence. It is distributed as a web-based application that can be installed and configured to match specific organizational branding requirements.
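
As a rough illustration of what payload signing for webhooks involves, the sketch below signs and verifies a request body with HMAC-SHA256. The description above does not specify Cachet's actual scheme, so the algorithm, the shared secret, and the names `sign_payload` and `verify_payload` are all assumptions made for this example.

```python
import hashlib
import hmac


def sign_payload(secret: bytes, body: bytes) -> str:
    """Return a hex-encoded HMAC-SHA256 signature for a webhook body."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify_payload(secret: bytes, body: bytes, signature: str) -> bool:
    """Recompute the signature and compare it in constant time."""
    expected = sign_payload(secret, body)
    return hmac.compare_digest(expected, signature)


# Hypothetical shared secret and payload, for illustration only.
SECRET = b"example-shared-secret"
BODY = b'{"event": "incident.created", "id": 1}'

signature = sign_payload(SECRET, BODY)
```

A receiver would call `verify_payload` with the raw request bytes before parsing them, and reject the request when the check fails. The constant-time comparison avoids leaking how many leading characters of a forged signature were correct.
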
- [datahub-project/datahub](https://awesome-repositories.com/repository/datahub-project-datahub.md) (12,141 ⭐) — DataHub is a metadata management platform designed to unify technical, operational, and business context across diverse data ecosystems. By utilizing a graph-based metadata model and an event-driven ingestion architecture, it creates a centralized source of truth that maps complex data relationships, lineage, and ownership. This foundational framework enables organizations to maintain a synchronized view of their data landscape, supporting both human-led discovery and automated data operations.

The platform distinguishes itself through its focus on grounding artificial intelligence and autonomous agents in verified enterprise context. It provides specialized capabilities to inject provenance-aware lineage, business definitions, and quality signals into AI prompts, ensuring that generated insights are accurate and trustworthy. Through a policy-as-code governance engine, it enforces access controls and compliance rules directly within the metadata graph, allowing for programmatic oversight of data assets across hybrid environments.

Beyond its core identity, the project offers a comprehensive suite of tools for data discovery, observability, and lifecycle management. It includes features for automated lineage extraction, impact analysis, and semantic search, enabling users to navigate data dependencies and resolve quality issues efficiently. The platform also supports collaborative workflows, allowing teams to manage business glossaries, certify data assets, and automate access requests through integrated communication channels.

DataHub is built to scale, utilizing a distributed architecture that allows storage, search, and graph processing layers to operate independently. It provides standardized interfaces and a bridge-based connector framework to facilitate integration with heterogeneous data sources and external AI agent frameworks.
- [crowdsecurity/crowdsec](https://awesome-repositories.com/repository/crowdsecurity-crowdsec.md) (12,574 ⭐) — CrowdSec is a collaborative, distributed security engine designed for threat detection and infrastructure protection. It functions as an intrusion detection system that parses logs and network traffic to identify malicious patterns, utilizing a bucket-based threshold detection model to aggregate events and trigger alerts. The platform is built on a modular architecture that includes a centralized local API server for managing security signals and a relational database for persistent storage of remediation decisions.

What distinguishes the project is its decoupled enforcement model, which offloads active blocking to lightweight external components known as bouncers. These bouncers query the central API to synchronize threat intelligence and apply real-time remediation across distributed environments. The system also features a hub-based configuration management framework, allowing users to download and deploy community-curated security scenarios, parsers, and collections to ensure consistent protection against evolving threats.

The platform provides a comprehensive suite of tools for security operations, including automated log parsing pipelines, event-driven plugin systems for notification workflows, and extensive command-line utilities for infrastructure management. It supports flexible deployment patterns across standalone, containerized, and cloud-native environments, enabling centralized orchestration of security agents and fleet-wide monitoring of threat activity.

The project includes robust documentation and a command-line interface that facilitate the lifecycle management of security components, from initial service discovery and configuration to the validation of detection logic and the auditing of active security policies.
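
The bucket-based threshold model mentioned above can be pictured with a small leaky-bucket sketch. This is a simplified illustration and not CrowdSec's implementation: the `LeakyBucket` class, the `detect` helper, and the `capacity` and `leak_interval` parameters are hypothetical names chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class LeakyBucket:
    capacity: int          # events the bucket can hold before it overflows
    leak_interval: float   # seconds it takes for one event to leak out
    level: float = 0.0
    last_seen: float = 0.0

    def pour(self, timestamp: float) -> bool:
        """Add one event; return True if the bucket overflows."""
        elapsed = timestamp - self.last_seen
        # Drain the bucket according to the time since the previous event.
        self.level = max(0.0, self.level - elapsed / self.leak_interval)
        self.last_seen = timestamp
        self.level += 1
        if self.level > self.capacity:
            self.level = 0.0  # start over after raising an alert
            return True
        return False


def detect(events, capacity=3, leak_interval=10.0):
    """Keep one bucket per source and list the (timestamp, source) overflows."""
    buckets = {}
    alerts = []
    for timestamp, source in events:
        bucket = buckets.setdefault(
            source, LeakyBucket(capacity, leak_interval, last_seen=timestamp)
        )
        if bucket.pour(timestamp):
            alerts.append((timestamp, source))
    return alerts


# Hypothetical events: source "a" bursts, source "b" is slow and never overflows.
events = [(0, "a"), (1, "a"), (2, "a"), (3, "a"), (0, "b"), (100, "b")]
alerts = detect(events)
```

A burst from one source fills its bucket faster than it drains and raises an alert, while widely spaced events leak away before the threshold is reached.
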
- [dkhamsing/open-source-ios-apps](https://awesome-repositories.com/repository/dkhamsing-open-source-ios-apps.md) (50,744 ⭐) — This project is a comprehensive directory of open-source iOS applications designed to serve as a technical reference for developers and learners. It functions as a curated index of mobile software, categorizing projects by their functionality, implementation language, and architectural design to provide a clear view of how professional applications are structured.

The repository distinguishes itself by offering a deep dive into mobile app architecture, allowing users to study real-world codebases that utilize patterns such as Model-View-ViewModel, VIPER, and Clean Architecture. It highlights how these structures support complex application requirements, including the integration of platform-specific technologies like ARKit, CoreML, WidgetKit, and WatchOS. By showcasing diverse implementations, the directory provides a practical look at how developers manage state-driven components and modular UI elements within the Apple ecosystem.

Beyond native iOS development, the collection covers a broad spectrum of mobile engineering practices, including cross-platform development strategies using frameworks like Flutter, React Native, and Kotlin Multiplatform. It also catalogs various integration strategies, such as reactive data binding and asynchronous message passing, which are essential for maintaining synchronized and responsive user interfaces.

The directory is organized as a technical catalog, making it a resource for discovering high-quality, community-maintained projects that demonstrate standard industry practices. It serves as a starting point for developers looking to explore specific API integrations, UI patterns, and hardware-access implementations across a wide range of application categories.
- [danthareja/contribute-to-open-source](https://awesome-repositories.com/repository/danthareja-contribute-to-open-source.md) (0 ⭐) — The goal of this project is to empower you to contribute code to open source projects on GitHub by teaching you the mechanics of the process in an interactive experience.
- [berriai/litellm](https://awesome-repositories.com/repository/berriai-litellm.md) (50,579 ⭐) — LiteLLM is a unified gateway and proxy server designed to centralize access to over one hundred language model providers. It provides a standardized API interface that abstracts vendor-specific schemas, allowing developers to interact with diverse models through a single, consistent format. By acting as a central traffic management layer, it enables organizations to route, secure, and govern model interactions across multiple deployments.

The platform distinguishes itself through its policy-driven architecture, which uses configuration-based routing to manage traffic distribution, load balancing, and automatic fallbacks without requiring code changes. It incorporates a robust security and compliance layer that enforces content moderation, secret redaction, and fine-grained access control. Additionally, it supports complex operational requirements such as semantic routing, rule-based complexity scoring, and persistent virtual key management for multi-tenant environments.

Beyond core routing, the project provides comprehensive governance and observability tools to monitor usage, track spending, and log request metadata across teams. It includes an integrated software development kit for tool calling and agent orchestration, alongside support for advanced features like response caching, batch processing, and structured output configuration. The system is designed for enterprise-wide deployment, offering features for audit logging, single sign-on integration, and granular cost reporting.
- [louislam/uptime-kuma](https://awesome-repositories.com/repository/louislam-uptime-kuma.md) (88,107 ⭐) — Uptime Kuma is a self-hosted monitoring platform designed to track the availability and performance of network services and websites. It functions as a centralized dashboard that executes asynchronous health checks on a scheduled interval, providing real-time visibility into infrastructure health and service uptime.

The platform distinguishes itself through a dedicated notification engine that dispatches alerts across multiple third-party messaging services, alongside a public status page generator that allows users to communicate service health and historical metrics via custom domains. Its architecture utilizes a reactive, single-page interface that maintains persistent bidirectional connections with the server to push live status updates without requiring manual page refreshes.

The system is built for flexible deployment, supporting containerized environments, native package installations, and bare-metal execution. It manages monitoring configurations and historical data using a local, file-based relational database, while a decoupled abstraction layer ensures that alert delivery logic remains independent of the core monitoring engine.
- [open-source-flash/open-source-flash](https://awesome-repositories.com/repository/open-source-flash-open-source-flash.md) (7,320 ⭐) — This project is an open source specification petition platform and proprietary specification archive. It serves as a markdown-based repository for collecting signatures and community support to urge vendors to open source proprietary software specifications.

The platform functions as a tool for open source specification advocacy and proprietary software archival. It creates permanent records of proprietary standards and documents the community efforts required to transition them to open source licenses, ensuring the preservation of technical knowledge.

The system utilizes a git-driven contribution workflow and distributed version control storage to manage petitions. Data is stored as formatted text files and organized via static file-based routing for archival display and retrieval.
- [langchain-ai/langchain](https://awesome-repositories.com/repository/langchain-ai-langchain.md) (139,458 ⭐) — LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows that manage state, memory, and tool execution.

The project distinguishes itself through a durable execution runtime that maintains persistent state across long-running processes by checkpointing progress to external storage. It models agent workflows as directed graphs, allowing for explicit node-to-node routing and state management. Furthermore, it includes a human-in-the-loop control layer that enables developers to pause execution at defined breakpoints, allowing for manual inspection, modification, and approval of agent actions during runtime.

Beyond its core orchestration capabilities, the framework supports a tiered memory architecture that separates short-term conversation context from long-term persistent data. It also provides comprehensive observability tools for tracing and monitoring execution flows, alongside security features for managing authentication and fine-grained access control. The platform is supported by extensive documentation and standardized interfaces for models, embeddings, and data sources to facilitate the development of production-grade agentic systems.
- [rajnandan1/kener](https://awesome-repositories.com/repository/rajnandan1-kener.md) (4,732 ⭐) — Kener is a self-hosted status page platform and uptime monitoring tool designed to track the health and availability of websites, APIs, and infrastructure components. It functions as an incident management system that automates the detection of service disruptions and provides a public-facing dashboard to communicate real-time system status and maintenance schedules to end users.

The platform distinguishes itself through its multi-tenant architecture, which allows for the operation of multiple independent, branded status pages from a single installation. It supports deep customization of the public interface through static asset injection and template-driven notifications, ensuring that dashboards and alert communications align with specific organizational branding. Administrators can manage these environments using role-based access control and programmatic configuration tools.

The system covers a broad range of operational capabilities, including periodic background polling for health checks, automated incident lifecycle tracking, and the ability to embed status widgets or badges into external websites. It also provides features for localizing the user interface, managing data retention for historical trends, and configuring complex maintenance windows with automated subscriber notifications.

The application is designed for container-based deployment to ensure consistent behavior across infrastructure providers and can also be hosted on a Node.js runtime.
- [langchain-ai/langchainjs](https://awesome-repositories.com/repository/langchain-ai-langchainjs.md) (17,818 ⭐) — LangChain.js is a framework for building, executing, and monitoring stateful agentic applications. It provides an orchestration engine that models workflows as directed graphs, allowing developers to connect language models, data sources, and external tools into modular, multi-step processes.

The platform distinguishes itself through its focus on stateful execution and human-in-the-loop control. It manages agent lifecycles by persisting execution state across threads, enabling fault tolerance and the ability to pause workflows at designated breakpoints for manual review or modification. This architecture supports both autonomous agent orchestration and complex multi-agent systems, with built-in capabilities for streaming real-time execution updates and managing long-term memory.

Beyond core orchestration, the project offers a comprehensive suite of tools for the entire application lifecycle. This includes integrated observability for tracing and evaluating agent performance, schema-enforced data serialization for reliable communication, and extensive support for deployment, security, and infrastructure management.

The project provides a TypeScript-based software development kit and a command-line interface to facilitate local development, testing, and deployment of agentic workflows.
- [swift-open-source/ultratabsaver](https://awesome-repositories.com/repository/swift-open-source-ultratabsaver.md) (290 ⭐) — The open source Tab Manager Extension for Safari.
- [fosrl/pangolin](https://awesome-repositories.com/repository/fosrl-pangolin.md) (21,255 ⭐) — Pangolin is a zero-trust remote access platform designed to provide secure, identity-aware connectivity to private network resources. It functions as a cloud-native network controller that orchestrates encrypted tunnels, traffic routing, and access policies across distributed environments. By leveraging WireGuard for secure data transport, the platform enables authenticated access to internal web applications, terminal sessions, and remote desktops without exposing services to the public internet.

The platform distinguishes itself through a declarative infrastructure model that synchronizes network state using version-controlled manifests. It supports complex connectivity requirements through peer-to-peer NAT traversal, which facilitates direct encrypted connections between nodes, with automatic fallback to server-based relaying when necessary. Additionally, it provides browser-based access to remote resources, eliminating the need for local client software for many common administrative and service-access tasks.

Beyond its core tunneling capabilities, the platform includes a comprehensive suite of tools for traffic management, security, and observability. It features granular access control policies based on user identity, geolocation, and network attributes, alongside automated certificate management and multi-factor authentication. The system also provides extensive monitoring, audit logging, and alerting capabilities to track infrastructure health and security events across multi-site deployments.

Pangolin is designed for containerized and multi-site environments, offering flexible deployment options through standard packaging and automated reconciliation workflows.
- [voltagent/awesome-claude-code-subagents](https://awesome-repositories.com/repository/voltagent-awesome-claude-code-subagents.md) (21,906 ⭐) — This project provides a framework for managing multi-agent systems, designed to automate complex software development, infrastructure, and business workflows. It functions as a multi-agent workflow orchestrator that routes tasks to domain-specific workers while maintaining state persistence and infrastructure automation. By leveraging large language models, the system decomposes high-level objectives into actionable plans, ensuring that complex operations are executed with consistency and reliability.

The framework distinguishes itself through its hierarchical agent registry and policy-driven tool access, which enforce security boundaries by restricting agent operations based on defined functional roles. It utilizes context-aware task routing to match incoming requests with specific agent capabilities and model performance profiles, while implementing deterministic fallback mechanisms to maintain operational continuity when agents encounter errors or context limits. This architecture allows for modular capability expansion and reproducible environment configurations through version-controlled templates.

The system covers a broad capability surface, including automated technical documentation, cloud infrastructure management, and security auditing. It supports diverse domains such as API design, database optimization, and system reliability engineering, providing tools for incident response, performance monitoring, and compliance enforcement. These capabilities are integrated into a command-line interface that enables developers to search, fetch, and deploy specialized subagents directly from the repository.
- [ellerbrock/open-source-badges](https://awesome-repositories.com/repository/ellerbrock-open-source-badges.md) (548 ⭐) — :octocat: Open Source & Licence Badges
- [prowler-cloud/prowler](https://awesome-repositories.com/repository/prowler-cloud-prowler.md) (13,049 ⭐) — Prowler is an automated cloud infrastructure security scanner and posture management tool. It evaluates cloud environments and infrastructure-as-code templates against security benchmarks to identify misconfigurations, vulnerabilities, and compliance gaps that could compromise system integrity.

The platform distinguishes itself through graph-based attack path analysis, which identifies chains of misconfigurations that create exploitable routes for unauthorized access. It utilizes a plugin-based execution model to perform state-based assessments of live environments and static analysis of configuration files, ensuring security coverage across the entire development lifecycle.

The tool provides comprehensive capabilities for continuous security integration, allowing teams to automate compliance reporting by mapping findings to regulatory frameworks. It supports risk prioritization and provides actionable remediation guidance, while enabling the integration of security data into external incident management and monitoring systems through automated reporting pipelines.
- [tapaswenipathak/open-source-programs](https://awesome-repositories.com/repository/tapaswenipathak-open-source-programs.md) (3,856 ⭐) — A list of open source programs.
- [github/opensource.guide](https://awesome-repositories.com/repository/github-opensource-guide.md) (15,530 ⭐) — This project serves as a comprehensive repository of best practices and documentation standards for managing open source software. It provides a foundational framework for establishing project governance, defining contributor roles, and structuring the lifecycle of collaborative software development. By centralizing knowledge on community building and operational transparency, it acts as a guide for launching, maintaining, and scaling healthy software projects.

The project distinguishes itself by offering actionable strategies for the human and organizational aspects of software development that often fall outside of technical implementation. It covers methodologies for formalizing leadership hierarchies, implementing consensus-based decision-making, and enforcing codes of conduct to foster inclusive environments. Furthermore, it provides specific guidance on long-term sustainability, including frameworks for securing financial support, navigating legal requirements, and managing maintainer well-being to prevent burnout.

Beyond its core governance focus, the project encompasses a broad range of operational capabilities. These include standardized workflows for contributor onboarding, security compliance practices such as vulnerability reporting and threat modeling, and quality assurance standards that integrate accessibility and automated maintenance. The documentation is designed to help maintainers navigate the complexities of project health, visibility, and strategic planning throughout the entire lifecycle of an open source initiative.
- [prometheus/alertmanager](https://awesome-repositories.com/repository/prometheus-alertmanager.md) (8,356 ⭐) — Alertmanager is a monitoring notification gateway and routing service that deduplicates, groups, and directs alerts to the correct receivers. It functions as a central manager for Prometheus alerts, using a hierarchical routing tree and label-based matchers to dispatch notifications to external services.

The system employs a peer-to-peer mesh network to coordinate multiple instances in a high availability cluster, ensuring continuous alert processing. It features a dedicated inhibition engine and grouping mechanisms to reduce notification noise by suppressing redundant alerts when related issues are already active.

Capability areas include incident notification management via webhooks and third-party integrations, temporal alert silencing, and active alert limiting to prevent receiver flooding. The service also provides system event recording and event log export for auditing notification deliveries.

Administrative tasks can be performed through a command-line interface for managing silences and routing configurations.
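
To make the routing description more concrete, here is a simplified sketch of a hierarchical routing tree with label-based matchers, deduplication, and grouping. It is not Alertmanager's code or configuration format: the `Route` class, `resolve`, and `dispatch` are hypothetical, matching is exact equality only, and children do not inherit settings from their parent.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Route:
    receiver: str
    matchers: dict = field(default_factory=dict)  # label name -> required value
    group_by: tuple = ()
    children: list = field(default_factory=list)

    def matches(self, labels: dict) -> bool:
        return all(labels.get(k) == v for k, v in self.matchers.items())


def resolve(route: Route, labels: dict) -> Route:
    """Return the deepest route that accepts the alert (first matching child wins)."""
    for child in route.children:
        if child.matches(labels):
            return resolve(child, labels)
    return route


def dispatch(root: Route, alerts: list) -> dict:
    """Drop duplicate label sets, then group alerts per receiver and group key."""
    groups = defaultdict(list)
    seen = set()
    for labels in alerts:
        fingerprint = frozenset(labels.items())
        if fingerprint in seen:
            continue  # identical alert already handled
        seen.add(fingerprint)
        route = resolve(root, labels)
        key = (route.receiver, tuple(labels.get(name) for name in route.group_by))
        groups[key].append(labels)
    return dict(groups)


# Hypothetical routing tree and alerts, for illustration only.
root = Route("default", group_by=("alertname",), children=[
    Route("database-team", {"service": "db"}, ("alertname",)),
    Route("pager", {"severity": "critical"}, ("alertname", "cluster")),
])
alerts = [
    {"alertname": "HighLatency", "service": "db", "severity": "warning"},
    {"alertname": "HighLatency", "service": "db", "severity": "warning"},
    {"alertname": "DiskFull", "severity": "critical", "cluster": "eu"},
    {"alertname": "Heartbeat"},
]
grouped = dispatch(root, alerts)
```

Each key in `grouped` corresponds to one notification that a receiver would get, which is how grouping reduces noise when many related alerts fire at once.
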
- [open-source-society/bioinformatics](https://awesome-repositories.com/repository/open-source-society-bioinformatics.md) (0 ⭐) — Open Source Society University :microscope: Path to a free self-taught education in Bioinformatics! (Archived)
- [clickhouse/clickhouse](https://awesome-repositories.com/repository/clickhouse-clickhouse.md) (48,229 ⭐) — ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring.

The platform distinguishes itself through advanced storage and execution techniques, including vectorized query processing and a merge tree storage engine that maintains performance during massive insertions. It features adaptive subcolumn mapping for semi-structured data and supports native vector search for machine learning and generative AI applications. To facilitate efficient data movement, the engine utilizes zero-copy shared memory buffers, minimizing overhead when interacting with external analytical tools or processing diverse file formats like Parquet, JSON, and Arrow.

Beyond its core storage and processing capabilities, the project provides a comprehensive suite of tools for observability, security, and data integration. It includes built-in support for natural language querying, automated workflow orchestration for AI agents, and extensive diagnostic features for query plan inspection. The platform also offers robust cloud infrastructure management, including support for private networking, compliant deployment strategies, and integrated billing consolidation.
- [upptime/upptime](https://awesome-repositories.com/repository/upptime-upptime.md) (17,066 ⭐) — Upptime is a monitoring system that uses GitHub Actions to check endpoint availability and track response times. It functions as an HTTP availability checker and a static status page generator, converting uptime data into a branded website hosted on GitHub Pages.

The project integrates a GitHub Issues incident manager to automatically create and close tracking tickets for service outages and recovery. It provides a JSON status badge provider to expose uptime percentages and response times for use in external visual badges.

The system covers website uptime monitoring and service performance tracking, including the generation of performance graphs and the scheduling of maintenance windows. It handles infrastructure alerting via chat notifications, email, and SMS, and supports custom status detection based on specific text strings. Status page management includes brand identity customization, custom domain integration, and the injection of external web assets.

Configuration is managed through a structured JSON file, and sensitive credentials can be handled via credential encryption.
- [activities/contributing-to-open-source](https://awesome-repositories.com/repository/activities-contributing-to-open-source.md) (0 ⭐)
- [apache/airflow](https://awesome-repositories.com/repository/apache-airflow.md) (45,902 ⭐) — Airflow is a platform for programmatically authoring, scheduling, and monitoring complex data pipelines. It functions as a workflow automation engine that manages the lifecycle of recurring business processes by executing code-defined task dependencies. By representing workflows as directed acyclic graphs, the system ensures that task execution order and data flow are explicitly defined and reliably maintained across distributed computing environments.

The platform distinguishes itself through a highly modular, provider-based architecture that decouples core orchestration logic from external service integrations. This extensibility allows users to connect diverse cloud services, databases, and storage systems through custom plugins and packages. The system utilizes a distributed task queue to enable horizontal scaling, while a centralized scheduler and metadata-driven state management ensure fault tolerance and visibility across large-scale infrastructure.

Beyond core scheduling, the project provides comprehensive observability through a web-based interface for pipeline visualization, status tracking, and source code inspection. It supports secure operations by integrating with external secret management services and offers robust administrative control through both a command-line interface and a programmatic API. The system is designed for containerized deployment, providing tools for building optimized images and managing complex dependency environments.
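
The idea of deriving execution order from a directed acyclic graph can be sketched with Python's standard `graphlib` module. This does not use Airflow's API: the task names and the `execution_batches` helper are hypothetical, and a real scheduler also handles retries, timing, and distributed workers.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "validate": {"extract"},
    "load": {"transform", "validate"},
}


def execution_batches(deps):
    """Yield groups of tasks whose upstream dependencies have all completed."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        yield ready
        sorter.done(*ready)  # mark the batch finished to unlock downstream tasks


batches = list(execution_batches(dependencies))
```

Tasks within one batch have no dependencies on each other, so a scheduler could run them in parallel, while the batches themselves must run in order.
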
- [arpit456jain/open-source-programs](https://awesome-repositories.com/repository/arpit456jain-open-source-programs.md) (0 ⭐) — I am planning to list some good, beginner-friendly open source programs and their timelines.
- [thehive-project/thehive](https://awesome-repositories.com/repository/thehive-project-thehive.md) (3,891 ⭐) — TheHive is a security incident response platform and multi-tenant case management system. It functions as a Security Orchestration, Automation, and Response (SOAR) tool and a threat intelligence platform designed to coordinate security investigations by managing alerts, cases, and observables.

The platform is distinguished by its multi-tenant architecture, which isolates data across different organizations while supporting selective cross-tenant sharing. It features a SOAR automation engine capable of executing sandboxed JavaScript logic to automate workflows and trigger response actions through external connectors.

The system covers a broad range of capabilities, including incident lifecycle management, threat intelligence synchronization with frameworks like MITRE ATT&CK and MISP, and automated data ingestion. It provides extensive identity and access management through role-based access control and integration with various identity providers.

The software can be installed on Linux, via Docker containers, or deployed to Kubernetes clusters using Helm charts.
- [afonsopacifer/open-source-checklist](https://awesome-repositories.com/repository/afonsopacifer-open-source-checklist.md) (215 ⭐) — :octocat: A guide to help you remember important things when creating an open source project ;D
- [cheeriojs/cheerio](https://awesome-repositories.com/repository/cheeriojs-cheerio.md) (30,386 ⭐) — Cheerio is an HTML and XML parsing library and server-side DOM implementation. It functions as a markup manipulation tool and CSS selector engine, allowing users to parse, query, and modify HTML or XML documents in non-browser environments.

The project provides a DOM-like tree representation of markup strings, enabling programmatic addition, removal, and modification of elements and attributes. It features a prototype-based plugin system that allows the extension of core functionality by adding custom methods to the document prototype.

The library covers a broad range of capabilities including CSS selector querying, recursive tree traversal, and element collection filtering. It also includes utilities for attribute manipulation, CSS class management, node positioning, and the rendering of manipulated trees back into serialized HTML or XML strings.
- [linkedin/school-of-sre](https://awesome-repositories.com/repository/linkedin-school-of-sre.md) (8,093 ⭐) — This project is a comprehensive educational resource and curriculum focused on site reliability engineering, distributed systems, and infrastructure operations. It provides technical guides, a systems engineering course, and instructional manuals designed to teach the principles of managing large-scale computing environments.

The curriculum covers high-level architectural design for scalability and resilience, including fault-tolerant infrastructure, high-availability patterns, and microservices decomposition. It emphasizes the practical application of site reliability engineering through the study of system design, resource estimation, and the elimination of single points of failure.

The material extends into broad operational capabilities, including container orchestration, continuous integration and delivery pipelines, layered observability, and network routing. It also provides detailed instruction on Linux system administration, database management, security auditing, and the implementation of service level indicators and objectives.
- [elastic/detection-rules](https://awesome-repositories.com/repository/elastic-detection-rules.md) (2,508 ⭐) — This project is a detection-as-code framework providing a library of security monitoring rules and predefined detection content for Elasticsearch data indices. It serves as a threat detection rule library designed to identify malicious activity and attack patterns across diverse data streams in cloud and on-premises environments.

The framework implements a detection engineering workflow where rules are defined in TOML and managed as versioned code. It includes a set of command-line utilities for automated rule deployment, metadata searching, and template generation, supported by a Python-based testing framework to validate rule syntax and accuracy before deployment.

The system covers a broad range of security operations, including threat intelligence integration, cloud posture auditing, and security event correlation. It also provides capabilities for anomaly detection, entity risk analysis, and the coordination of security incidents through case management and alert noise suppression.
- [veeral-patel/incidents](https://awesome-repositories.com/repository/veeral-patel-incidents.md) (0 ⭐) — INCIDENTS is a web-based tool for incident response, just like TheHive.
- [kestra-io/kestra](https://awesome-repositories.com/repository/kestra-io-kestra.md) (27,073 ⭐) — Kestra is a declarative workflow orchestrator designed to manage complex task dependencies and automated processes through versioned configuration files. It functions as a distributed platform that decouples task scheduling from execution by offloading computational workloads to a fleet of worker nodes. The system uses a reactive, event-driven engine to initiate workflows automatically in response to external signals, webhooks, schedules, or file system changes.

The platform distinguishes itself through a modular plugin architecture that allows for the integration of custom tasks and external services. It provides an AI-native development environment that incorporates language models to generate, refine, and execute automation logic using natural language prompts. To support diverse operational needs, Kestra implements a multi-tenant execution model that isolates resources, data, and access controls for different teams within a single shared instance.

The system covers a broad range of operational capabilities, including robust state management, granular role-based access control, and comprehensive system auditing. It offers extensive tools for workflow logic, such as conditional branching, parallel task execution, and iterative processing, alongside built-in resilience features like automated retries and failure policies. Users can manage these configurations through a centralized interface that supports visual editing and real-time monitoring of execution status.
- [grafana/grafana](https://awesome-repositories.com/repository/grafana-grafana.md) (74,456 ⭐) — Grafana is an observability data platform designed to aggregate metrics, logs, and traces from diverse sources into a unified environment. It functions as a centralized interface for visualizing complex telemetry data, transforming raw streams into interactive dashboards that support real-time system health tracking and performance monitoring.

The platform distinguishes itself through a plugin-based modular architecture that integrates disparate databases, cloud services, and monitoring tools via a standardized data abstraction layer. This framework allows for the dynamic loading of external components to support varied data sources and visualization types without requiring modifications to the core codebase. Additionally, the system incorporates a rule-based alerting engine that evaluates incoming data streams against defined thresholds to trigger automated notifications for incident response.

Beyond its core visualization and alerting capabilities, the platform provides tools for infrastructure performance monitoring and operational data analysis. It utilizes a declarative, component-driven interface to manage dashboard states and a compiled backend to process high-throughput queries and API requests. The system maintains configuration persistence and state consistency across distributed instances through a centralized metadata storage layer.
- [tapadoo/alerter](https://awesome-repositories.com/repository/tapadoo-alerter.md) (5,511 ⭐) — An Android Alerting Library
- [kubeshark/kubeshark](https://awesome-repositories.com/repository/kubeshark-kubeshark.md) (11,954 ⭐) — Kubeshark is a network observability platform designed for Kubernetes environments, functioning as an eBPF-powered engine for cluster-wide traffic analysis. It captures, indexes, and visualizes network activity and API calls directly from the kernel, providing deep visibility into service-to-service communication without requiring sidecar proxies or manual code instrumentation.

The platform distinguishes itself through its ability to perform protocol-aware traffic dissection and user-space cryptographic hooking, which allows for the inspection of encrypted traffic and the reconstruction of application-layer protocols like HTTP, gRPC, and Kafka. It supports advanced diagnostic capabilities, including AI-driven troubleshooting, forensic analysis of network snapshots, and the correlation of infrastructure events with application-level traffic patterns.

Beyond core monitoring, the system provides a comprehensive suite of tools for managing traffic data, including granular role-based access control, sensitive data redaction, and flexible storage options ranging from ephemeral local buffers to cloud-based object storage. It is built to operate in diverse environments, supporting air-gapped deployments and integrating with standard Kubernetes ingress resources for secure dashboard access.

The project is managed via a command-line interface that facilitates deployment control, custom script execution, and the sharing of specific traffic analysis views through encoded search queries.
- [zachflower/awesome-open-source-supporters](https://awesome-repositories.com/repository/zachflower-awesome-open-source-supporters.md) (681 ⭐) — ⭐️ A curated list of companies that offer their services for free to Open Source projects
- [cockroachlabs/open-sourced-interview-process](https://awesome-repositories.com/repository/cockroachlabs-open-sourced-interview-process.md) (425 ⭐) — Open Sourced Interview Process
- [greenrobot/eventbus](https://awesome-repositories.com/repository/greenrobot-eventbus.md) (24,760 ⭐) — EventBus is a publish-subscribe messaging library designed to facilitate decoupled communication between components in Java applications. It functions as a central hub where producers dispatch events that are routed to subscribers based on the class type of the payload. By using annotation-based markers, the system maps event handlers to specific data types, allowing different parts of an application to exchange information without requiring direct references between classes.

The library distinguishes itself through a focus on performance and execution control. It offers an optional compile-time indexing mechanism in which an annotation processor generates a static lookup table of subscriber methods, replacing the slow runtime reflection scan that otherwise runs when subscribers register. Furthermore, it provides a thread-aware dispatcher that allows developers to configure whether event handlers execute on the main interface thread, in background pools, or synchronously within the posting thread.

Beyond basic routing, the system supports advanced messaging patterns including priority-ordered delivery and sticky events. Sticky events maintain a memory-based cache of recent data, ensuring that late-registering subscribers automatically receive the most current state upon initialization. The library also offers granular control over the event lifecycle, enabling developers to cancel event propagation or manage custom thread pools and error handling strategies to maintain application responsiveness.
- [bluewave-labs/checkmate](https://awesome-repositories.com/repository/bluewave-labs-checkmate.md) (10,093 ⭐) — Checkmate is an open-source, self-hosted tool designed to track and monitor server hardware, uptime, response times, and incidents in real-time with beautiful visualizations. Don't be shy, join here: https://discord.com/invite/NAb6H3UTjK :)
- [aws/aws-cdk](https://awesome-repositories.com/repository/aws-aws-cdk.md) (12,817 ⭐) — The AWS Cloud Development Kit is an infrastructure-as-code framework that enables developers to define and provision cloud resources using familiar programming languages. By utilizing construct-based synthesis, it translates high-level, object-oriented code into declarative AWS CloudFormation templates, allowing for the automated management of complex cloud environments through a centralized, code-driven control plane.

The framework distinguishes itself through its ability to model infrastructure as a dependency-aware resource graph, ensuring that components are provisioned and updated in the correct order. It employs a language-agnostic intermediate representation to synthesize these definitions into platform-specific configurations, while supporting aspect-oriented policy injection to apply security and compliance rules across infrastructure definitions during the synthesis phase.

Beyond core provisioning, the project provides a modular component registry for distributing and reusing pre-configured infrastructure building blocks. It supports multi-account orchestration, allowing for the deployment of consistent resource sets across different regions and accounts from a single template, and includes capabilities for detecting infrastructure drift to ensure deployed environments remain aligned with their defined state.

The project is distributed as a software development kit, providing programmatic interfaces to manage the full lifecycle of cloud resources and integrate infrastructure definitions directly into application codebases.
- [cfpb/open-source-project-template](https://awesome-repositories.com/repository/cfpb-open-source-project-template.md) (214 ⭐) — A project template containing default open source files for new projects
- [dragonflydb/dragonfly](https://awesome-repositories.com/repository/dragonflydb-dragonfly.md) (30,688 ⭐) — Dragonfly is a high-performance, multi-model in-memory data store designed to serve as a drop-in replacement for Redis and Memcached. By utilizing a multi-threaded, shared-nothing architecture and a fiber-based concurrency model, it maximizes CPU utilization and minimizes latency for read and write operations. The system supports a wide range of data structures, including strings, hashes, lists, sets, sorted sets, and JSON documents, while maintaining compatibility with the Redis and Memcached wire protocols and existing client libraries.

What distinguishes Dragonfly is its focus on efficiency and scalability through advanced memory management and request processing. It employs a lock-free, cache-friendly hash table structure and zero-copy serialization to reduce overhead during high-throughput operations. For durability, the system utilizes asynchronous, snapshot-based persistence that captures the state of the dataset without blocking active requests. Furthermore, it provides built-in support for horizontal scaling and cluster management, allowing for the distribution of large datasets across multiple nodes to ensure high availability.

Beyond core storage, the platform includes a comprehensive suite of operational and analytical capabilities. It features integrated support for geospatial data management, real-time message brokering via publish-subscribe patterns, and full-text search. To handle massive datasets efficiently, the engine incorporates probabilistic data structures for cardinality estimation, frequency tracking, and membership testing. These features are complemented by robust administrative tools, including access control, request rate limiting, and detailed server monitoring.
- [admol/systemdesign](https://awesome-repositories.com/repository/admol-systemdesign.md) (2,645 ⭐) — This project is a reference library of architectural blueprints, study materials, and design patterns for building scalable, high-availability distributed systems. It serves as a technical guide for scalability engineering, providing structural solutions for common engineering challenges.

The repository focuses on distributed systems design, covering essential patterns for data replication, consensus algorithms, and transaction management. It distinguishes itself by offering detailed blueprints for specialized domains, including real-time data streaming, large-scale data storage, and high-availability infrastructure.

The project covers a broad range of capability areas, including traffic management and rate limiting, geospatial services, payment processing, and messaging and event streaming. It also details implementations for search and indexing, monitoring and observability, web crawling, and financial trading engines.

The library also includes guides on distributed primitives such as consistent hashing and sharding, along with material on estimating system capacity.
- [formbricks/formbricks](https://awesome-repositories.com/repository/formbricks-formbricks.md) (12,391 ⭐) — Formbricks is an open-source survey and feedback platform designed to help teams capture and analyze user insights through targeted, in-app, and website-based interactions. It functions as a comprehensive customer experience analytics system that allows organizations to maintain full control over their data, user attributes, and survey workflows.

The platform distinguishes itself through its event-driven architecture, which enables precise behavioral targeting by triggering surveys based on specific user actions or application events. It supports deep integration with external ecosystems by automatically synchronizing response data to CRMs, databases, and communication tools, while providing programmatic interfaces for managing resources and automating feedback loops.

Beyond core collection, the system includes advanced logic for conditional branching, scoring, and personalized routing to create adaptive survey experiences. It offers extensive customization options, including white-labeling, CSS overrides, and multi-channel distribution across web, mobile, and email environments.

The platform is built for self-hosting, supporting containerized deployments with built-in multi-tenant data isolation and enterprise-grade security features like single sign-on and role-based access control.
- [open-source-society/computer-science](https://awesome-repositories.com/repository/open-source-society-computer-science.md) (0 ⭐) — Open Source Society University Path to a free self-taught education in Computer Science!
- [open-source-legal/opencontracts](https://awesome-repositories.com/repository/open-source-legal-opencontracts.md) (1,356 ⭐) — The open document intelligence platform for builders and hackers - DMS for the agentic world
- [hummingbot/hummingbot](https://awesome-repositories.com/repository/hummingbot-hummingbot.md) (18,907 ⭐) — Hummingbot is an open-source framework designed for building, backtesting, and deploying autonomous trading agents and algorithmic strategies across centralized and decentralized cryptocurrency exchanges. It provides a modular environment where users can orchestrate containerized bots to execute complex market-making, grid trading, and arbitrage operations.

The platform distinguishes itself through a skill-based architecture that integrates large language models, enabling users to monitor market conditions and control trading operations via natural language commands. It features a unified connectivity layer that standardizes diverse exchange APIs, allowing for consistent order execution, liquidity provisioning, and real-time data processing across global financial markets.

The system includes comprehensive tools for quantitative analysis, including a simulation engine for validating strategies against historical data and structured configuration management for auditability. It also incorporates safety mechanisms such as automated risk controls, secure wallet and identity management, and performance monitoring to ensure reliable operation in live environments.

The project provides a complete development environment for building custom strategies, supported by interactive API documentation and automated installation tools for local deployment.
- [analogj/scrutiny](https://awesome-repositories.com/repository/analogj-scrutiny.md) (7,906 ⭐) — Scrutiny is a distributed hardware monitoring system and predictive drive failure analyzer. It provides a centralized management platform and web-based dashboard for tracking hard drive health and S.M.A.R.T. metrics across multiple remote servers.

The system functions as a S.M.A.R.T. alerting gateway and storage health trend visualizer. It estimates hardware risk by comparing drive attributes against real-world failure thresholds and records historical data to identify gradual degradation patterns that may not trigger immediate alerts.

Capabilities include distributed data collection via remote agents, automated storage device detection, and risk level evaluation. The platform incorporates time-series metric storage for long-term trend analysis and a multi-channel notification system that sends failure alerts through webhooks.
- [bitwarden/server](https://awesome-repositories.com/repository/bitwarden-server.md) (18,074 ⭐) — This project provides a comprehensive, self-hosted platform for zero-knowledge credential management and enterprise secrets orchestration. It functions as a secure vault that ensures all encryption and decryption processes occur exclusively on the client side, preventing the server from ever accessing plaintext data. By combining identity federation with robust access controls, the system enables organizations to centralize the management of passwords, passkeys, and sensitive infrastructure credentials.

The platform distinguishes itself through its focus on both human-centric security and automated machine-to-machine workflows. It supports advanced authentication methods including hardware security keys, passkeys, and biometric unlocking, while simultaneously offering programmatic interfaces for injecting secrets directly into development pipelines and automated infrastructure deployments. This dual-purpose design allows teams to maintain strict data sovereignty through local hosting and containerized deployments while enforcing granular governance across their entire user base.

Beyond core storage, the system includes extensive observability and compliance tools, such as immutable audit logging, credential risk analysis, and integration with external security information and event management platforms. It also facilitates secure collaboration through encrypted information sharing, emergency access delegation, and automated identity provisioning. The software is designed for flexible deployment across diverse infrastructure environments and includes command-line utilities for administrative tasks, bulk data migration, and secret retrieval.
- [apache/incubator-skywalking](https://awesome-repositories.com/repository/apache-incubator-skywalking.md) (24,832 ⭐) — SkyWalking is a comprehensive observability stack and application performance monitoring platform. It functions as a distributed tracing system and an AI application monitor, providing a centralized suite for collecting and analyzing logs, metrics, and traces to maintain the health of containerized architectures.

The platform distinguishes itself through a service topology visualizer that renders interactive maps of infrastructure dependencies and communication patterns. It also includes specialized capabilities for generative AI workflow observation to track the execution flow and performance of AI components within a software stack.

The system covers a broad range of monitoring capabilities, including automated performance alerting driven by machine learning for anomaly detection. Its telemetry surface encompasses distributed request tracing, log pipeline management, and the aggregation of performance metrics for microservices and system resource profiling.
