3 repositories
Integration of development and operations for automated delivery.
Explore 3 awesome GitHub repositories matching software engineering & architecture · DevOps Practices.
This project serves as a comprehensive knowledge base and reference for distributed systems engineering and enterprise software architecture. It provides a structured collection of technical resources, design patterns, and methodologies intended to assist in the design, maintenance, and scaling of complex, high-performance software environments. The repository distinguishes itself by offering deep dives into core architectural concepts such as actor-based concurrency, aspect-oriented interception, and inversion-of-control containers. It emphasizes the practical application of distributed syst
Integrate development and operations to accelerate delivery through automated infrastructure provisioning and continuous monitoring.
The Byte Book is an open-source book that covers cloud-native infrastructure, focusing on kernel networking, Kubernetes, service meshes, and containers. It serves as a technical reference for designing stable and cost-effective infrastructure, combining DevOps workflows and site reliability engineering principles. The book provides a deep dive into Kubernetes networking for production clusters, including the Container Network Interface (CNI) and service mesh integration. It also covers container runtime operations, service mesh architecture for observability and traffic management, and Linux ker
Applies DevOps and site reliability engineering principles to balance system stability, efficiency, and operational cost.
Litmus is a cloud-native chaos engineering platform and fault-injection tool used to design and run controlled simulations of infrastructure failures within Kubernetes environments. It serves as a resilience testing framework for analyzing system behavior during induced outages to identify weaknesses and potential downtime. The project works as a GitOps chaos orchestrator, using declarative version control to automate the deployment and scheduling of resilience tests. It provides tools for managing chaos workflows and orchestrating experiment sequences to visualize and test infrastructure stability. The platform covers steady-state validation through metrics-based monitoring and can export experiment results for performance analysis. It includes support for multi-tenant access management and namespace isolation, as well as bridges for integrating third-party fault-injection tools and custom templates.
Analyzes system behavior during induced outages to determine if infrastructure requires stability tuning.