# Cloud Engineering Roadmaps for AWS Azure and GCP

> Search results for `cloud engineer roadmap for AWS, Azure and GCP` on awesome-repositories.com. 91 total matches; showing the first 50.

Explore on the web: https://awesome-repositories.com/q/cloud-engineer-roadmap-for-aws-azure-and-gcp

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [this search on awesome-repositories.com](https://awesome-repositories.com/q/cloud-engineer-roadmap-for-aws-azure-and-gcp).**

## Results

- [bregman-arie/devops-exercises](https://awesome-repositories.com/repository/bregman-arie-devops-exercises.md) (82,879 ⭐) — This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding challenges, and hands-on scenarios that cover the full spectrum of software delivery and operational workflows.

The repository distinguishes itself through a modular, domain-specific structure that links instructional problem statements with verified implementation examples. By employing a standardized documentation schema, it provides a predictable learning path for mastering complex technical concepts, ranging from infrastructure-as-code patterns and container orchestration to cloud platform administration and security best practices.

The content spans a wide array of technical domains, including automated configuration management, distributed system monitoring, database operations, and version control. It provides deep dives into specific tooling for cloud provisioning, container networking, and service deployment, ensuring that learners can validate their technical skills through isolated, practical exercises.

All instructional materials are organized into a unified taxonomy of markdown-based documents, allowing users to navigate and study specific technical topics at their own pace.
- [elastic/elasticsearch-cloud-aws](https://awesome-repositories.com/repository/elastic-elasticsearch-cloud-aws.md) (576 ⭐) — AWS Cloud Plugin for Elasticsearch
- [kamranahmedse/developer-roadmap](https://awesome-repositories.com/repository/kamranahmedse-developer-roadmap.md) (357,434 ⭐) — Developer Roadmap is a community-driven platform that provides structured, graph-based learning paths for software engineering. It serves as a comprehensive knowledge repository where technical domains are organized into visual sequences to guide professional skill acquisition and career growth.

The project distinguishes itself through a collaborative ecosystem that enables users to contribute roadmaps, curate industry best practices, and maintain professional profiles. It integrates diagnostic assessment frameworks to evaluate technical proficiency, helping developers identify knowledge gaps and prepare for professional interviews through targeted learning sequences.

Beyond its core mapping capabilities, the platform offers practical project ideas and interactive tutoring to reinforce engineering concepts. It provides a centralized space for the community to share resources, track progressive skill development, and navigate complex technical landscapes.
- [appsecco/breaking-and-pwning-apps-and-servers-aws-azure-training](https://awesome-repositories.com/repository/appsecco-breaking-and-pwning-apps-and-servers-aws-azure-training.md) (952 ⭐) — Course content, lab setup instructions and documentation of our very popular Breaking and Pwning Apps and Servers on AWS and Azure hands on training!
- [ignitetechnologies/mindmap](https://awesome-repositories.com/repository/ignitetechnologies-mindmap.md) (8,656 ⭐) — Mindmap is a cybersecurity knowledge base and reference library that organizes security tools, frameworks, and methodologies into a visual knowledge map. It functions as a curated directory of cheat sheets and command guides for offensive and defensive security operations, presented as a hierarchical interface with collapsible nodes.

The project converts structured markdown files into navigable visual trees to facilitate the study of penetration testing workflows and DevOps learning roadmaps. It also serves as a security compliance framework, providing structured mappings of NIST and ISO 27001 controls for information security auditing.

The platform covers a wide range of security domains, including tool cataloging for reconnaissance and reverse engineering, privilege escalation guides, and reference materials for active directory pentesting and network traffic analysis.

The knowledge base is built using static content generation and a JSON-driven metadata catalog to populate its searchable lists and filterable galleries.
- [cloudquery/cloudquery](https://awesome-repositories.com/repository/cloudquery-cloudquery.md) (6,438 ⭐) — CloudQuery is a cloud infrastructure ETL tool and multi-cloud data pipeline designed to collect, synchronize, and normalize resource metadata from various cloud providers and SaaS platforms. It functions as a centralized asset inventory manager and security posture manager, extracting configuration and state data into relational databases, data lakes, or data warehouses.

The system distinguishes itself by transforming complex, nested cloud API responses into flat relational tables, enabling the use of standard SQL for asset querying and analysis. It employs a modular plugin system for data extraction and driver-based adapters for destination-agnostic loading, allowing metadata to be pushed into diverse storage backends.

The platform covers several broad capability areas, including cloud security posture management, FinOps cost optimization, and infrastructure compliance auditing. It utilizes SQL-based transformation pipelines to implement security frameworks, detect configuration drift, and identify underutilized resources. Additionally, the tool provides event-driven responses to fire webhooks or alerts when policy violations occur.
- [bootdotdev/curriculum](https://awesome-repositories.com/repository/bootdotdev-curriculum.md) (3,415 ⭐) — This project is an interactive programming curriculum and educational system designed to teach computer science and software engineering. It provides a structured set of courses and professional roadmaps focused on backend engineering, DevOps, and systems fundamentals.

The platform is distinguished by an AI-powered coding tutor that provides Socratic guidance and contextual hints to help students find solutions independently. It features a browser-based code sandbox using WebAssembly to eliminate local environment setup, alongside automated test-based grading and spaced-repetition logic to reinforce difficult concepts.

The curriculum covers a broad range of technical domains, including programming languages such as Go, Python, and TypeScript, as well as relational database design, container orchestration with Kubernetes, and cloud operations. It also includes professional development resources for technical interview preparation and portfolio construction.

Learning engagement is managed through gamified incentives like experience points and leaderboards, while progress is tracked via sequenced learning paths and AI-generated coding challenges.
- [onyx-dot-app/onyx](https://awesome-repositories.com/repository/onyx-dot-app-onyx.md) (17,491 ⭐) — Onyx is an enterprise-grade AI platform designed for knowledge management, search, and autonomous agent orchestration. It functions as a centralized system that aggregates unstructured organizational data, enabling secure, context-aware retrieval and interaction across internal documents and communication history. By integrating retrieval-augmented generation with multi-model orchestration, the platform provides a unified interface for teams to query internal knowledge bases and execute complex, multi-step business processes.

The platform distinguishes itself through a focus on private infrastructure and strict security, allowing organizations to deploy services on-premise or in isolated containers to meet data residency requirements. It features a modular data connector framework that indexes information from disparate third-party applications, ensuring that all search and chat interactions adhere to existing role-based access controls. Furthermore, the system supports agentic workflows that decompose complex research requests into parallel sub-queries, synthesizing evidence-based responses from both internal data and live web research.

Beyond its core search and retrieval capabilities, the platform includes tools for managing the full lifecycle of AI integration. This includes administrative oversight for team collaboration, benchmarking and cost analysis for various language models, and the ability to configure specialized agents with unique instructions and tool access. Users can interact with these capabilities through a web interface, integrated messaging platforms, or a dedicated desktop application.
- [dswh/ai-engineer-roadmap](https://awesome-repositories.com/repository/dswh-ai-engineer-roadmap.md) (648 ⭐) — A roadmap describing the required skills, learning resources and sample tools to become an AI Engineer
- [mbianchidev/platform-engineering-roadmap](https://awesome-repositories.com/repository/mbianchidev-platform-engineering-roadmap.md) (129 ⭐) — An opinionated platform engineering roadmap - in the form of a website
- [sidpalas/devops-directive-docker-course](https://awesome-repositories.com/repository/sidpalas-devops-directive-docker-course.md) (3,109 ⭐) — This project is a Docker educational course and containerization training material. It provides a structured learning path and a DevOps curriculum focused on bundling software and dependencies into standalone images to ensure consistent environment deployment.

The material covers the operational workflows of containerized applications within a software delivery pipeline. This includes instruction on Docker application packaging and the integration of containerization into the development lifecycle to standardize how applications are built, shipped, and run.

The course addresses the setup of microservices environments and the deployment of portable images. It covers the fundamentals of container-based application isolation, declarative image definitions, and the use of layered file systems and virtualized network bridging.
- [clickhouse/clickhouse](https://awesome-repositories.com/repository/clickhouse-clickhouse.md) (48,229 ⭐) — ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring.

The platform distinguishes itself through advanced storage and execution techniques, including vectorized query processing and a merge tree storage engine that maintains performance during massive insertions. It features adaptive subcolumn mapping for semi-structured data and supports native vector search for machine learning and generative AI applications. To facilitate efficient data movement, the engine utilizes zero-copy shared memory buffers, minimizing overhead when interacting with external analytical tools or processing diverse file formats like Parquet, JSON, and Arrow.

Beyond its core storage and processing capabilities, the project provides a comprehensive suite of tools for observability, security, and data integration. It includes built-in support for natural language querying, automated workflow orchestration for AI agents, and extensive diagnostic features for query plan inspection. The platform also offers robust cloud infrastructure management, including support for private networking, compliant deployment strategies, and integrated billing consolidation.
- [kestra-io/kestra](https://awesome-repositories.com/repository/kestra-io-kestra.md) (27,073 ⭐) — Kestra is a declarative workflow orchestrator designed to manage complex task dependencies and automated processes through versioned configuration files. It functions as a distributed platform that decouples task scheduling from execution by offloading computational workloads to a fleet of worker nodes. The system uses a reactive, event-driven engine to initiate workflows automatically in response to external signals, webhooks, schedules, or file system changes.

The platform distinguishes itself through a modular plugin architecture that allows for the integration of custom tasks and external services. It provides an AI-native development environment that incorporates language models to generate, refine, and execute automation logic using natural language prompts. To support diverse operational needs, Kestra implements a multi-tenant execution model that isolates resources, data, and access controls for different teams within a single shared instance.

The system covers a broad range of operational capabilities, including robust state management, granular role-based access control, and comprehensive system auditing. It offers extensive tools for workflow logic, such as conditional branching, parallel task execution, and iterative processing, alongside built-in resilience features like automated retries and failure policies. Users can manage these configurations through a centralized interface that supports visual editing and real-time monitoring of execution status.
- [milanm/devops-roadmap](https://awesome-repositories.com/repository/milanm-devops-roadmap.md) (18,752 ⭐) — DevOps-Roadmap is a comprehensive educational repository and knowledge base designed to guide technical professionals through the complexities of modern software engineering. It functions as a structured curriculum and reference library, covering the full spectrum of skills required to master system architecture, infrastructure management, and cloud operations.

The project distinguishes itself by bridging the gap between high-level architectural design and the practical realities of engineering leadership. It provides curated insights into distributed systems, data consistency, and scalable design patterns, while simultaneously offering frameworks for managing high-performing teams, navigating corporate dynamics, and fostering psychological safety within technical organizations.

Beyond core architecture, the repository encompasses a broad capability surface that includes professional development, productivity optimization, and the integration of emerging technologies. It offers guidance on implementing AI-driven workflows, managing large-scale machine learning lifecycles, and applying evidence-based metrics to track team performance and development health.

The repository serves as a centralized resource for engineers at all career stages, providing access to industry-standard principles, technical interview preparation materials, and strategic coaching frameworks.
- [azure/azure-sdk-for-ruby](https://awesome-repositories.com/repository/azure-azure-sdk-for-ruby.md) (0 ⭐) — This project provides a Ruby package that makes it easy to access and manage Microsoft Azure Services like Storage, Service Bus and Virtual Machines.
- [quarkusio/quarkus](https://awesome-repositories.com/repository/quarkusio-quarkus.md) (15,479 ⭐) — Quarkus is a Kubernetes-native Java framework designed for building high-performance, memory-efficient applications. It utilizes ahead-of-time native compilation to transform Java code into standalone, optimized binaries that eliminate the need for a virtual machine, enabling rapid startup and reduced memory consumption. By performing code augmentation during the build phase, it shifts heavy processing tasks away from runtime, ensuring that applications are optimized for cloud-native environments.

The framework distinguishes itself through a unified approach to reactive and imperative programming, allowing developers to mix non-blocking, event-driven logic with traditional blocking code. It features a specialized dependency injection container optimized for build-time resolution and supports virtual thread concurrency to improve throughput in high-concurrency environments. Its container-native lifecycle management ensures seamless integration with cloud infrastructure, providing automated health monitoring and service orchestration.

Quarkus covers a broad capability surface, including comprehensive support for RESTful web services, event-driven messaging, and secure identity management. It integrates with standard enterprise specifications and provides extensive tooling for automated infrastructure provisioning, distributed tracing, and observability. The platform also includes a developer-focused dashboard and live-coding capabilities to streamline the development lifecycle.

The project provides extensive documentation and a modular extension system that allows developers to add features while maintaining native compatibility. It is designed to be installed and managed through standard build automation tools, supporting a wide range of deployment targets including serverless functions and managed Kubernetes clusters.
- [azure/azure-sdk-for-rust](https://awesome-repositories.com/repository/azure-azure-sdk-for-rust.md) (878 ⭐) — This repository is for the active development of the Azure SDK for Rust. For consumers of the SDK we recommend visiting Docs.rs and looking up the docs for any of libraries in the SDK.
- [ebazhanov/linkedin-skill-assessments-quizzes](https://awesome-repositories.com/repository/ebazhanov-linkedin-skill-assessments-quizzes.md) (28,781 ⭐) — This project is a technical quiz reference database and answer key designed for passing LinkedIn skill assessments and other professional technical certifications. It serves as a searchable repository of verified questions and correct answers used to earn skill badges and validate professional proficiency.

The database covers a wide range of technical domains, including various programming languages, database technologies, and cloud infrastructure certifications such as AWS Lambda and REST API assessments. It functions as a study guide for those preparing for industry-standard technical tests and coding evaluations.

The system utilizes a flat-file knowledge base and static-file data storage to organize answers. Users can retrieve specific information by querying the starting words of a question through a search-based interface.
- [microsoft/web-dev-for-beginners](https://awesome-repositories.com/repository/microsoft-web-dev-for-beginners.md) (95,883 ⭐) — This project is an open-source educational curriculum designed to facilitate technical skill acquisition through a structured, project-based learning framework. It serves as a centralized knowledge base that guides learners through foundational web development concepts, modern programming logic, and advanced technical workflows. By organizing content into modular, self-contained exercises, the repository bridges the gap between theoretical knowledge and practical application.

What distinguishes this platform is its hierarchical curriculum mapping, which connects basic web standards to specialized training in emerging technologies. The content is maintained through an open-source contribution model, allowing the community to refine instructional materials and ensure their ongoing relevance. Beyond traditional web development, the curriculum includes dedicated modules for cloud infrastructure, generative artificial intelligence, and the integration of intelligent coding assistants into development workflows.

The repository provides a comprehensive suite of pedagogical resources, including video tutorials, sketchnotes, and knowledge assessments to validate technical comprehension. To support diverse learning environments, the instructional materials are compiled into static sites and portable document formats, enabling high-performance delivery and offline access. The project is fully documented as structured text, allowing for collaborative maintenance and version control.
- [signoz/signoz](https://awesome-repositories.com/repository/signoz-signoz.md) (27,355 ⭐) — SigNoz is a full-stack observability platform designed to collect, store, and visualize metrics, logs, and distributed traces in a unified environment. It leverages OpenTelemetry-based data collection to ingest telemetry from diverse sources using vendor-neutral protocols, ensuring interoperability across complex microservices architectures. The platform utilizes a high-performance columnar storage engine to enable rapid aggregation and filtering, providing a centralized backend for monitoring application health and performance.

What distinguishes the platform is its focus on automated instrumentation and semantic correlation. It allows users to capture telemetry data across various programming languages and frameworks without manual code changes, often requiring only simple environment variable updates. Once ingested, the system automatically links logs, metrics, and traces through shared identifiers, enabling seamless navigation between different telemetry types during root cause analysis. The frontend further supports this by using virtualized rendering to efficiently display complex distributed traces containing millions of spans.

The platform provides a comprehensive suite of tools for infrastructure monitoring, application performance tracking, and log management. Users can define complex alert conditions and manage monitoring configurations as version-controlled resources, ensuring consistency across deployment environments. Additionally, the system includes specialized support for monitoring large language model applications and provides visual query pipelines that translate user-defined filters into optimized database queries for real-time dashboard generation.

The entire observability stack can be deployed using container orchestration tools, with built-in utilities for verifying service status and managing data retention.
- [azure/azure-sdk-for-go](https://awesome-repositories.com/repository/azure-azure-sdk-for-go.md) (1,829 ⭐) — This repository is for active development of the Azure SDK for Go. For consumers of the SDK we recommend visiting our public developer docs at:
- [priyankavergadia/google-cloud-4-words](https://awesome-repositories.com/repository/priyankavergadia-google-cloud-4-words.md) (8,224 ⭐) — This project is a cloud computing knowledge base and directory of products designed to help developers navigate and distill complex infrastructure offerings. It functions as a high-level reference for Google Cloud Platform, providing visual summaries and concise cheat sheets of core service capabilities.

The repository includes a cloud service comparison guide that maps Google Cloud features to equivalent services offered by other major cloud platform providers. This mapping assists in identifying equivalent service capabilities across different platforms.

The content is delivered via a static site that utilizes markdown-based authoring, metadata-driven filtering, and a responsive grid layout to organize service summaries and cheat sheets.
- [azure/azure-sdk-for-net](https://awesome-repositories.com/repository/azure-azure-sdk-for-net.md) (5,937 ⭐) — The Azure SDK for .NET is a collection of client and management libraries that enable .NET applications to interact with cloud services through a consistent, well-defined programming model. It provides a unified interface for authenticating, configuring HTTP pipelines, and calling service methods either synchronously or asynchronously, with support for pagination, long-running operations, and structured error handling.

The SDK distinguishes itself through comprehensive authentication options, including connection strings, OAuth token credentials, managed identity, service principals, and developer credentials for local testing. It offers robust testing support through mockable service clients, subclients, long-running operations, and model graphs, all enabled by protected constructors and virtual methods. The SDK also provides configurable HTTP pipeline policies, automatic retry of failed requests with customizable delay and count rules, and proxy routing supporting multiple authentication schemes.

Beyond core service interaction, the SDK covers application hosting, data storage and synchronization, messaging and notifications, monitoring and observability, and automation through serverless event-driven workflows. It includes capabilities for provisioning and managing cloud infrastructure, deploying virtual networks, and hosting applications on managed services with built-in scalability and high availability. The SDK also supports building AI-powered applications that integrate generative AI and large language models for chat, image generation, and agent orchestration.

The libraries are distributed as NuGet packages targeting .NET Standard 2.0, with each package documented through C# XML comments and accompanied by README files and ordered code samples.
- [kedacore/keda](https://awesome-repositories.com/repository/kedacore-keda.md) (10,314 ⭐) — KEDA is a Kubernetes event-driven autoscaler and cloud event scaling engine. It functions as a custom metrics provider that monitors external event sources—including message brokers, databases, and cloud metrics—to dynamically adjust the replica counts of containerized workloads.

The project is distinguished by its scale-to-zero workflow, which reduces workloads to zero replicas during inactivity and automatically restarts them when new events are detected. It operates as a multi-cloud event trigger system, using a pluggable scaler interface to integrate with a wide array of third-party services and cloud identity providers.

The system manages the scaling of various resource types, including deployments and discrete Kubernetes jobs. It provides comprehensive identity and authentication support via integration with cloud secret managers, IAM roles, and vault services. Additionally, it includes observability features for exporting telemetry via OpenTelemetry and tools for calculating complex scaling logic using multi-source metric aggregation.
- [trufflesecurity/trufflehog](https://awesome-repositories.com/repository/trufflesecurity-trufflehog.md) (24,630 ⭐) — Trufflehog is a security tool designed to continuously monitor code repositories and cloud environments to detect, verify, and remediate exposed sensitive credentials and API keys. It functions as a comprehensive secret scanning engine that integrates directly into deployment pipelines and version control systems to intercept sensitive data before it is committed or pushed. By utilizing read-only operations and volatile memory processing, the system ensures that discovered credentials are never stored persistently, maintaining strict data privacy throughout the scanning lifecycle.

The platform distinguishes itself through a privacy-focused architecture that relies on cryptographic fingerprinting to track and deduplicate findings without ever transmitting or storing raw sensitive values. It supports distributed scanning via independent agents that connect to a central dashboard, allowing for localized analysis while maintaining network isolation. Furthermore, the system provides automated incident response capabilities, including secret rotation and revocation, which help organizations minimize the window of vulnerability for compromised credentials.

Beyond core detection, the project offers a broad capability surface for enterprise-wide access governance and security compliance. It includes modular detection logic for custom rule definitions, integration with external identity providers for role-based access control, and extensive monitoring across cloud storage, container infrastructure, and collaboration platforms. The system also provides detailed metadata tracing to link findings to specific users, pipelines, or commits, facilitating efficient remediation and auditability across large-scale development environments.
- [datastacktv/data-engineer-roadmap](https://awesome-repositories.com/repository/datastacktv-data-engineer-roadmap.md) (12,747 ⭐) — This project is a collection of specialized study guides and roadmaps centered on computer science, data engineering, and machine learning fundamentals. It provides a structured curriculum of technical competencies, tools, and skills required to transition into professional data engineering roles.

The project features a data engineering skill map that visually organizes databases, processing architectures, and infrastructure tools. It also includes a machine learning learning path covering supervised and unsupervised learning techniques alongside model operations.

The curriculum covers broad capability areas including machine learning operations, technical skill mapping, and computer science fundamentals. To ensure accessibility, the project provides text-based alternatives for its visual guides.
- [datatalksclub/data-engineering-zoomcamp](https://awesome-repositories.com/repository/datatalksclub-data-engineering-zoomcamp.md) (42,483 ⭐) — This project is an open-source educational curriculum designed to provide comprehensive training in data engineering. It focuses on building scalable data pipelines and managing cloud-native infrastructure through a structured, self-paced program that combines technical explanations with hands-on practical exercises.

The curriculum distinguishes itself by emphasizing industry-standard methodologies, specifically teaching students how to implement infrastructure as code and manage data workflows through orchestration tools. By utilizing container-based environment isolation and declarative configuration, the program ensures that learners gain experience with reproducible deployments and consistent development environments across distributed systems.

The training covers a broad range of technical topics, including the design of automated data processing tasks and the configuration of cloud resources. The materials are organized into modular, progressive units that build foundational knowledge before advancing to complex engineering workflows.

The course materials are hosted in a centralized repository, which facilitates community-supported updates and collaborative improvements to the educational assets.
- [micronaut-projects/micronaut-gcp](https://awesome-repositories.com/repository/micronaut-projects-micronaut-gcp.md) (0 ⭐) — This project includes integration between Micronaut and Google Cloud Platform (GCP).
- [microsoft/data-science-for-beginners](https://awesome-repositories.com/repository/microsoft-data-science-for-beginners.md) (35,657 ⭐) — This project is a comprehensive educational curriculum designed to teach the fundamental concepts, workflows, and tools of data science. It provides a structured learning path that covers the end-to-end data science lifecycle, including data acquisition, maintenance, processing, and pattern discovery, while grounding theoretical knowledge in practical, real-world applications.

The curriculum distinguishes itself through a data-driven pedagogical design that utilizes interactive, notebook-based lessons. By combining narrative text with live code blocks, the platform allows learners to experiment with data analysis and visualization techniques in real time. The content is organized into a modular structure that sequences topics by progressive complexity, ensuring that foundational skills are established before moving into more advanced analytical techniques.

The material encompasses a broad capability surface, including tutorials on data visualization, relational database querying, and the integration of cloud computing into data science workflows. These resources rely on an established ecosystem of open-source libraries to ensure that the skills acquired are applicable to professional environments.

The repository is hosted as a centralized collection of instructional modules and guided exercises. It includes self-contained code samples and assignments that require a standard Python environment to execute.
- [hashicorp/terraform](https://awesome-repositories.com/repository/hashicorp-terraform.md) (48,720 ⭐) — Terraform is a declarative infrastructure-as-code tool designed to manage the lifecycle of cloud and on-premises resources. It functions as a workflow engine that reconciles a defined desired state against real-world infrastructure, using a persistent state-tracking layer to maintain consistency and visibility across distributed environments. By mapping infrastructure components into a directed acyclic graph, the system calculates the optimal order for provisioning, updating, or destroying resources.

The platform is distinguished by its extensible plugin-based architecture, which decouples core orchestration logic from vendor-specific service APIs. This allows users to manage diverse infrastructure across multiple providers through a unified workflow. The system enforces predictability by separating operations into a three-stage lifecycle—planning, applying, and state-updating—and supports policy-as-code evaluation to validate changes against security and compliance rules before any modifications are executed.

Beyond core orchestration, the tool provides robust support for collaborative management, including workspace isolation for environment separation and module sharing for distributing standardized infrastructure patterns. It integrates into broader development ecosystems through support for programmatic definition in various languages, external system hooks, and comprehensive tooling for configuration debugging and editor assistance.
- [stemmlerjs/software-design-and-architecture-roadmap](https://awesome-repositories.com/repository/stemmlerjs-software-design-and-architecture-roadmap.md) (3,402 ⭐) — 🧱 The software design and architecture roadmap for any developer
- [aobingjava/javafamily](https://awesome-repositories.com/repository/aobingjava-javafamily.md) (36,959 ⭐) — JavaFamily is a curated set of learning paths and reference guides for backend engineering, distributed systems, and virtual machine internals. It provides a structured curriculum covering the Java language, operating system concepts, and network protocols.

The project features detailed study guides for the Java virtual machine architecture, including memory management and garbage collection. It also includes a comprehensive reference for distributed systems, covering microservices, remote procedure call frameworks, and scalable system design.

The collection covers a broad range of technical capabilities, including concurrency and multithreading, database performance optimization, and TCP/IP networking fundamentals. It also provides material on performance tuning, such as CPU troubleshooting, memory leak diagnosis, and the application of software design patterns.

Additional resources are provided for professional development through technical interview preparation, including common coding questions and resume templates.
- [encoredev/encore](https://awesome-repositories.com/repository/encoredev-encore.md) (12,049 ⭐) — Encore is a distributed systems framework designed to unify backend development, infrastructure provisioning, and observability. It functions as an infrastructure-as-code platform that allows developers to define cloud resources, databases, and messaging topics directly within their application code. By analyzing these declarations at compile-time, the system automatically manages the deployment of cloud resources and security policies, ensuring parity between local development and production environments.

The platform distinguishes itself through its integrated development experience, which includes a local workspace that mirrors production infrastructure to facilitate testing and debugging. It provides automated AI-assisted development tools that leverage application metadata and runtime telemetry to aid in code generation and performance analysis. Furthermore, the framework enforces architectural standards and automates the creation of ephemeral, production-like environments for every pull request, streamlining the validation process before deployment.

Beyond its core orchestration capabilities, the framework includes a comprehensive suite for building type-safe APIs and event-driven services. It handles the complexities of service communication, including automated client library generation, request validation, and distributed tracing instrumentation. The system also incorporates robust security primitives, such as identity token validation, secret management, and automated traffic control, to support the development of secure, scalable backend architectures.
- [dgkanatsios/ckad-exercises](https://awesome-repositories.com/repository/dgkanatsios-ckad-exercises.md) (9,860 ⭐) — This project is a Kubernetes certification study guide and hands-on lab designed to prepare candidates for the Certified Kubernetes Application Developer exam. It provides a containerized learning sandbox and a resource validator to simulate real-world cluster configuration challenges.

The environment uses scenario-based learning modules that require the implementation of pods, network policies, and persistent volumes. Correctness is verified through automated cluster queries that check the state of resources against defined expectations.

The exercises cover a broad capability surface including Kubernetes networking configuration, state management, and general application development using declarative YAML configurations.
- [cube-js/cube](https://awesome-repositories.com/repository/cube-js-cube.md) (20,251 ⭐) — Cube is a semantic data layer that provides a unified framework for defining business metrics, dimensions, and relationships across diverse data sources. By acting as a headless business intelligence engine, it transforms raw data into a governed model that can be queried via SQL, REST, and GraphQL interfaces. This architecture ensures consistent data definitions and logic across all downstream analytical applications and reporting tools.

The platform distinguishes itself through its integrated conversational AI capabilities, which allow users to explore data using natural language. It orchestrates these interactions by mapping questions to the underlying semantic model, ensuring that AI-generated insights remain accurate and context-aware. Furthermore, Cube is designed for multi-tenant environments, offering robust infrastructure isolation, row-level security, and dynamic context injection to ensure that data access is strictly governed and personalized for every user or tenant.

Beyond its core modeling and AI features, the platform includes a comprehensive suite of tools for performance optimization, including automated pre-aggregation caching and asynchronous query queuing. It supports a wide range of data sources and deployment models, from self-hosted containers to managed cloud environments. The system also provides extensive programmatic control over report management, dashboard publishing, and user identity synchronization, making it suitable for embedding interactive analytics directly into custom software applications.
- [hasbrain/data-engineer-roadmap](https://awesome-repositories.com/repository/hasbrain-data-engineer-roadmap.md) (0 ⭐) — Below you can find a chart demonstrating the paths that you can take and the milestones that you would want to achieve in order to become a data engineer. We spoke to senior data engineers and data engineering managers from top tech companies in the Silicon Valley, and consolidated learnings…
- [welldone-cloud/aws-list-resources](https://awesome-repositories.com/repository/welldone-cloud-aws-list-resources.md) (184 ⭐) — Uses the AWS Cloud Control API to list resources that are present in a given AWS account and region(s). Discovered resources are written to a JSON result file.
- [redis/redis](https://awesome-repositories.com/repository/redis-redis.md) (74,906 ⭐) — Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to process requests efficiently, while maintaining data durability through append-only persistence logs and asynchronous snapshotting mechanisms.

What distinguishes Redis is its ability to handle complex data structures—including strings, hashes, lists, sets, and sorted sets—alongside hierarchical JSON documents and high-dimensional vector embeddings. It supports advanced operational patterns such as active-active database deployment for global distribution, real-time data streaming, and probabilistic statistics for large-scale data analysis. These capabilities are complemented by a pluggable indexing engine that enables semantic similarity matching and full-text retrieval.

The platform offers a comprehensive ecosystem for managing distributed state, including master-replica replication, automated cluster management, and granular security controls like access control lists and TLS encryption. Developers can interact with the database through language-specific client libraries that support connection multiplexing and object mapping, or via a command-line interface for direct administrative tasks and scripting.

Redis is deployed through standard package managers and supports both self-managed clusters and managed cloud instances. Observability is provided through integrated tools for performance analysis, slow log monitoring, and bulk data management.
- [openobserve/openobserve](https://awesome-repositories.com/repository/openobserve-openobserve.md) (17,937 ⭐) — OpenObserve is a unified observability data platform designed to ingest, store, and analyze logs, metrics, and traces. It functions as a cloud-native monitoring tool that centralizes telemetry from diverse sources, including standard collectors and cloud service providers, into a single, scalable system. By utilizing a columnar storage engine backed by object storage, the platform enables efficient long-term data retention and high-performance analytical querying.

The platform distinguishes itself through deep integration with artificial intelligence, allowing users to query data using natural language, generate dashboards via prompts, and automate incident analysis. It provides specialized monitoring for language model pipelines, including token usage cost analysis and performance tracking for AI agents. Furthermore, the system enforces strict multi-tenant resource isolation and zero-trust access, ensuring that organizational data remains secure and independent within shared infrastructure.

Beyond its core storage and AI capabilities, the platform includes a comprehensive suite of tools for incident management, infrastructure monitoring, and data pipeline orchestration. It supports real-time stream processing, schema-agnostic indexing, and automated data enrichment, allowing for flexible telemetry management without rigid pre-defined structures. The system also provides advanced diagnostic features such as production error deobfuscation, service dependency mapping, and user journey analysis to accelerate root cause investigation.

The software is designed for flexible deployment, running as a stateless, containerized service that supports high availability and horizontal scaling. It is distributed as a single binary or container image, with configuration managed through infrastructure-as-code templates.
- [azure/azure-mcp](https://awesome-repositories.com/repository/azure-azure-mcp.md) (1,218 ⭐) — The Azure MCP Server, bringing the power of Azure to your agents.
- [trailofbits/algo](https://awesome-repositories.com/repository/trailofbits-algo.md) (30,278 ⭐) — Algo is a cloud VPN deployment tool and WireGuard orchestrator designed to automate the provisioning and configuration of personal VPN servers across multiple cloud infrastructure providers. It functions as a multi-cloud infrastructure provisioner and a VPN client configuration generator, creating the necessary tunnels and connection profiles for secure device connectivity.

The project distinguishes itself by integrating a network ad-blocking DNS server directly into the deployment, filtering advertisements and malicious domains for all connected clients. It further simplifies the onboarding process by generating protocol-specific configuration files and Apple configuration profiles for mobile and desktop devices.

The system covers broad capability areas including cloud infrastructure automation for providers such as DigitalOcean, Google Cloud, and Hetzner, as well as network traffic management through split tunneling and LAN passthrough. It also handles security and access control via Linux firewall configuration and cloud security group automation.

Deployment can be executed in a containerized environment or via headless mode using environment variables to bypass interactive prompts.
- [azure/azure-powershell](https://awesome-repositories.com/repository/azure-azure-powershell.md) (4,733 ⭐) — Microsoft Azure PowerShell
- [kubeshark/kubeshark](https://awesome-repositories.com/repository/kubeshark-kubeshark.md) (11,954 ⭐) — Kubeshark is a network observability platform designed for Kubernetes environments, functioning as an eBPF-powered engine for cluster-wide traffic analysis. It captures, indexes, and visualizes network activity and API calls directly from the kernel, providing deep visibility into service-to-service communication without requiring sidecar proxies or manual code instrumentation.

The platform distinguishes itself through its ability to perform protocol-aware traffic dissection and user-space cryptographic hooking, which allows for the inspection of encrypted traffic and the reconstruction of application-layer protocols like HTTP, gRPC, and Kafka. It supports advanced diagnostic capabilities, including AI-driven troubleshooting, forensic analysis of network snapshots, and the correlation of infrastructure events with application-level traffic patterns.

Beyond core monitoring, the system provides a comprehensive suite of tools for managing traffic data, including granular role-based access control, sensitive data redaction, and flexible storage options ranging from ephemeral local buffers to cloud-based object storage. It is built to operate in diverse environments, supporting air-gapped deployments and integrating with standard Kubernetes ingress resources for secure dashboard access.

The project is managed via a command-line interface that facilitates deployment control, custom script execution, and the sharing of specific traffic analysis views through encoded search queries.
- [virtuesecurity/aws-extender](https://awesome-repositories.com/repository/virtuesecurity-aws-extender.md) (258 ⭐) — AWS Extender (Cloud Storage Tester) is a Burp plugin to assess permissions of cloud storage containers on AWS, Google Cloud and Azure.
- [electric-sql/electric](https://awesome-repositories.com/repository/electric-sql-electric.md) (9,909 ⭐) — Electric is a Postgres data synchronization engine and replication proxy designed to enable local-first software. It replicates data from Postgres databases to client-side stores in real time using logical replication, allowing applications to maintain a local embedded database for offline access and low-latency updates.

The system distinguishes itself by using shapes to filter and authorize specific subsets of database rows and columns before streaming them to clients or edge workers. It further supports multi-user collaboration by integrating a conflict-free replicated data type framework to ensure consistent state synchronization across different users.

The project covers a broad range of capabilities, including reactive state management and real-time data streaming to client interfaces and server-side renders. It provides tools for data shaping and transformation, database integration across various cloud and serverless Postgres providers, and security primitives such as token-based authorization and end-to-end encryption.

The service can be deployed as a containerized web service on cloud platforms with support for rolling deployment management.
- [azure/psrule.rules.azure](https://awesome-repositories.com/repository/azure-psrule-rules-azure.md) (447 ⭐) — Rules to validate Azure resources and infrastructure as code (IaC) using PSRule.
- [pathwaycom/llm-app](https://awesome-repositories.com/repository/pathwaycom-llm-app.md) (59,341 ⭐) — This project is a data processing engine and AI application platform designed for building production-grade machine learning workflows. It provides a unified programming model that handles both historical batch data and live stream ingestion, enabling the development of real-time ETL pipelines and scalable data transformation workflows.

The framework distinguishes itself through differential dataflow execution, which propagates only changes through a pipeline rather than recomputing entire datasets. It supports distributed state management across worker nodes and utilizes incremental stream processing to trigger computations only when source data updates. These capabilities are paired with a specialized vector search framework that maintains low-latency access to evolving knowledge bases for retrieval-augmented generation.

The platform facilitates enterprise AI integration by connecting large language models to private data sources. It includes pre-built application templates to assist in the deployment of high-accuracy retrieval systems and scalable data pipelines.
- [innfactory/akka-persistence-gcp-datastore](https://awesome-repositories.com/repository/innfactory-akka-persistence-gcp-datastore.md) (19 ⭐) — akka-persistence-gcp-datastore is a journal and snapshot store plugin for akka-persistence using google cloud firestore in datastore mode.
- [azure/azure-documentdb-dotnet](https://awesome-repositories.com/repository/azure-azure-documentdb-dotnet.md) (0 ⭐) — This project provides a client tools or utilities in .NET that makes it easy to interact with Azure Cosmos DB. Azure cosmos DB is published with nuget name Microsoft.Azure.DocumentDB.
- [localstack/localstack](https://awesome-repositories.com/repository/localstack-localstack.md) (64,423 ⭐) — LocalStack is an infrastructure development environment that provides a local simulation of cloud services. By leveraging container-orchestrated service lifecycles, it allows developers to build, test, and debug cloud-native applications on their local machines without requiring remote connectivity or incurring cloud provider costs.

The platform distinguishes itself through sophisticated traffic redirection and request routing, which intercept cloud service calls at the network layer and redirect them to local handlers. This enables seamless integration with existing development workflows, allowing users to mock cloud resources, replicate infrastructure states, and execute ephemeral testing environments within continuous integration pipelines.

Beyond core emulation, the platform includes a comprehensive suite of developer tools for managing service lifecycles, monitoring activity, and configuring runtime environments. It supports complex distributed architectures through event-driven simulation, persistent storage mapping, and dynamic configuration injection, ensuring that local environments accurately mirror production requirements.

The system is designed for integration into automated build and deployment workflows, providing visual dashboards and terminal-based interfaces for real-time resource management and infrastructure troubleshooting.
