8 个仓库
Tools for managing the lifecycle and connectivity of virtual machine instances in the cloud.
Distinguishing note: Candidates focus on application singletons or account configurations, not virtual machine (GCE) instance administration.
Explore 8 awesome GitHub repositories matching devops & infrastructure · Cloud Compute Instance Management. Refine with filters or upvote what's useful.
该项目是一个 REST API 客户端库和 Google Cloud SDK 组件,将 Python 应用程序与 Google 服务集成。它作为一个基于发现的 API 客户端,利用 OAuth 2.0 集成库通过访问令牌和服务帐户来保护请求并验证身份。 该库的特色在于使用发现文档在运行时动态生成客户端。通过解析 JSON 元数据,它将 Python 方法调用映射到 HTTP 请求,并构建镜像远程服务层次结构的基于接口的资源模型。 其更广泛的功能涵盖云基础设施管理、数字广告自动化和企业帐户管理。该工具包包括用于处理大数据集分页、为二进制数据执行可恢复上传协议以及通过 API 响应模拟进行质量保证的实用程序。 该库还通过将发现文档与可用的服务接口相结合,提供技术文档生成功能。
Enables the programmatic creation, modification, and deletion of compute instances within a cloud project.
DevOps-Bash-tools is a collection of shell scripts and aliases designed to automate cloud infrastructure, container orchestration, and CI/CD pipelines. It provides a comprehensive toolset for managing operational workflows through the command line. The project specializes in automating tasks across multiple platforms, including managing namespaces and secrets in Kubernetes, auditing resources in AWS and GCP, and triggering builds or managing environment variables in GitHub Actions, GitLab CI, and CircleCI. It also includes a toolkit for interacting with container registries to query manifests
Handles SSH connections and audits service accounts for Google Compute Engine instances.
SimianArmy is a chaos engineering framework and resilience testing tool designed to induce random infrastructure failures in cloud environments. It functions as a cloud instance termination tool that simulates unplanned outages to verify that distributed architectures maintain high availability and fault tolerance. The system identifies and terminates cloud server instances to ensure applications can tolerate unexpected hardware failures without interrupting service. This process allows for the verification of automated failover mechanisms and the identification of weaknesses in system reliab
Automates the lifecycle management and termination of virtual machine instances to test infrastructure robustness.
Axiom 是一个云基础设施编排器和分布式安全扫描框架。它作为一个管理器,用于跨多个云提供商和区域部署、快照和销毁一次性虚拟机集群。 该项目通过自动化在这些远程服务器上配置漏洞工具集和安全审计软件而脱颖而出。它具有一种通过在实例集群中分片目标列表并将结果数据聚合到统一文件和 HTML 报告中来分发安全扫描的机制。 该系统涵盖了广泛的操作能力,包括远程命令执行、并行文件传输以及针对高并发工作负载的内核级网络调优。它还提供用于图像快照管理、用于网络流量的 SSH 隧道以及部署预定义基础设施配置文件的工具。
Manages the lifecycle of cloud virtual machines, including the targeted removal of instances using wildcards.
Mgmt 是一个分布式配置管理系统,使用事件驱动自动化和闭环反馈来维护集群的预期状态。它作为一个基础设施自动化引擎,根据资源监控和预定义规范实时触发系统状态校正。 该系统包括一个分布式集群节点选择器,用于根据特定策略和约束选择主机子集以分配工作负载。它还具有一个云基础设施管理器,用于控制虚拟机实例的生命周期(包括镜像部署、区域选择和启动脚本),以及一个用于在基板管理控制器上安装和验证二进制文件的硬件固件编排器。 其他功能涵盖并行任务执行、基于透镜的配置文件修改和系统元数据检索。该工具集还提供用于数据类型转换、格式解码和环境检查以确定当前系统状态的实用程序。
Controls virtual machine instances by specifying images, regions, and startup scripts.
This project is a collection of structured study notes and conceptual breakdowns designed for the AWS Certified Cloud Practitioner exam. It serves as a technical reference and study guide, organizing cloud service details and architectural principles to assist in certification preparation. The knowledge base is built using markdown files and includes curated cheat sheets and interactive mind-map visualizations. These tools map complex certification topics into visual hierarchies to enable drill-down study paths and rapid revision. The materials cover a wide range of cloud capabilities, inclu
Covers the selection of hardware profiles based on compute or memory intensity to meet performance needs.
Lorax is a GPU-accelerated inference server and multi-adapter engine designed for serving large language models. It functions as a high-throughput system capable of deploying models via Kubernetes and managing the dynamic swapping of Low-Rank Adaptation adapters per request. The server distinguishes itself through multi-adapter dynamic batching, which allows requests using different adapter weights to be processed in a single GPU forward pass. It employs just-in-time adapter loading and weighted adapter merging to maximize throughput and enable multi-tasking without sacrificing performance.
Manages the lifecycle of remote compute nodes, including starting, stopping, and deleting cloud instances.
Foreman is a lifecycle infrastructure management platform used for automating the provisioning, configuration, and monitoring of physical, virtual, and cloud servers. It serves as a central hub for managing the entire lifespan of a server, from initial deployment and operating system upgrades to decommissioning and auditing. The platform functions as a hybrid cloud manager and bare-metal provisioning tool, providing a unified interface to control virtual machine lifecycles across diverse hypervisors and public cloud providers. It automates hardware discovery and operating system deployment us
Enables central management of the lifecycle and connectivity for virtual machine instances across diverse cloud platforms.