2 रिपॉजिटरी
Techniques for maximizing the number of workloads per physical node by reducing virtualization overhead.
Distinguishing note: Candidates focus on networking or monitoring, while this is about compute density and packing.
Explore 2 awesome GitHub repositories matching devops & infrastructure · Workload Density Optimization. Refine with filters or upvote what's useful.
KServe is an open platform for deploying and serving generative and predictive AI models on Kubernetes. It defines inference services as custom resources with declarative YAML specifications, enabling a Kubernetes-native approach to model deployment and lifecycle management. The platform leverages Knative-based serverless scaling for automatic scale-to-zero and revision management, and supports a pluggable serving runtime architecture that maps model formats to containerized execution environments. KServe distinguishes itself through model-aware autoscaling that scales replicas based on token
Adjusts GPU, memory, and autoscaling settings based on whether the workload is generative or predictive.
This project provides strategic roadmaps and guides detailing the evolution and deployment patterns of managed container orchestration and security services. It serves as a public tracking document for upcoming features and development priorities for EKS, ECS, ECR, and Fargate. The resource includes a cloud container orchestration guide and a Kubernetes and ECS strategy, outlining the development of managed Kubernetes and proprietary orchestration services for cloud infrastructure. It also provides a security and monitoring plan focused on scanning malicious activity and tracking workload hea
Implements IPv4 prefix assignment to network interfaces to increase container density on Windows nodes.