HAMi is a hardware orchestration and virtualization system designed to manage accelerators within Kubernetes. It functions as a device plugin that partitions physical hardware into isolated virtual slices, enabling multiple containers to share a single device through enforced memory limits and compute quotas.
The project provides a virtualization manager and a heterogeneous compute scheduler that distributes tasks across diverse accelerator types. It uses packing and topology policies to optimize workload placement and allows for specific hardware targeting using unique device identifiers.
The system includes a monitoring tool that collects real-time memory and core utilization metrics across different hardware vendors, exporting this data via Prometheus-compatible endpoints for cluster health dashboards. Administrative tasks and hardware allocations are managed through a web-based graphical interface.