1 repo
Utilities for measuring the performance, latency, and throughput of machine learning model endpoints.
Distinguishing note: Specifically targets model inference endpoints rather than general software performance testing.
Explore 1 awesome GitHub repository matching testing & quality assurance · Model Benchmarking Tools. Refine with filters or upvote what's useful.
Modular is a unified machine learning development platform designed for building, compiling, and deploying high-performance neural network models. It provides a comprehensive execution engine that supports both local and production-grade inference, enabling developers to manage the entire model lifecycle from initial architecture definition to scalable, containerized service deployment. The platform distinguishes itself through a hardware-agnostic runtime that abstracts diverse silicon architectures, allowing models to execute efficiently across varied compute environments. It includes a spec
Measures model endpoint performance through load testing and response time analysis.