2 repos
Dedicated server applications that host machine learning models to provide scalable, network-accessible inference services.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Model Inference Servers. Refine with filters or upvote what's useful.
LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface. The pro
Exposes trained models via standardized network protocols to facilitate scalable and reliable prediction services.
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade
Exposes loaded models via command-line API endpoints with built-in authentication for scalable inference services.