2 repos
Hardware-agnostic layers for executing models across diverse computing environments.
Distinguishing note: Focuses on the abstraction layer for hardware compatibility rather than the model runtime itself.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Inference Backends. Refine with filters or upvote what's useful.
ChatGLM-6B is a generative AI inference engine designed for local execution of transformer-based language models. It provides a comprehensive runtime environment that allows users to load and run pre-trained neural network weights directly on their own hardware, ensuring data privacy and independence from external cloud services. The project distinguishes itself through a hardware-agnostic execution backend that supports deployment across diverse environments, including standard processors, Apple Silicon, and multi-GPU configurations. It incorporates advanced optimization techniques such as w
Provides a hardware-agnostic layer to enable model execution across diverse computing environments.
Jan is a desktop application that functions as a local artificial intelligence model runtime and an open-standard API server. It enables the execution of large language models directly on local hardware, ensuring that data remains private and accessible offline while providing a unified interface for managing model weights and inference runtimes. The platform distinguishes itself by offering a modular inference backend that allows users to swap execution engines based on hardware compatibility and performance needs. It acts as a cross-platform orchestrator, providing the ability to switch bet
Swaps between different underlying model execution engines to balance performance and hardware compatibility.