What are the best open-source alternatives to Openvino?

30 open-source projects similar to openvinotoolkit/openvino, ranked by shared features. Top picks: microsoft/onnxruntime, alibaba/mnn, google-ai-edge/litert, sgl-project/sglang, ggerganov/whisper.cpp, intel/ipex-llm, pytorch/executorch, tiiny-ai/powerinfer, paddlepaddle/paddle-lite, intel/neural-compressor.

Is microsoft/onnxruntime a good alternative to Openvino?

This project is a cross-platform machine learning inference engine designed to execute pre-trained models across diverse operating systems and hardware environments. It functions as a standardized execution framework that manages the entire lifecycle of model inference, from loading and graph optim…

Is alibaba/mnn a good alternative to Openvino?

MNN is a high-performance inference engine and framework designed for on-device machine learning. It provides a comprehensive environment for executing, optimizing, and deploying neural network models directly on mobile and resource-constrained edge devices. The framework distinguishes itself thro…

Is google-ai-edge/litert a good alternative to Openvino?

LiteRT is a runtime and API for executing machine learning and generative AI models on mobile, desktop, and IoT hardware. It consists of an inference engine and a specialized environment for running quantized large language and diffusion models locally on edge hardware. The system includes an ahea…

Is sgl-project/sglang a good alternative to Openvino?

Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains thr…

Is ggerganov/whisper.cpp a good alternative to Openvino?

whisper.cpp is a C++ implementation of the Whisper speech-to-text model, serving as a lightweight machine learning inference engine and quantized runtime. It provides high-performance automatic speech recognition and real-time audio transcription without requiring a Python environment. The project…

Is intel/ipex-llm a good alternative to Openvino?

Intel XPU LLM Acceleration Library is a toolkit designed to accelerate large language model inference and finetuning on Intel CPUs, GPUs, and NPUs. It provides a distributed inference engine for scaling models across multiple accelerators, a multimodal model runtime for vision and speech tasks, and…

Is pytorch/executorch a good alternative to Openvino?

ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardwar…

Is tiiny-ai/powerinfer a good alternative to Openvino?

PowerInfer is a high-performance local large language model inference engine and sparse inference framework. It provides a runtime for executing models on consumer-grade hardware, utilizing a GPU acceleration backend to optimize tensor operations for graphics processors. The system distinguishes i…

Is paddlepaddle/paddle-lite a good alternative to Openvino?

Paddle-Lite is a deep learning inference engine and edge computing runtime designed to execute trained models on mobile and edge devices. It provides a hardware-accelerated inference framework and a decoupled runtime with a minimal binary footprint to operate in resource-constrained environments wi…

Is intel/neural-compressor a good alternative to Openvino?

Neural Compressor is a deep learning model compression toolkit and AI inference acceleration engine. It functions as an automated model quantization tool and hardware-aware model compiler designed to reduce the memory footprint of neural networks and decrease execution latency. The project provide…

Back to openvinotoolkit/openvino

Open-source alternatives to Openvino

30 open-source projects similar to openvinotoolkit/openvino, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Openvino alternative.

microsoft/onnxruntime
microsoft/onnxruntime
19,347View on GitHub
This project is a cross-platform machine learning inference engine designed to execute pre-trained models across diverse operating systems and hardware environments. It functions as a standardized execution framework that manages the entire lifecycle of model inference, from loading and graph optimization to hardware-accelerated execution and generative sequence management. The runtime distinguishes itself through a highly modular architecture that decouples model logic from hardware-specific kernels. By utilizing an execution provider abstraction, it enables developers to offload computation
C++ai-frameworkdeep-learninghardware-acceleration
View on GitHub19,347
alibaba/mnn
alibaba/MNN
14,242View on GitHub
MNN is a high-performance inference engine and framework designed for on-device machine learning. It provides a comprehensive environment for executing, optimizing, and deploying neural network models directly on mobile and resource-constrained edge devices. The framework distinguishes itself through a robust model optimization toolkit that supports quantization, compression, and structural graph manipulation to minimize memory footprint and maximize execution speed. It features a modular architecture that abstracts hardware-specific backends, allowing models to run efficiently across diverse
C++armconvolutiondeep-learning
View on GitHub14,242
google-ai-edge/litert
google-ai-edge/LiteRT
2,561View on GitHub
LiteRT is a runtime and API for executing machine learning and generative AI models on mobile, desktop, and IoT hardware. It consists of an inference engine and a specialized environment for running quantized large language and diffusion models locally on edge hardware. The system includes an ahead-of-time model compiler that translates models into hardware-specific bytecode to reduce startup latency and memory overhead. It provides a unified interface for Neural Processing Units with automatic fallback routing to CPUs or GPUs when specific subgraph support is unavailable. An edge model conve
C++
View on GitHub2,561

Open-source alternatives to Openvino

microsoft/onnxruntime

alibaba/MNN

google-ai-edge/LiteRT

sgl-project/sglang

ggerganov/whisper.cpp

intel/ipex-llm

pytorch/executorch

Tiiny-AI/PowerInfer

PaddlePaddle/Paddle-Lite

intel/neural-compressor

vllm-project/llm-compressor

xai-org/grok-1

pytorch/torchtune

Lightning-AI/litgpt

NVIDIA/triton-inference-server

apple/ml-fastvlm

facebookresearch/metaseq

PaddlePaddle/PaddleFormers

meta-llama/llama-models

abetlen/llama-cpp-python

zhaochenyang20/Awesome-ML-SYS-Tutorial

mozilla-ai/llamafile

vladmandic/sdnext

wang-xinyu/tensorrtx

bentoml/BentoML

kvcache-ai/ktransformers

tensorflow/tfjs

ymcui/Chinese-LLaMA-Alpaca

QwenLM/Qwen

d2l-ai/d2l-en