What are the best open-source alternatives to LiteRT?

30 open-source projects similar to google-ai-edge/litert, ranked by shared features. Top picks: pytorch/executorch, openvinotoolkit/openvino, alibaba/mnn, sgl-project/sglang, microsoft/onnxruntime, google-ai-edge/litert-lm, kvcache-ai/ktransformers, meituan/yolov6, intel/neural-compressor, ericlbuehler/mistral.rs.

Is pytorch/executorch a good alternative to LiteRT?

ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardwar…

Is openvinotoolkit/openvino a good alternative to LiteRT?

OpenVINO is an AI inference engine and model serving platform designed to execute optimized deep learning models across CPUs, GPUs, and NPUs through a unified API. It includes a model optimization toolkit for converting, quantizing, and compressing models from various frameworks, alongside a specia…

Is alibaba/mnn a good alternative to LiteRT?

MNN is a high-performance inference engine and framework designed for on-device machine learning. It provides a comprehensive environment for executing, optimizing, and deploying neural network models directly on mobile and resource-constrained edge devices. The framework distinguishes itself thro…

Is sgl-project/sglang a good alternative to LiteRT?

Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains thr…

Is microsoft/onnxruntime a good alternative to LiteRT?

This project is a cross-platform machine learning inference engine designed to execute pre-trained models across diverse operating systems and hardware environments. It functions as a standardized execution framework that manages the entire lifecycle of model inference, from loading and graph optim…

Is google-ai-edge/litert-lm a good alternative to LiteRT?

LiteRT-LM is a high-performance inference framework designed to execute large language models locally on mobile, desktop, and IoT hardware. It serves as an on-device model runtime that utilizes CPU, GPU, and NPU acceleration to provide low-latency processing. The framework is distinguished by its…

Is kvcache-ai/ktransformers a good alternative to LiteRT?

Ktransformers is a comprehensive framework designed for the operation, fine-tuning, and serving of large language models. It functions as a heterogeneous inference engine and quantized execution runtime, enabling the deployment of massive models by distributing computational workloads across both C…

Is meituan/yolov6 a good alternative to LiteRT?

YOLOv6 is a single-stage deep learning framework designed for industrial object detection. It serves as a computer vision model trainer for identifying and locating objects within images, as well as an instance segmentation tool that delineates precise object boundaries using masks. The project in…

Is intel/neural-compressor a good alternative to LiteRT?

Neural Compressor is a deep learning model compression toolkit and AI inference acceleration engine. It functions as an automated model quantization tool and hardware-aware model compiler designed to reduce the memory footprint of neural networks and decrease execution latency. The project provide…

Is ericlbuehler/mistral.rs a good alternative to LiteRT?

mistral.rs is an inference engine for large language models that runs locally and exposes models behind OpenAI and Anthropic-compatible APIs. It serves as a multi-model serving platform, capable of loading several models in a single server process with per-request routing and on-demand loading and…

Back to google-ai-edge/litert

Open-source alternatives to LiteRT

30 open-source projects similar to google-ai-edge/litert, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best LiteRT alternative.

pytorch/executorch
pytorch/executorch
4,296View on GitHub
ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardware acceleration and on-device large language model inference capabilities. The project distinguishes itself through a hardware accelerator delegate system that partitions model subgraphs and offloads computation to specialized backends including NPUs, GPUs, and DSPs from Apple, Arm, Intel, MediaTek,
Pythondeep-learningembeddedgpu
View on GitHub4,296
openvinotoolkit/openvino
openvinotoolkit/openvino
10,414View on GitHub
OpenVINO is an AI inference engine and model serving platform designed to execute optimized deep learning models across CPUs, GPUs, and NPUs through a unified API. It includes a model optimization toolkit for converting, quantizing, and compressing models from various frameworks, alongside a specialized generative AI runtime for large language models. The project distinguishes itself through a plugin-based hardware acceleration layer that maps neural network operations to vendor-specific drivers. It features advanced execution mechanisms such as continuous batching, speculative decoding, and
C++aicomputer-visiondeep-learning
View on GitHub10,414
alibaba/mnn
alibaba/MNN
14,242View on GitHub
MNN is a high-performance inference engine and framework designed for on-device machine learning. It provides a comprehensive environment for executing, optimizing, and deploying neural network models directly on mobile and resource-constrained edge devices. The framework distinguishes itself through a robust model optimization toolkit that supports quantization, compression, and structural graph manipulation to minimize memory footprint and maximize execution speed. It features a modular architecture that abstracts hardware-specific backends, allowing models to run efficiently across diverse
C++armconvolutiondeep-learning
View on GitHub14,242

Open-source alternatives to LiteRT

pytorch/executorch

openvinotoolkit/openvino

alibaba/MNN

sgl-project/sglang

microsoft/onnxruntime

google-ai-edge/LiteRT-LM

kvcache-ai/ktransformers

meituan/YOLOv6

intel/neural-compressor

EricLBuehler/mistral.rs

RunanywhereAI/runanywhere-sdks

apple/ml-stable-diffusion

lyogavin/airllm

huggingface/text-generation-inference

apple/ml-fastvlm

vllm-project/llm-compressor

Tencent/ncnn

pytorch/torchtune

OpenNMT/CTranslate2

NexaAI/nexa-sdk

rustformers/llm

antimatter15/alpaca.cpp

lmstudio-ai/lms

PaddlePaddle/Paddle-Lite

ggerganov/whisper.cpp

intel-analytics/ipex-llm

android/ndk-samples

facebookresearch/seamless_communication

jomjol/AI-on-the-edge-device

QwenLM/Qwen