Torch2trt

Torch2trt - convert PyTorch models to TensorRT | Awesome Repos

Open-source alternatives to Torch2trt

Similar open-source projects, ranked by how many features they share with Torch2trt.

wang-xinyu/tensorrtx
wang-xinyu/tensorrtx
7,802View on GitHub
tensorrtx is a computer vision inference engine and model implementation library designed for graphics processor acceleration. It provides a framework for optimizing deep learning models through a GPU inference optimizer, a deep learning model converter for transforming weights from frameworks like TensorFlow and PyTorch, and a custom plugin library to implement operations not natively supported by the TensorRT API. The project distinguishes itself through a comprehensive collection of pre-defined network implementations, ranging from various YOLO versions and DETR transformers for object det
C++arcfacecrnndetr
View on GitHub7,802
pytorch/executorch
pytorch/executorch
4,296View on GitHub
ExecuTorch is a lightweight C++ runtime for deploying PyTorch models on mobile, embedded, and edge hardware. It provides an ahead-of-time compilation pipeline that exports, quantizes, and lowers model graphs into compact serialized programs, then executes them through a minimal runtime with hardware acceleration and on-device large language model inference capabilities. The project distinguishes itself through a hardware accelerator delegate system that partitions model subgraphs and offloads computation to specialized backends including NPUs, GPUs, and DSPs from Apple, Arm, Intel, MediaTek,
Pythondeep-learningembeddedgpu
View on GitHub4,296
tingsongyu/pytorch_tutorial
TingsongYu/PyTorch_Tutorial
8,018View on GitHub
This project is a comprehensive collection of educational examples and reference implementations for building vision and language models using PyTorch. It serves as a deep learning tutorial covering the end-to-end process of developing neural networks, from initial architecture definition to final production deployment. The repository provides detailed guides on implementing a wide range of domain-specific models, including convolutional neural networks for object detection and segmentation, as well as transformer and recurrent architectures for natural language processing. It emphasizes gene
Python
View on GitHub8,018
abetlen/llama-cpp-python
abetlen/llama-cpp-python
9,993View on GitHub
llama-cpp-python provides a Python interface for the llama.cpp library, enabling the execution of large language models with hardware acceleration. It functions as a GGUF model loader and a structured text generator capable of running inference servers and multimodal runtimes for processing both text and image inputs. The project distinguishes itself through a local inference server that exposes model capabilities via an OpenAI-compatible web API. It supports advanced execution techniques including speculative decoding, weight quantization, and layer-based GPU offloading to manage memory acro
Python
View on GitHub9,993

See all 30 alternatives to Torch2trt

NVIDIA-AI-IOTtorch2trt

Features

Open-source alternatives to Torch2trt

wang-xinyu/tensorrtx

pytorch/executorch

TingsongYu/PyTorch_Tutorial

abetlen/llama-cpp-python

Star history

Open-source alternatives to Torch2trt

wang-xinyu/tensorrtx

pytorch/executorch

TingsongYu/PyTorch_Tutorial

abetlen/llama-cpp-python