4 repos

Awesome GitHub RepositoriesLocal Inference and Deployment

Infrastructure and engines for executing models directly on local hardware or edge devices, prioritizing privacy and performance.

Explore 4 awesome GitHub repositories matching artificial intelligence & ml · Local Inference and Deployment. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

vllm-project/vllm
vllm-project/vllm
70,745GitHubView on GitHub
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Pythonamdblackwellcuda
meta-llama/llama
meta-llama/llama
59,157GitHubView on GitHub
Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on
Python
zylon-ai/private-gpt
zylon-ai/private-gpt
57,116GitHubView on GitHub
This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov
Python
ultralytics/ultralytics
ultralytics/ultralytics
53,426GitHubView on GitHub
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification
Pythonclicomputer-visiondeep-learning

Explore sub-tags

4 repos

Awesome GitHub RepositoriesLocal Inference and Deployment

Infrastructure and engines for executing models directly on local hardware or edge devices, prioritizing privacy and performance.

Explore 4 awesome GitHub repositories matching artificial intelligence & ml · Local Inference and Deployment. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

vllm-project/vllm
vllm-project/vllm
70,745GitHubView on GitHub
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Pythonamdblackwellcuda
meta-llama/llama
meta-llama/llama
59,157GitHubView on GitHub
Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on
Python
zylon-ai/private-gpt
zylon-ai/private-gpt
57,116GitHubView on GitHub
This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov
Python
ultralytics/ultralytics
ultralytics/ultralytics
53,426GitHubView on GitHub
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification
Pythonclicomputer-visiondeep-learning

Awesome Local Inference and Deployment GitHub Repositories

vllm-project/vllm

meta-llama/llama

zylon-ai/private-gpt

ultralytics/ultralytics

Explore sub-tags

Awesome Local Inference and Deployment GitHub Repositories

vllm-project/vllm

meta-llama/llama

zylon-ai/private-gpt

ultralytics/ultralytics

Explore sub-tags