4 repos
Explore 4 awesome GitHub repositories matching artificial intelligence & ml · Inference & Deployment. Refine with filters or upvote what's useful.
TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst
Refines models for production execution to improve performance and reduce resource consumption on target hardware.
vLLM is a high-throughput inference engine designed for the efficient serving and execution of large language models. It functions as a production-ready distributed model server, providing standard API protocols for online serving while also supporting offline batch processing. The system is built to maximize token gen
Supports configurable, high-performance attention backends that automatically detect and optimize computation for specific hardware accelerators.
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification
Exports and optimizes models for high-performance execution across cloud and edge hardware environments.
This project provides a deep learning architecture designed to identify and isolate distinct objects within images by generating precise pixel-level masks. It functions as a browser-based inference engine, enabling the execution of complex machine learning models directly within web environments without requiring serve
Optimizes and compresses deep learning models to minimize resource consumption during browser-based deployment.