4 repos
Tools focused on post-training conversion, compilation, and hardware-specific acceleration for deployment-ready models.
Explore 4 awesome GitHub repositories matching artificial intelligence & ml · Inference Optimization Utilities. Refine with filters or upvote what's useful.
Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a di
Applies hardware-specific tuning to model execution paths, significantly enhancing inference speed and throughput on diverse computing devices.
YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning
Translates trained models into standard industry formats to ensure compatibility across diverse hardware and deployment environments.
Faceswap is a comprehensive framework for automated media manipulation and neural face synthesis. It provides a modular pipeline that manages the entire lifecycle of facial feature extraction, deep learning model training, and image conversion. By coordinating complex computer vision workflows, the system enables users
Converts trained models into inference-ready versions by calculating required layers and configuring swap parameters.
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade
Exports custom model weights into standard file formats to ensure compatibility with local inference and production systems.