76 repos
Foundational systems and hardware-level tools required to support the development, deployment, and scaling of machine learning workflows.
Explore 76 awesome GitHub repositories matching artificial intelligence & ml · Infrastructure. Refine with filters or upvote what's useful.
This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima
Automates the end-to-end workflow for sourcing data, training neural models, and validating synthesis performance.
Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on
Executes model checkpoints locally with configurable parameters like sequence length and batch size to optimize performance.
This project is a community-curated directory of resources, libraries, and tools designed to support developers working with the Flutter framework. It functions as a centralized knowledge base, organizing high-quality external references into a structured, human-readable format to assist in the discovery of technical m
Identifies machine learning toolkits and model conversion utilities for deploying artificial intelligence within mobile applications.
Cline is an extensible agent runtime and multi-agent orchestration engine designed to automate complex software engineering workflows. It functions as an integrated development environment extension that bridges strategic task planning with autonomous execution, allowing users to manage multi-step projects through huma
Connects various local and cloud-based language models to facilitate automated software engineering workflows.
This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov
Runs generative language models directly on local hardware for private, offline processing tasks.
YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning
Executes high-speed visual inference using hardware-accelerated processing and test-time augmentation.
GPT-Engineer is an autonomous agent and framework designed for AI-assisted software development. It functions as a generative codebase architect that translates natural language requirements into complete, functional software projects by reading and writing files directly to the local file system. The platform disting
Supports the deployment and integration of various local and cloud-based language models for generative tasks.
GPT-SoVITS is a text-to-speech synthesis engine and voice cloning toolkit designed for generating natural-sounding human speech. It functions as a neural audio processing pipeline that maps input text to high-fidelity audio waveforms, utilizing conditional variational autoencoders and flow-based decoders to ensure expr
Adapts pre-trained models to specific personas or characters using targeted training on small audio datasets.
Faceswap is a comprehensive framework for automated media manipulation and neural face synthesis. It provides a modular pipeline that manages the entire lifecycle of facial feature extraction, deep learning model training, and image conversion. By coordinating complex computer vision workflows, the system enables users
Bundles modular components for face extraction, model training, and image conversion to facilitate custom processing workflows.
This platform serves as a comprehensive environment for managing private language models, document knowledge bases, and automated agent workflows within secure local infrastructure. It functions as a document-aware workspace that enables users to ingest diverse file formats into searchable repositories, ensuring that a
Deploys language model interfaces and data processing engines directly onto local hardware for private, self-hosted operations.
MinerU is a document parsing pipeline designed to transform unstructured files into machine-readable, structured data. It utilizes deep learning models to perform layout analysis, identifying document regions and extracting complex content such as mathematical expressions. By combining these neural network inferences w
Deploys deep learning models to classify content types and extract complex mathematical expressions from diverse visual inputs.
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi
Executes high-dimensional array operations and mathematical functions essential for training deep neural networks.
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification
Consolidates the entire lifecycle of training, validating, and executing deep learning models into a single, cohesive workflow.
This project provides a deep learning architecture designed to identify and isolate distinct objects within images by generating precise pixel-level masks. It functions as a browser-based inference engine, enabling the execution of complex machine learning models directly within web environments without requiring serve
Enables the execution of sophisticated deep learning models directly within the browser environment using hardware-accelerated runtimes.
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade
Streamlines the adaptation of existing models for specific tasks while maintaining low hardware requirements.
This repository provides a collection of practical demonstrations and implementation guides for machine learning tasks using TensorFlow.js. It serves as a resource for developers to explore model architectures, training workflows, and data manipulation techniques across domains such as computer vision, natural language
Explicit disposal methods for layer and model objects enable developers to reclaim GPU-resident memory in environments lacking automatic garbage collection.