awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Infrastructure · Awesome GitHub Repositories

76 repos

Awesome GitHub RepositoriesInfrastructure

Foundational systems and hardware-level tools required to support the development, deployment, and scaling of machine learning workflows.

Explore 76 awesome GitHub repositories matching artificial intelligence & ml · Infrastructure. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Infrastructure

Awesome Infrastructure GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • CorentinJ/Real-Time-Voice-Cloning

    CorentinJ/Real-Time-Voice-Cloning

    59,355GitHubView on GitHub↗

    This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima

    Automates the end-to-end workflow for sourcing data, training neural models, and validating synthesis performance.

    Pythondeep-learningpythonpytorch
  • meta-llama/llama

    meta-llama/llama

    59,157GitHubView on GitHub↗

    Llama is a computational framework and runtime environment designed for executing transformer-based neural networks locally. It functions as a generative AI inference engine, enabling the processing of input sequences through pre-trained model weights to produce text completions and structured data outputs directly on

    Executes model checkpoints locally with configurable parameters like sequence length and batch size to optimize performance.

    Python
  • Solido/awesome-flutter

    Solido/awesome-flutter

    59,015GitHubView on GitHub↗

    This project is a community-curated directory of resources, libraries, and tools designed to support developers working with the Flutter framework. It functions as a centralized knowledge base, organizing high-quality external references into a structured, human-readable format to assist in the discovery of technical m

    Identifies machine learning toolkits and model conversion utilities for deploying artificial intelligence within mobile applications.

    Dartandroidawesomeawesome-list
  • cline/cline

    cline/cline

    58,164GitHubView on GitHub↗

    Cline is an extensible agent runtime and multi-agent orchestration engine designed to automate complex software engineering workflows. It functions as an integrated development environment extension that bridges strategic task planning with autonomous execution, allowing users to manage multi-step projects through huma

    Connects various local and cloud-based language models to facilitate automated software engineering workflows.

    TypeScript
  • zylon-ai/private-gpt

    zylon-ai/private-gpt

    57,116GitHubView on GitHub↗

    This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov

    Runs generative language models directly on local hardware for private, offline processing tasks.

    Python
  • ultralytics/yolov5

    ultralytics/yolov5

    56,830GitHubView on GitHub↗

    YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning

    Executes high-speed visual inference using hardware-accelerated processing and test-time augmentation.

    Pythoncoremldeep-learningios
  • AntonOsika/gpt-engineer

    AntonOsika/gpt-engineer

    55,201GitHubView on GitHub↗

    GPT-Engineer is an autonomous agent and framework designed for AI-assisted software development. It functions as a generative codebase architect that translates natural language requirements into complete, functional software projects by reading and writing files directly to the local file system. The platform disting

    Supports the deployment and integration of various local and cloud-based language models for generative tasks.

    Pythonaiautonomous-agentcode-generation
  • RVC-Boss/GPT-SoVITS

    RVC-Boss/GPT-SoVITS

    55,111GitHubView on GitHub↗

    GPT-SoVITS is a text-to-speech synthesis engine and voice cloning toolkit designed for generating natural-sounding human speech. It functions as a neural audio processing pipeline that maps input text to high-fidelity audio waveforms, utilizing conditional variational autoencoders and flow-based decoders to ensure expr

    Adapts pre-trained models to specific personas or characters using targeted training on small audio datasets.

    Pythontext-to-speechttsvits
  • deepfakes/faceswap

    deepfakes/faceswap

    54,974GitHubView on GitHub↗

    Faceswap is a comprehensive framework for automated media manipulation and neural face synthesis. It provides a modular pipeline that manages the entire lifecycle of facial feature extraction, deep learning model training, and image conversion. By coordinating complex computer vision workflows, the system enables users

    Bundles modular components for face extraction, model training, and image conversion to facilitate custom processing workflows.

    Pythondeep-face-swapdeep-learningdeep-neural-networks
  • Mintplex-Labs/anything-llm

    Mintplex-Labs/anything-llm

    54,751GitHubView on GitHub↗

    This platform serves as a comprehensive environment for managing private language models, document knowledge bases, and automated agent workflows within secure local infrastructure. It functions as a document-aware workspace that enables users to ingest diverse file formats into searchable repositories, ensuring that a

    Deploys language model interfaces and data processing engines directly onto local hardware for private, self-hosted operations.

    JavaScriptai-agentscustom-ai-agentsdeepseek
  • opendatalab/MinerU

    opendatalab/MinerU

    54,523GitHubView on GitHub↗

    MinerU is a document parsing pipeline designed to transform unstructured files into machine-readable, structured data. It utilizes deep learning models to perform layout analysis, identifying document regions and extracting complex content such as mathematical expressions. By combining these neural network inferences w

    Deploys deep learning models to classify content types and extract complex mathematical expressions from diverse visual inputs.

    Pythonai4sciencedocument-analysisextract-data
  • karpathy/nanoGPT

    karpathy/nanoGPT

    53,461GitHubView on GitHub↗

    nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi

    Executes high-dimensional array operations and mathematical functions essential for training deep neural networks.

    Python
  • ultralytics/ultralytics

    ultralytics/ultralytics

    53,426GitHubView on GitHub↗

    Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification

    Consolidates the entire lifecycle of training, validating, and executing deep learning models into a single, cohesive workflow.

    Pythonclicomputer-visiondeep-learning
  • facebookresearch/segment-anything

    facebookresearch/segment-anything

    53,431GitHubView on GitHub↗

    This project provides a deep learning architecture designed to identify and isolate distinct objects within images by generating precise pixel-level masks. It functions as a browser-based inference engine, enabling the execution of complex machine learning models directly within web environments without requiring serve

    Enables the execution of sophisticated deep learning models directly within the browser environment using hardware-accelerated runtimes.

    Jupyter Notebook
  • unslothai/unsloth

    unslothai/unsloth

    52,461GitHubView on GitHub↗

    Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade

    Streamlines the adaptation of existing models for specific tasks while maintaining low hardware requirements.

    Pythonagentdeepseekdeepseek-r1
  • tensorflow/tfjs-examples

    tensorflow/tfjs-examples

    6,783GitHubView on GitHub↗

    This repository provides a collection of practical demonstrations and implementation guides for machine learning tasks using TensorFlow.js. It serves as a resource for developers to explore model architectures, training workflows, and data manipulation techniques across domains such as computer vision, natural language

    Explicit disposal methods for layer and model objects enable developers to reclaim GPU-resident memory in environments lacking automatic garbage collection.

    JavaScript
Prev1234Next

Explore sub-tags

  • Data Ingestion and Preparation4 sub-tagsTools focused on the initial stages of the pipeline, including loading, formatting, and augmenting raw data for model consumption.
  • Dataset Management2 sub-tags
  • Deployment & Serving8 sub-tags
  • Domain-Specific Processing Pipelines3 sub-tagsSpecialized pipelines tailored for specific data modalities like media synthesis or real-time streaming inference.
Evaluation & Validation14 sub-tags
  • Integrated Development Platforms1 sub-tagComprehensive environments that bundle tools for the end-to-end lifecycle, including development, management, and operational workflows.
  • Machine Learning Training8 sub-tagsFrameworks and utilities used to train, fine-tune, and align machine learning models with specific objectives.
  • Model Evaluation and Analysis7 sub-tagsTools and frameworks for measuring, benchmarking, and monitoring the performance and quality of machine learning models.
  • Model Inference and Serving7 sub-tagsPlatforms and techniques for deploying, optimizing, and serving machine learning models for production use.
  • Model Management9 sub-tagsTools and interfaces for organizing, loading, and executing machine learning models throughout their operational lifecycle.
  • Model Optimization5 sub-tagsTechniques and utilities designed to improve model performance, reduce resource consumption, and refine parameters for specific deployment environments.
  • Optimization & Inference8 sub-tags
  • Training & Tuning12 sub-tags
  • Training Monitoring & Profiling3 sub-tags