awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Training & Tuning · Awesome GitHub Repositories

25 repos

Awesome GitHub RepositoriesTraining & Tuning

Explore 25 awesome GitHub repositories matching artificial intelligence & ml · Training & Tuning. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning
  4. Infrastructure
  5. Training & Tuning

Awesome Training & Tuning GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • sindresorhus/awesome

    sindresorhus/awesome

    438,690GitHubView on GitHub↗

    This project is a community-curated knowledge base that organizes vast technical ecosystems into a hierarchical, human-readable directory. It serves as a comprehensive index of libraries, frameworks, and methodologies, designed to facilitate discovery and professional development across the entire spectrum of software

    Exposes a curated directory of community-vetted resources for building and deploying machine learning models.

    awesomeawesome-listlists
  • vinta/awesome-python

    vinta/awesome-python

    283,687GitHubView on GitHub↗

    This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th

    Highlights high-performance frameworks designed for building, training, and tuning complex neural network architectures.

    Pythonawesomecollectionspython
  • practical-tutorials/project-based-learning

    practical-tutorials/project-based-learning

    258,742GitHubView on GitHub↗

    This project is a centralized, community-driven repository of hands-on tutorials designed to facilitate skill acquisition through the practical construction of real-world software applications. It serves as a comprehensive directory that aggregates external documentation and instructional materials, providing a structu

    Train neural networks and process large-scale datasets by applying mathematical frameworks in real-world project settings.

    beginner-projectcppgolang
  • openclaw/openclaw

    openclaw/openclaw

    211,971GitHubView on GitHub↗

    Openclaw is a platform for managing agent execution environments, providing the infrastructure to control agent lifecycles, session state, and workspace persistence. It features a centralized gateway that handles model loops, tool invocation, and streaming events, while supporting multi-agent routing and persistent mem

    Generates structured JSONL logs and console output with configurable redaction and traffic diagnostics.

    TypeScriptaiassistantcrustacean
  • tensorflow/tensorflow

    tensorflow/tensorflow

    193,864GitHubView on GitHub↗

    TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst

    Coordinates decentralized training across network nodes to preserve data privacy while aggregating model updates.

    C++deep-learningdeep-neural-networksdistributed
  • huggingface/transformers

    huggingface/transformers

    156,730GitHubView on GitHub↗

    Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering

    Standardizes the training, fine-tuning, and deployment of models across diverse hardware acceleration backends.

    Pythonaudiodeep-learningdeepseek
  • pytorch/pytorch

    pytorch/pytorch

    97,601GitHubView on GitHub↗

    PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe

    Shards model parameters, gradients, and optimizer states across processes to enable memory-efficient distributed training.

    Pythonautograddeep-learninggpu
  • Shubhamsaboo/awesome-llm-apps

    Shubhamsaboo/awesome-llm-apps

    96,116GitHubView on GitHub↗

    This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent

    End-to-end recipes provide step-by-step instructions for customizing and fine-tuning open-source language models.

    Pythonagentsllmspython
  • immich-app/immich

    immich-app/immich

    92,953GitHubView on GitHub↗

    Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining ful

    Refines facial recognition accuracy by iteratively adjusting detection thresholds and re-processing datasets to include previously unidentified faces.

    TypeScriptbackup-toolfluttergoogle-photos
  • rasbt/LLMs-from-scratch

    rasbt/LLMs-from-scratch

    85,529GitHubView on GitHub↗

    This repository serves as an educational framework for building large language models from the ground up. It provides a structured curriculum that guides learners through the end-to-end lifecycle of model development, including data processing, architecture design, and optimization. By focusing on low-level implementat

    Establishes a structured environment for building and training custom language models to master the development lifecycle.

    Jupyter Notebookaiartificial-intelligencechatbot
  • tensorflow/models

    tensorflow/models

    77,684GitHubView on GitHub↗

    This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines fo

    Standardizes model architecture patterns, training loops, and data pipelines to facilitate maintainable machine learning workflows.

    Python
  • d2l-ai/d2l-zh

    d2l-ai/d2l-zh

    75,708GitHubView on GitHub↗

    This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners

    Organizes end-to-end workflows that manage data sourcing, model training, and performance validation.

    Pythonbookchinesecomputer-vision
  • mlabonne/llm-course

    mlabonne/llm-course

    75,340GitHubView on GitHub↗

    This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we

    Explores strategies for combining multiple specialized model weights into a single unified architecture.

    courselarge-language-modelsllm
  • awesomedata/awesome-public-datasets

    awesomedata/awesome-public-datasets

    72,846GitHubView on GitHub↗

    This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, t

    Supplies a diverse collection of labeled datasets essential for training, validating, and benchmarking predictive models.

    aaron-swartzawesome-public-datasetsdatasets
  • tesseract-ocr/tesseract

    tesseract-ocr/tesseract

    72,460GitHubView on GitHub↗

    Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d

    Construct custom recognition models using provided training scripts and makefiles to optimize performance for specific document types.

    C++hacktoberfestlstmmachine-learning
  • dair-ai/Prompt-Engineering-Guide

    dair-ai/Prompt-Engineering-Guide

    70,526GitHubView on GitHub↗

    This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task

    Outlines best practices for managing model checkpoints and fine-tuning parameters to optimize performance for specialized domains.

    MDXagentagentsai-agents
  • hiyouga/LlamaFactory

    hiyouga/LlamaFactory

    67,386GitHubView on GitHub↗

    LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface. The pro

    Simplifies complex model refinement by offering a unified interface for both full-parameter and efficient training methods.

    Pythonagentaideepseek
  • keras-team/keras

    keras-team/keras

    63,858GitHubView on GitHub↗

    Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a di

    Coordinates automated workflows for training loops, batch processing, and validation dataset management.

    Pythondata-sciencedeep-learningjax
  • CorentinJ/Real-Time-Voice-Cloning

    CorentinJ/Real-Time-Voice-Cloning

    59,355GitHubView on GitHub↗

    This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima

    Automates the end-to-end workflow for sourcing data, training neural models, and validating synthesis performance.

    Pythondeep-learningpythonpytorch
  • ultralytics/yolov5

    ultralytics/yolov5

    56,830GitHubView on GitHub↗

    YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning

    Fine-tunes computer vision models on specialized or proprietary datasets to recognize unique objects.

    Pythoncoremldeep-learningios
Prev12Next

Explore sub-tags

  • Architecture and Operations2 sub-tags
  • Computer Vision and Recognition2 sub-tags
  • Data and Checkpointing5 sub-tags
  • Deep Learning TutorialsEducational resources and tutorials focused on building and training neural networks using mathematical frameworks.
Distributed and Scaling Strategies5 sub-tags
  • Fine-Tuning and Customization5 sub-tags
  • Machine Learning PrototypingEnvironments and utilities for rapid experimentation with model architectures and data pipelines.
  • No-Code Training InterfacesPlatforms that allow model training through configuration or UI without manual coding.
  • Training Evaluation1 sub-tagMethods and tools for assessing the performance and accuracy of machine learning models during the training phase.
  • Training Frameworks6 sub-tagsComprehensive software libraries that provide the infrastructure and APIs necessary to train machine learning models.
  • Training HyperparametersConfiguration settings that control the learning process and model optimization behavior.
  • Training Utilities1 sub-tagSupport software that assists in the training process, including tools for tracking metrics, logging progress, and debugging experiments.