25 repos
Explore 25 awesome GitHub repositories matching artificial intelligence & ml · Training & Tuning. Refine with filters or upvote what's useful.
This project is a community-curated knowledge base that organizes vast technical ecosystems into a hierarchical, human-readable directory. It serves as a comprehensive index of libraries, frameworks, and methodologies, designed to facilitate discovery and professional development across the entire spectrum of software
Exposes a curated directory of community-vetted resources for building and deploying machine learning models.
This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th
Highlights high-performance frameworks designed for building, training, and tuning complex neural network architectures.
This project is a centralized, community-driven repository of hands-on tutorials designed to facilitate skill acquisition through the practical construction of real-world software applications. It serves as a comprehensive directory that aggregates external documentation and instructional materials, providing a structu
Train neural networks and process large-scale datasets by applying mathematical frameworks in real-world project settings.
Openclaw is a platform for managing agent execution environments, providing the infrastructure to control agent lifecycles, session state, and workspace persistence. It features a centralized gateway that handles model loops, tool invocation, and streaming events, while supporting multi-agent routing and persistent mem
Generates structured JSONL logs and console output with configurable redaction and traffic diagnostics.
TensorFlow is a comprehensive machine learning framework designed for the construction, training, and deployment of complex mathematical models. It utilizes a graph-based execution model that represents operations as directed acyclic graphs, enabling automatic differentiation and efficient parallel processing. The syst
Coordinates decentralized training across network nodes to preserve data privacy while aggregating model updates.
Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering
Standardizes the training, fine-tuning, and deployment of models across diverse hardware acceleration backends.
PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe
Shards model parameters, gradients, and optimizer states across processes to enable memory-efficient distributed training.
This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent
End-to-end recipes provide step-by-step instructions for customizing and fine-tuning open-source language models.
Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining ful
Refines facial recognition accuracy by iteratively adjusting detection thresholds and re-processing datasets to include previously unidentified faces.
This repository serves as an educational framework for building large language models from the ground up. It provides a structured curriculum that guides learners through the end-to-end lifecycle of model development, including data processing, architecture design, and optimization. By focusing on low-level implementat
Establishes a structured environment for building and training custom language models to master the development lifecycle.
This repository serves as a centralized collection of state-of-the-art deep learning architectures and reference implementations designed for research and application development. It provides a comprehensive toolkit for computer vision and natural language processing, offering pre-built models and training pipelines fo
Standardizes model architecture patterns, training loops, and data pipelines to facilitate maintainable machine learning workflows.
This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners
Organizes end-to-end workflows that manage data sourcing, model training, and performance validation.
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we
Explores strategies for combining multiple specialized model weights into a single unified architecture.
This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, t
Supplies a diverse collection of labeled datasets essential for training, validating, and benchmarking predictive models.
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into d
Construct custom recognition models using provided training scripts and makefiles to optimize performance for specific document types.
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
Outlines best practices for managing model checkpoints and fine-tuning parameters to optimize performance for specialized domains.
LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface. The pro
Simplifies complex model refinement by offering a unified interface for both full-parameter and efficient training methods.
Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a di
Coordinates automated workflows for training loops, batch processing, and validation dataset management.
This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima
Automates the end-to-end workflow for sourcing data, training neural models, and validating synthesis performance.
YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning
Fine-tunes computer vision models on specialized or proprietary datasets to recognize unique objects.