195 repository-uri
Algorithms and procedures for training predictive models on prepared datasets.
Distinguishing note: Focuses on support vector machine training.
Explore 195 awesome GitHub repositories matching artificial intelligence & ml · Model Training. Refine with filters or upvote what's useful.
This project is a fine-tuning framework and training pipeline designed to optimize and adapt large language and vision models. It provides a specialized toolkit for parameter-efficient tuning and supervised learning, serving as both a trainer for multimodal models and a deployment tool for serving fine-tuned models via high-performance inference engines. The framework focuses on reducing memory and compute requirements by updating a small subset of model parameters. It supports a wide range of adaptation strategies, including vision-language model training to align text, image, video, and aud
Saves trained models in multiple formats to ensure compatibility with specific runtime environments.
This project is an interactive data science environment that combines code execution, rich media visualization, and narrative documentation into a persistent, browser-based platform. It serves as a comprehensive educational resource for scientific computing, providing a framework for iterative data analysis and machine learning prototyping. The environment is distinguished by its focus on high-performance numerical computing, utilizing vectorized array operations and memory-mapped data structures to handle large-scale computations efficiently. It features a unified estimator interface that st
Implements support vector machine training for binary classification.
This project is a comprehensive Chinese translation of a technical deep learning textbook, providing an educational resource on the theory and implementation of neural networks. It functions as a collaborative technical translation project designed to make complex academic AI literature accessible to non-English speakers. The project utilizes a community-driven translation model that integrates external suggestions and pull requests to refine linguistic accuracy and reduce bias. It employs standardized terminology mapping to ensure a uniform vocabulary throughout the translated content. To i
Describes the procedures for adversarial training to increase model robustness against perturbed inputs.
Detectron2 is a PyTorch computer vision framework and visual recognition platform designed for training and deploying models for object detection, image segmentation, and visual recognition. It provides a research-oriented environment for training complex vision models with multi-GPU acceleration. The project includes a specialized object detection library for identifying and locating multiple objects via bounding boxes, as well as an image segmentation toolkit for creating pixel-level masks through instance, semantic, and panoptic segmentation. Additionally, it features a human pose estimati
Provides utilities to export trained weights into portable formats for production and mobile deployment.
Lightning is a PyTorch training framework and distributed AI training orchestrator designed to decouple core research logic from the engineering boilerplate required for model training. It functions as a deep learning workflow manager that automates the process of pretraining and finetuning models across diverse compute environments. The project distinguishes itself by providing a hardware-agnostic training wrapper, allowing the same model code to execute on CPUs, GPUs, or TPUs without modification. It further manages the scaling of workloads from single devices to multi-node clusters and ser
Converts trained models into standardized formats to enable deployment in production environments.
This project is an educational platform and research toolkit designed to teach deep learning through a combination of mathematical theory, visual diagrams, and executable code. It provides a comprehensive environment for building, training, and evaluating neural networks, grounding complex concepts in interactive computational notebooks that allow for hands-on experimentation. The framework distinguishes itself by interleaving theoretical foundations—including linear algebra, calculus, and probability—with practical implementations across multiple industry-standard libraries. It supports flex
Executes deep learning model training using minibatch stochastic gradient descent and cross-entropy loss.
XGBoost is a distributed machine learning library for implementing scalable gradient boosting decision trees used for regression, classification, and ranking. It functions as a predictive model framework and a cross-language toolkit, providing a core implementation with native bindings for Python, R, Java, Scala, and C++. The system is designed as a GPU-accelerated library that utilizes CUDA and NCCL to speed up the training of decision tree ensembles. It operates as a distributed framework capable of scaling training and prediction across multi-node clusters and GPU environments to process m
Allows extraction of specific boosting rounds from a trained ensemble to isolate the best performing iteration.
Audiocraft is a deep learning audio library and machine learning framework designed for training, fine-tuning, and evaluating generative models for music and sound effects. It functions as a text-to-music generative model and a neural audio codec, providing the tools necessary to compress audio signals into discrete representations and synthesize high-fidelity waveforms from textual descriptions. The framework is distinguished by its ability to combine multiple conditioning signals, allowing for the generation of audio based on text prompts, melodic excerpts, or style-based audio clips. It al
Exports trained model weights into binary formats compatible with generation APIs.
This project is a comprehensive educational resource and technical documentation suite for learning and developing deep learning models. It serves as an open-source textbook, implementation manual, and framework tutorial designed to guide users through the mathematical foundations and practical application of neural networks. The resource provides detailed instructional content on building various model architectures, including convolutional and recurrent neural networks. It includes a dedicated distributed training guide and a learning path that covers the fundamentals of tensors, automatic
Guides the execution of training loops to optimize network weights for data categorization and classification.
This project is a collection of interactive instructional documents and practical code samples designed as a machine learning educational resource. It consists of Jupyter notebooks that provide runnable examples and guided exercises for learning deep learning and model development. The repository features Keras model implementations that demonstrate how to build and train neural network architectures for processing images, objects, and natural language. It includes capabilities for executing the same model code across different computation engines to compare framework behavior and performance
Provides mechanisms to download training sets and weights to feed into model training processes.
This project is a transformer-based framework for generating dense and sparse vector embeddings of text and multimodal data. It serves as a library for fine-tuning models to perform semantic similarity tasks, retrieval, and reranking. The system is distinguished by its support for diverse architectural patterns, including bi-encoders for fast similarity search and cross-encoders for high-precision reranking. It provides dedicated pipelines for multimodal embeddings, mapping text and images into a shared vector space, and implements knowledge distillation to compress large models into smaller,
Enables the generation of fixed-dimension dense vectors from text for similarity search and clustering.
LightGBM is a gradient boosting framework used to train decision tree ensembles for classification, regression, and ranking tasks. It functions as a distributed machine learning library and a decision tree ensemble implementation that utilizes leaf-wise growth and histogram-based feature binning. The framework is distinguished by its ability to offload heavy computations to CUDA or OpenCL devices for GPU acceleration and its capacity to parallelize training across multiple nodes using sockets, MPI, or Dask. It includes a specialized categorical feature processor that optimizes partitions for
Exports trained models into C++ code or standalone files for production deployment without a runtime.
This repository is an educational collection of deep learning implementations designed to demonstrate the fundamental principles of neural network architecture and optimization. It provides a comprehensive resource for understanding machine learning through hands-on code examples, ranging from basic multilayer perceptrons to complex generative models. The project distinguishes itself by emphasizing the manual construction of models, including the implementation of backpropagation from scratch to illustrate core mathematical mechanics. It covers a wide array of architectural design patterns, s
Implements adversarial training loops where generator and discriminator networks compete to improve synthetic data generation.
AISystem is a comprehensive AI full-stack infrastructure project covering the entire pipeline from AI chip architecture to high-level training frameworks. It encompasses the development of AI compiler frameworks, inference engines, and distributed training orchestrators designed to coordinate workloads across a heterogeneous compute stack of CPUs, GPUs, and NPUs. The project focuses on the deep integration of software and hardware, employing software-hardware co-design to align tensor layouts with physical memory structures. It provides specialized capabilities for accelerating Transformer mo
Optimizes the balance between compute power, memory bandwidth, and precision to accelerate large-scale model training.
Openface is a deep learning toolkit designed for facial recognition and identity verification. It provides a comprehensive pipeline for detecting faces, aligning landmarks, and transforming facial images into compact numerical vectors. By utilizing these embeddings, the system enables identity classification and similarity comparison through geometric distance calculations. The project distinguishes itself by integrating research-oriented diagnostic tools alongside its core recognition capabilities. It includes utilities for visualizing high-dimensional feature clusters, inspecting internal c
Trains support vector machines on facial representations to categorize individuals with automated parameter optimization.
dalle-mini is a text-to-image model and generative AI system designed to transform natural language descriptions into synthetic images. It functions as an image generation training toolkit and a generative model capable of creating visual representations from text prompts. The project provides a containerized deployment for consistent execution across different computing environments. It includes the necessary scripts and configuration files to train custom generative models from datasets. The system utilizes an autoregressive transformer architecture that treats visual data as discrete toke
Employs a vector-quantized variational autoencoder to compress images into a learned discrete vocabulary.
StyleGAN is a TensorFlow-based generative adversarial network framework designed for the synthesis of high-resolution synthetic imagery. It utilizes a style-based generator architecture to create realistic visual assets from latent vectors, focusing on the production of high-fidelity images. The system incorporates style mixing and stochastic noise injection to control visual attributes and fine-grained details. It uses adaptive instance normalization and progressive resolution upsampling to manage image quality and variety across different resolutions. The framework covers the full lifecycl
Implements adversarial training procedures to optimize the generator and discriminator networks.
Wan2.2 is a generative video artificial intelligence system designed to synthesize visual media by interpreting natural language instructions. It functions as a text-to-video diffusion model that transforms written concepts into coherent motion sequences through deep learning and latent space manipulation. The system utilizes a transformer-based architecture to process video data as a series of tokens, allowing it to capture complex spatial and temporal relationships. By employing a temporal attention mechanism, the model maintains visual consistency across frames, while its latent space appr
Applies variational autoencoder compression to map high-resolution visual data into compact latent spaces for efficient processing.
PaddleDetection is an object detection framework designed for the end-to-end development, training, and deployment of computer vision models. It provides a comprehensive library of modular neural network architectures and pipelines that support object detection, instance segmentation, and multi-object tracking tasks. The project distinguishes itself through a configuration-driven approach that decouples model components like backbones and heads, allowing for the flexible assembly of custom vision workflows. It incorporates advanced techniques such as anchor-free detection logic, joint detecti
Exports trained detection and tracking models into portable formats for production environments.
Latent Diffusion is a framework for high-resolution image synthesis that performs the denoising process within a compressed latent space. It uses variational autoencoders to encode images into a lower-dimensional representation, reducing the computational cost of noise prediction compared to operating on raw pixels. The project enables text-to-image generation by integrating natural language descriptions through cross-attention conditioning. It also supports image inpainting and restoration, filling masked or missing image areas with generated content, and example-based synthesis using retrie
Uses variational autoencoders to compress high-resolution images into a lower-dimensional latent space.