Open-source frameworks that automate hyperparameter optimization and model selection for machine learning workflows.
This is a scikit-learn automated machine learning framework designed to optimize model selection and hyperparameters. It functions as an automated model selector and hyperparameter optimization tool for classification and regression tasks, utilizing an automated ensemble builder to combine high-performing models for increased predictive accuracy. The system features a distributed search engine that uses Dask for parallel machine learning optimization across CPU cores or clusters. It implements a budget-based evaluation strategy through successive halving to prioritize promising model configur
This framework provides a comprehensive suite for automated model selection and hyperparameter optimization, integrating seamlessly with scikit-learn to handle the entire machine learning pipeline from preprocessing to distributed training.
AutoGluon is an automated machine learning framework designed to optimize model selection and hyperparameter tuning across tabular, text, image, and time series data. It functions as an ensemble learning library and a tabular data prediction engine, aiming to build high-accuracy predictive models without manual algorithm selection. The framework integrates multimodal machine learning pipelines that combine disparate data types into a single representation using specialized encoders. It also includes a probabilistic time series forecaster that fits multiple statistical and deep learning models
AutoGluon is a comprehensive AutoML framework that automates model selection, hyperparameter optimization, and pipeline orchestration across diverse data types, directly fulfilling all the requirements for an automated machine learning solution.
AutoGluon is an automated machine learning framework and multimodal library designed to automate the end-to-end pipeline from data preprocessing to high-accuracy model training and validation. It functions as an automated model trainer for tabular, image, text, and time series data, as well as a tool for time series forecasting and foundation model finetuning. The project is distinguished by its ability to jointly process and fuse different data types, allowing for the construction of multimodal neural networks that integrate images, text, and structured tables. It supports zero-shot inferenc
AutoGluon is a comprehensive AutoML framework that automates the entire pipeline, including model selection, hyperparameter optimization, and ensemble construction across tabular, image, and text data.
Nevergrad is a gradient-free optimization library and hyperparameter optimization framework designed to find the minimum of objective functions without using derivatives. It serves as an asynchronous optimization engine that decouples parameter suggestions from result reporting to support parallel function evaluations. The project specializes in multi-objective optimization to identify Pareto fronts for competing goals and provides a suite for benchmarking the performance and convergence of different optimization routines. It supports black-box system optimization, enabling the tuning of exte
Nevergrad is a powerful gradient-free optimization framework that provides the core hyperparameter optimization and distributed search capabilities required for AutoML, though it functions as a general-purpose optimization engine rather than a specialized end-to-end machine learning pipeline builder.
This project is a recommendation system framework designed for building, evaluating, and operationalizing personalized item suggestion engines. It provides a comprehensive toolkit for implementing collaborative filtering and content-based algorithms, supported by an end-to-end machine learning pipeline for preparing datasets and deploying predictive models. The framework distinguishes itself through the integration of knowledge graphs to provide richer context for recommendations and the use of industry-specific patterns to accelerate system deployment. It also includes a specialized model ev
This framework provides a comprehensive suite for building and optimizing recommendation systems, including hyperparameter tuning and distributed training, though it is specialized for recommendation tasks rather than general-purpose AutoML.
Optuna is a Python-based hyperparameter optimization framework designed to automate the search for optimal machine learning model configurations. It functions as a Bayesian optimization library that systematically tests parameter combinations to maximize or minimize objective functions, streamlining the model development process through iterative evaluation. The project distinguishes itself through a define-by-run dynamic construction model, which allows users to build complex, conditional search spaces using standard programming logic. Its architecture is highly modular, featuring a pluggabl
Optuna is a powerful hyperparameter optimization framework that automates the search for model configurations and supports distributed training, though it functions as a specialized library for optimization rather than an end-to-end AutoML pipeline that handles full model selection and data preprocessing.
This is a Bayesian optimization library for Python designed to find the maximum value of expensive black box functions. It operates as a global optimizer that uses probabilistic models to identify the peak value of unknown functions through iterative sampling. The tool is specifically designed for hyperparameter tuning in machine learning, where it maximizes model performance while minimizing the number of required training runs. It treats the target function as a black box, selecting optimal input parameters based on statistical priors to reduce manual trial and error. The system utilizes G
This is a specialized library for Bayesian optimization that provides the hyperparameter tuning component, but it lacks the broader model selection and end-to-end pipeline automation features required for a full AutoML framework.
This is a PyTorch library and framework for self-supervised vision learning. It provides an implementation of masked autoencoders and vision transformers designed to learn image representations by reconstructing masked image patches from unlabeled data. The project features a distributed training pipeline that scales workloads across multiple GPU nodes. This infrastructure includes multi-node orchestration and gradient accumulation to manage large batch sizes and coordinate resource requests across clusters. The toolkit covers a complete workflow from self-supervised masked pre-training to d
This is a specialized library for self-supervised vision pre-training and masked autoencoders rather than an AutoML framework designed for automated model selection and hyperparameter optimization.
DeepSpeedExamples is a collection of reference implementations and scripts for training, fine-tuning, and executing inference on large-scale AI models using DeepSpeed optimization. It provides a distributed model training guide and practical workflows for adapting large language models through memory-efficient techniques. The repository includes specialized implementations for pipeline parallelism to handle models exceeding single GPU memory and a suite of examples for ZeRO memory optimization to reduce per-device overhead. It also features standardized test suites for benchmarking the throug
This repository provides reference implementations and optimization techniques for distributed training and inference, but it is a collection of examples for a specific optimization library rather than an automated machine learning framework that handles model selection and hyperparameter tuning.
This project is an automated machine learning framework and toolkit designed for training and tuning custom models for classification, regression, and recommendations. It functions as a multimodal machine learning toolkit capable of processing and training models using a combination of text, image, audio, and sensor data. The framework distinguishes itself as a multimodal data processor that can handle and visualize large datasets on a single machine using column-oriented disk storage. It includes a core machine learning model generator that converts trained models into formats compatible wit
This framework provides automated tools for model selection and training across various data types, though it focuses more on simplifying model creation for native applications than on general-purpose hyperparameter optimization pipelines.
PyTorch Lightning is a high-level deep learning framework for PyTorch that automates training loops and removes repetitive engineering boilerplate. It functions as a structured pipeline for managing machine learning experiments, providing a distributed training orchestrator and tools for mixed-precision training. The framework decouples scientific model architecture from the engineering required for infrastructure and scaling. This separation allows the same model code to execute across CPUs, GPUs, or TPUs through a hardware-agnostic execution engine and a centralized trainer that manages the
This is a deep learning framework designed to simplify training loops and infrastructure management, but it does not perform automated model selection or hyperparameter optimization as required by the AutoML category.
This project is a comprehensive framework for the training, fine-tuning, and deployment of large language models. It functions as a distributed deep learning platform that enables users to scale model workflows across multiple hardware nodes while providing tools for model evaluation and performance benchmarking. The platform distinguishes itself by offering specialized utilities for model compression and weight transformation, allowing users to reduce memory footprints and latency through quantization and pruning. It supports the adaptation of large models for consumer-grade hardware, facili
This is a distributed deep learning and LLM-specific training platform, but it lacks the automated model selection and hyperparameter optimization capabilities required for an AutoML framework.
This project is a collection of scripts and workflows for training, fine-tuning, and deploying large language models using the Hugging Face Transformers toolkit. It functions as a distributed training framework, a library for natural language processing task implementations, and a system for building retrieval-augmented generation chatbots. The repository includes specialized tools for model optimization, such as a Bayesian hyperparameter optimizer for automatically tuning model settings. It provides implementations for scaling model training across multiple graphics processors using data par
This repository is a collection of scripts and training workflows for fine-tuning large language models rather than a general-purpose AutoML framework designed for automated model selection and pipeline optimization.
LightGBM is a gradient boosting framework used to train decision tree ensembles for classification, regression, and ranking tasks. It functions as a distributed machine learning library and a decision tree ensemble implementation that utilizes leaf-wise growth and histogram-based feature binning. The framework is distinguished by its ability to offload heavy computations to CUDA or OpenCL devices for GPU acceleration and its capacity to parallelize training across multiple nodes using sockets, MPI, or Dask. It includes a specialized categorical feature processor that optimizes partitions for
LightGBM is a high-performance gradient boosting library used to train individual models, but it does not provide the automated model selection or hyperparameter optimization pipelines required for an AutoML framework.
MONAI is a PyTorch-based deep learning framework and library specifically designed for healthcare imaging. It provides a suite of domain-specific neural network architectures, specialized loss functions, and preprocessing pipelines tailored for analyzing multi-dimensional medical data. The project distinguishes itself through a decentralized federated learning system that allows models to learn from datasets across multiple institutions without exchanging raw patient images. It also features AI-assisted medical image annotation tools and a standardized model bundling system to ensure consiste
This is a specialized deep learning framework for medical imaging rather than a general-purpose AutoML tool, as it provides the building blocks for model development rather than automating model selection and hyperparameter optimization.
Swin-Transformer is a deep learning framework designed for training and deploying hierarchical vision transformer models. It serves as a research library and toolkit for computer vision tasks, providing the infrastructure to build models that replace standard convolution operations with sliding window self-attention mechanisms. By utilizing a multi-scale feature hierarchy, the framework enables the processing of visual data at varying resolutions and spatial scales. The project distinguishes itself through its implementation of shifted window partitioning, which facilitates global information
This is a specialized deep learning library for vision transformer architectures rather than an AutoML framework designed to automate model selection and hyperparameter optimization across different algorithms.
Corenet is a deep learning training framework and computer vision model library designed for developing neural networks across vision, text, and audio modalities. It functions as a distributed training orchestrator for scaling workloads across multiple compute nodes and provides a multimodal data pipeline for processing image, text, and video data. The project includes a model conversion toolkit for transforming weights and architectures between different machine learning frameworks. It also provides tools for optimizing model performance on Apple Silicon and reducing response latency in gene
This is a deep learning training and orchestration framework for developing and scaling neural networks, but it lacks the automated model selection and hyperparameter search capabilities required for an AutoML framework.