11 个仓库
Utilities for configuring and tuning training parameters such as learning rates and scheduling policies.
Distinguishing note: Focuses on the configuration of training hyperparameters, distinct from the orchestration of the training loop itself.
Explore 11 awesome GitHub repositories matching artificial intelligence & ml · Hyperparameter Optimization Tools. Refine with filters or upvote what's useful.
Recommenders is a recommendation system framework designed for building, benchmarking, and deploying collaborative and content-based filtering models. It provides a machine learning model pipeline that standardizes the process of moving recommendation data from raw ingestion through training and evaluation. The project functions as a model benchmarking toolkit, utilizing standardized ranking and error metrics to compare the accuracy of different algorithms. It also serves as a hyperparameter tuning tool, allowing for the optimization of model behavior and performance via external configuratio
Includes utilities for optimizing model behavior and performance via external configuration of training hyperparameters.
This repository serves as a comprehensive educational resource and study guide for mastering deep learning principles and neural network architectures. It provides a structured curriculum that covers the fundamental components of artificial intelligence, including backpropagation, optimization algorithms, and model performance tuning. The collection distinguishes itself by offering curated academic materials and practical implementation examples that bridge the gap between theoretical concepts and hands-on application. It includes specialized instructional guides for developing models capable
Provides utilities for configuring and tuning training hyperparameters to control model convergence and behavior.
ai-edu is a comprehensive AI education curriculum and machine learning courseware collection. It provides theoretical tutorials, deep learning lab exercises, and project blueprints designed to teach artificial intelligence fundamentals through a combination of study and practical implementation. The project focuses on a learning-by-doing approach, guiding users from Python programming and neural network basics to advanced topics. It includes specialized instructional content on distributed AI training, MLOps educational guides for model quantization and pruning, and detailed frameworks for im
Provides guides for tuning training parameters such as learning rates and regularization to prevent model overfitting.
This project is a structured learning curriculum and technical reference for mastering deep learning with TensorFlow. It provides a comprehensive guide for building, training, and deploying neural networks, combining theoretical fundamentals with practical implementation examples. The repository distinguishes itself by covering the end-to-end machine learning workflow, from low-level tensor mathematics and linear algebra to the creation of complex model architectures. It includes specific guidance on developing data pipelines for diverse data types, such as images, text, and time-series seque
Tunes training parameters like learning rates and schedules to optimize model performance.
This is a scikit-learn automated machine learning framework designed to optimize model selection and hyperparameters. It functions as an automated model selector and hyperparameter optimization tool for classification and regression tasks, utilizing an automated ensemble builder to combine high-performing models for increased predictive accuracy. The system features a distributed search engine that uses Dask for parallel machine learning optimization across CPU cores or clusters. It implements a budget-based evaluation strategy through successive halving to prioritize promising model configur
Provides a framework for tuning model settings using advanced techniques like Bayesian optimization and successive halving.
Surprise is a Python library for building and analyzing recommendation systems. It provides a comprehensive toolkit for implementing collaborative filtering to predict user preferences and generate item suggestions based on historical rating patterns. The library includes dedicated tools for hyperparameter optimization and model evaluation. It allows for searching through parameter sets to find the most effective configurations and utilizes a suite of metrics to measure prediction accuracy. The framework covers the full development workflow, including data loading from various sources, the c
Includes a dedicated utility for searching parameter sets to find the most effective configuration for models.
Skorch 是一个将 PyTorch 神经网络包装在 scikit-learn 兼容接口中的库,允许在标准机器学习流水线和超参数优化工具中使用深度学习模型。它充当数据适配器、训练管理器和优化工具,弥合了深度学习模块与传统机器学习工作流之间的差距。 该项目通过提供用于自动化 PyTorch 训练生命周期的工具包而脱颖而出,包括集成的检查点保存、提前停止和学习率调度。它还通过用于冻结特定模型层和针对特定任务微调预训练权重的实用程序,实现了迁移学习。 能力面扩展到数据转换,包括将表格数据和数值数组转换为张量格式以及注册文本分词器。它还提供了用于硬件加速选择、即时模块编译以及用于不确定性量化的概率数据建模的工具。 该系统包括用于将超参数映射到命令行参数的实用程序,以确保实验的可重复性。
Implements a system for tuning PyTorch model hyperparameters via standard grid search and cross-validation.
FLAML 是一个自动化机器学习框架、超参数优化工具和大型语言模型代理编排器。它提供了一个用于跨各种学习器和数据集进行模型选择和调优的系统,同时也提供了一个用于优化大型语言模型推理参数和微调设置的工具包。 该项目具有元学习调优系统,可分析历史任务数据以生成依赖于数据的默认配置,从而加速模型收敛。它进一步通过对话式工作流和事件驱动编排,支持协作式多代理系统的设计。 能力涵盖了针对机器学习模型和任意 Python 函数的资源高效超参数搜索,支持分层搜索空间和字典序目标优化。该框架还包括用于自动化模型选择、堆叠集成构建、零样本配置以及强制执行公平性约束的实用工具。 该系统支持分布式调优扩展和跨计算集群的并发试验执行,以缩短总搜索时长。
Searches for optimal configurations for machine learning models and arbitrary Python functions under resource constraints.
This project is a collection of scripts and workflows for training, fine-tuning, and deploying large language models using the Hugging Face Transformers toolkit. It functions as a distributed training framework, a library for natural language processing task implementations, and a system for building retrieval-augmented generation chatbots. The repository includes specialized tools for model optimization, such as a Bayesian hyperparameter optimizer for automatically tuning model settings. It provides implementations for scaling model training across multiple graphics processors using data par
Includes a Bayesian optimization tool for automatically tuning language model training parameters.
This project is a collection of pretrained reinforcement learning agents and training scripts built on Stable Baselines3 and Gymnasium. It provides a framework for training agents to solve specific tasks, managing experiment reproducibility, and deploying pretrained models. The system includes a specialized benchmarking suite and optimization tools for tuning agent settings. It utilizes automated search spaces and distributed trials to maximize performance, while employing bootstrap sampling to generate statistically robust performance metrics and confidence intervals. Broad capabilities cov
Tunes agent settings through automated search spaces and distributed trials to maximize performance.
该项目是一个用于在 PyTorch 生态系统中实施知识蒸馏和管理深度学习实验的框架。它提供了一个结构化的环境,用于训练紧凑的学生模型以模仿大型教师模型的输出分布,旨在提高推理效率。 该框架通过将模型架构与损失函数解耦而脱颖而出,允许灵活组合教师和学生组件。它集成了自动超参数网格搜索功能,以系统地识别最佳训练配置,这些配置通过外部文件序列化进行管理,以确保实验运行的一致性。 除了核心的蒸馏和优化功能外,该工具包还包括针对训练会话的全面生命周期管理。这包括用于故障恢复的持久化检查点、实时进度监控,以及将性能指标聚合为结构化摘要以进行对比分析的功能。
Automates systematic searches across parameter ranges to identify the most effective settings for deep learning models.