Open Instruct

Open-Instruct is a distributed training and instruction tuning framework for large language models. It functions as a coordinator for supervised fine-tuning, reinforcement learning from human feedback pipelines, and tool-use training, providing specialized roles for dataset curation and model alignment.

The project distinguishes itself through a high-performance training architecture that utilizes actor-based distributed coordination and hybrid sharding to manage large GPU clusters. It implements advanced alignment techniques including direct preference optimization, group relative policy optimization, and a dynamic rubric system that evolves evaluation criteria via judge models.

The framework covers a broad capability surface including instruction dataset engineering with contamination detection, the generation of preference-pair datasets, and the integration of external environments for tool-use learning. It also includes GPU-efficient training kernels, tensor parallelism for layer splitting, and performance benchmarking tools.

Features

Instruction Tuning Frameworks - Provides a comprehensive framework for training large language models to follow specific user commands through supervised fine-tuning.

Large-Scale Model Training - Manages the training of large-scale models across GPU clusters using sharding and replication.

Reinforcement Learning Policy Improvement - Refines model policies through iterative rollouts and reward signals from judges or environments.

Reward Modeling - Trains reward models to evaluate and score text generation quality for preference alignment using optimized GPU operations.

Preference Alignment - Aligns model outputs with human preferences using reward models and direct preference optimization.

Text Dataset Curators - Implements pipelines to filter, format, and unify instruction datasets for large language model training.

Trainer Coordination - Provides a coordinator for managing environment initialization and worker scaling across large GPU clusters during distributed training.

Distributed Training Sharding - Balances memory efficiency and communication speed by partitioning model parameters and optimizer states across compute nodes.

Tool-Use Training - The project trains models to interact with external tools by appending environment outputs back into the conversation.

Instruction Tuning Datasets - Downloads and unifies diverse instruction datasets into a consistent chat format for model alignment.

Model Sharding and Replication - Combines full model sharding with replication across GPU groups for efficient large-scale training.

Reinforcement Learning Training Pipelines - Provides orchestration tools to manage the lifecycle of reinforcement learning training through rollouts and policy optimization.

RLHF Training Pipelines - Ships end-to-end pipelines that combine reward model training and policy optimization for human-feedback alignment.

Supervised Fine-Tuning - Performs supervised instruction tuning using GPU-efficient implementations and parameter-efficient methods like LoRA.

Tensor Parallelism - Distributes individual model layers across multiple GPUs using tensor parallelism to enable training of exceptionally large models.

Tool-Using Agents - Trains models to interact with external tools and environments by parsing calls and integrating feedback.

Training Dataset Preparation - Mixes, tokenizes, and filters multiple text datasets into a unified format for training.

Distributed Actor Frameworks - Utilizes a distributed actor framework to coordinate training and communication across large GPU clusters.

Agentic Tool-Use Frameworks - Provides a framework for teaching models to interact with external tools and environments by parsing calls and integrating feedback.

Direct Preference Optimization - Aligns model outputs with human preferences using a paired dataset without requiring a separate reward model.

Preference Pair Generators - Evaluates multiple completions using a judge model to create preferred and rejected response pairs.

Fused GPU Kernel Composition - Implements fused GPU kernels to maximize throughput during supervised fine-tuning and reward model training.

Native Tool Call Parsers - Implements native parsing of model outputs to extract structured tool calls and argument fragments.

Model Performance Benchmarking - Implements standardized tests and benchmarks to evaluate the quality and core capabilities of trained models.

Preference-Based Fine-Tuning - Refines model behavior using preference-based tuning with support for both full and quantized parameter updates.

Preference Alignment Datasets - Generates preference-pair datasets by using a judge model to rank multiple model completions.

Verifiable Reward Training - Refines model accuracy and reasoning using reinforcement learning with verifiable reward signals.

Group Relative Policy Optimization - Implements iterative reinforcement learning loops that update weights by comparing rewards across groups of trajectories.

Tool-Calling Schemas - Employs structured tool-calling schemas to extract tool calls from model text.

Tool-Use Environments - Supports the integration of stateless tools and stateful environments to provide interaction feedback during training.

Rubric-Based Evaluators - Uses rubric-based evaluators and judge models to iteratively refine training rewards.

Dataset Contamination Detection - Measures overlap between instruction tuning datasets and evaluation benchmarks to prevent biased assessment.

Single Agent Optimization - Post-training framework for open language models.

allenaiopen-instruct

Features

Star history