1 repo
Integrated tools for distilling and fine-tuning models for reasoning tasks.
Distinguishing note: Focuses on the complete training lifecycle for reasoning-specialized models.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Reasoning Model Training Suites. Refine with filters or upvote what's useful.
Open-r1 is a framework designed for the large-scale training, distillation, and optimization of language models focused on complex reasoning and programming tasks. It provides a comprehensive suite of tools for managing distributed training jobs across multi-node clusters, enabling the development of high-performance models through reinforcement learning and supervised fine-tuning. The project distinguishes itself by integrating secure, containerized code execution environments directly into the training and evaluation lifecycle. By allowing models to run and verify code snippets against test
Provides a collection of tools and workflows for distilling, fine-tuning, and optimizing large language models for complex reasoning and coding tasks.