1 repo

Awesome GitHub RepositoriesModel Distillation Pipelines

Frameworks for transferring knowledge from large reasoning models to smaller, more efficient ones.

Distinguishing note: Focuses on supervised fine-tuning distillation for reasoning tasks.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Model Distillation Pipelines. Refine with filters or upvote what's useful.

Find the best repos with AI.We'll search the best matching repositories with AI.

huggingface/open-r1
huggingface/open-r1
25,887View on GitHub
Open-r1 is a framework designed for the large-scale training, distillation, and optimization of language models focused on complex reasoning and programming tasks. It provides a comprehensive suite of tools for managing distributed training jobs across multi-node clusters, enabling the development of high-performance models through reinforcement learning and supervised fine-tuning. The project distinguishes itself by integrating secure, containerized code execution environments directly into the training and evaluation lifecycle. By allowing models to run and verify code snippets against test
Trains smaller models on outputs from larger reasoning models to improve task-specific performance.
Python
25,887View on GitHub