Secure infrastructure for validating model-generated code.
Distinguishing note: Focuses on containerized isolation for AI training and evaluation.
Open-r1 is a framework designed for the large-scale training, distillation, and optimization of language models focused on complex reasoning and programming tasks. It provides a comprehensive suite of tools for managing distributed training jobs across multi-node clusters, enabling the development of high-performance models through reinforcement learning and supervised fine-tuning. The project distinguishes itself by integrating secure, containerized code execution environments directly into the training and evaluation lifecycle. By allowing models to run and verify code snippets against test
Provides a secure infrastructure for running and validating model-generated code snippets within isolated containers during training and evaluation workflows.
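To make the idea of validating model-generated code against test cases concrete, here is a minimal Python sketch. It is not Open-r1's API: the names `run_snippet` and `pass_rate` and the stdin/stdout test-case format are hypothetical, chosen only for illustration. It also runs each snippet in a plain subprocess with a timeout, which is not real isolation; a production setup would presumably run the same step inside a container or a remote sandbox, as the description above suggests.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_snippet(code: str, stdin_text: str = "", timeout_s: float = 5.0) -> tuple[bool, str]:
    """Run `code` in a fresh Python subprocess and return (succeeded, stdout).

    Hypothetical helper: a subprocess plus a timeout is only a stand-in for
    the container-level isolation a real sandbox would provide.
    """
    with tempfile.TemporaryDirectory() as workdir:
        script = Path(workdir) / "snippet.py"
        script.write_text(code, encoding="utf-8")
        try:
            proc = subprocess.run(
                # -I runs Python in isolated mode (ignores env vars and user site-packages)
                [sys.executable, "-I", str(script)],
                input=stdin_text,
                capture_output=True,
                text=True,
                timeout=timeout_s,
                cwd=workdir,
            )
        except subprocess.TimeoutExpired:
            # Treat snippets that exceed the time budget as failures.
            return False, ""
    return proc.returncode == 0, proc.stdout


def pass_rate(code: str, test_cases: list[tuple[str, str]]) -> float:
    """Fraction of (stdin, expected_stdout) cases the snippet passes.

    A score like this could serve as a verification signal during training
    or evaluation; the exact scoring rule here is an assumption.
    """
    if not test_cases:
        return 0.0
    passed = 0
    for stdin_text, expected in test_cases:
        ok, out = run_snippet(code, stdin_text)
        if ok and out.strip() == expected.strip():
            passed += 1
    return passed / len(test_cases)
```

For example, `pass_rate("print(int(input()) * 2)", [("2", "4"), ("5", "10")])` runs the snippet once per case and compares the trimmed output with the expected string. Keeping execution behind a single `run_snippet` function means the subprocess call could later be swapped for a container-backed runner without changing the scoring logic.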