Awesome GitHub Repositories: GRPO Training

Methods for training models using Group Relative Policy Optimization to improve reasoning capabilities.

Explore 1 awesome GitHub repository matching Artificial Intelligence & ML · GRPO Training.

unslothai/unsloth
52,461 · GitHub
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade
Python · agent · deepseek · deepseek-r1
