1 repo
Frameworks and utilities for implementing reward-based training workflows to optimize model reasoning and performance.
1 GitHub repository in Artificial Intelligence & ML · Reinforcement Learning Toolkits.
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade