2 repos
High-performance systems optimized for the execution, scaling, and fine-tuning of neural networks and transformer architectures, distinct from orchestration.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Model Training Engines. Refine with filters or upvote what's useful.
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade