1 repo
Specialized tools for training transformer-based models across single or multi-GPU environments.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Large Language Model Training Frameworks. Refine with filters or upvote what's useful.
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi