1 repo
High-performance engines built to accelerate the training of transformer-based neural networks.
Explore 1 GitHub repository matching Artificial Intelligence & ML · Transformer Training Engines.
nanoGPT is a lightweight engine for training transformer-based language models from scratch and for fine-tuning existing ones. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, using self-attention and feed-forward layers to process sequences and predi
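To make the self-attention and feed-forward layers mentioned above concrete, here is a minimal sketch of one transformer block in plain Python. It is an illustration only and is not taken from the nanoGPT codebase: the function names (`causal_self_attention`, `feed_forward`, `block`) and weight arguments are hypothetical, it uses a single attention head, ReLU in place of the GELU typical of GPT models, and it omits layer normalization and dropout for brevity.

```python
import math


def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(W, x):
    # Multiply matrix W (list of rows) by vector x.
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def causal_self_attention(xs, Wq, Wk, Wv):
    # xs is a list of T token vectors. Position t attends only to
    # positions 0..t, so later tokens cannot influence earlier ones.
    qs = [matvec(Wq, x) for x in xs]
    ks = [matvec(Wk, x) for x in xs]
    vs = [matvec(Wv, x) for x in xs]
    scale = 1.0 / math.sqrt(len(qs[0]))
    out = []
    for t, q in enumerate(qs):
        scores = [scale * sum(a * b for a, b in zip(q, ks[s]))
                  for s in range(t + 1)]
        weights = softmax(scores)
        out.append([sum(w * vs[s][d] for s, w in enumerate(weights))
                    for d in range(len(vs[0]))])
    return out


def feed_forward(x, W1, W2):
    # Position-wise two-layer network (ReLU here is a simplification).
    hidden = [max(0.0, h) for h in matvec(W1, x)]
    return matvec(W2, hidden)


def block(xs, Wq, Wk, Wv, W1, W2):
    # Attention and feed-forward, each wrapped in a residual connection.
    attn = causal_self_attention(xs, Wq, Wk, Wv)
    xs = [[a + b for a, b in zip(x, y)] for x, y in zip(xs, attn)]
    return [[a + b for a, b in zip(x, feed_forward(x, W1, W2))] for x in xs]
```

A real training engine stacks many such blocks, runs them as batched tensor operations on a GPU, and adds a final projection to vocabulary logits; the sketch only shows how information flows through a single block.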