# minbpe

karpathy/minbpe · 10,582 stars · 1,071 forks · Python · MIT

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

## Features

- **Toolkits**: Provides a complete toolkit for learning BPE merge rules and building a vocabulary from text corpora.
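
To make "learning BPE merge rules and building a vocabulary" concrete, here is a minimal sketch of byte-level BPE training. It is an illustration under stated assumptions, not minbpe's actual code: the names `get_pair_counts`, `merge`, and `train_bpe` are hypothetical, and the sketch assumes UTF-8 bytes as the 256 base tokens with no regex pre-splitting or special tokens.

```python
from collections import Counter


def get_pair_counts(ids):
    """Count how often each adjacent pair of token ids occurs."""
    return Counter(zip(ids, ids[1:]))


def merge(ids, pair, new_id):
    """Replace each non-overlapping occurrence of `pair` (left to right) with `new_id`."""
    out = []
    i = 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2  # skip both members of the merged pair
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, vocab_size):
    """Learn merge rules and a vocabulary from `text` (hypothetical helper).

    Assumes the base vocabulary is the 256 possible byte values, so
    `vocab_size` must be at least 256.
    """
    if vocab_size < 256:
        raise ValueError("vocab_size must be at least 256")
    ids = list(text.encode("utf-8"))
    merges = {}                                  # (id, id) -> new id
    vocab = {i: bytes([i]) for i in range(256)}  # id -> byte string
    for new_id in range(256, vocab_size):
        counts = get_pair_counts(ids)
        if not counts:
            break  # nothing left to merge
        pair = max(counts, key=counts.get)       # most frequent adjacent pair
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
        vocab[new_id] = vocab[pair[0]] + vocab[pair[1]]
    return merges, vocab


merges, vocab = train_bpe("aaabdaaabac", vocab_size=259)
```

With this input, the first rule learned merges the byte pair `(97, 97)` (that is, `"aa"`) into token `256`, because it is the most frequent adjacent pair. Each later merge operates on the already-merged sequence, which is how longer tokens are built up from shorter ones.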