makemore is a character-level language model and text generation engine. It serves as an educational implementation of the architecture and training processes used in large language models, built as a neural network using the PyTorch framework.
The system demonstrates sequence prediction by learning the probability distributions of characters within a dataset to generate novel text strings. It implements this through a progression of techniques, including n-gram probability modeling and the use of automatic differentiation for weight optimization.
The project covers the full machine learning lifecycle for sequence prediction, from processing text corpora during model training to producing output via probabilistic sampling. This includes the application of softmax normalization and multinomial sampling to convert model outputs into predictable character sequences.