← All repositories

rasbtLLMs-from-scratch

85,529 stars12,944 forksJupyter Notebookother1 view
amzn.to/4fqvn0D

LLMs From Scratch

Features

  • Language Model DevelopmentBuilding and training custom language models from scratch to understand the end-to-end lifecycle of data processing, architecture design, and optimization.
  • Backpropagation ImplementationsConstructs gradient-based optimization logic from first principles to expose the underlying mechanics of weight updates and loss minimization.
  • Deep Learning ImplementationsTranslating complex theoretical concepts into functional neural network code to gain a practical understanding of modern machine learning architectures.
  • Model Training FrameworksBuilding and training custom language models from scratch to understand the end-to-end lifecycle of data processing, architecture design, and optimization.
  • Educational Neural Network ImplementationsA pedagogical framework for building neural network components from first principles without relying on high-level abstraction libraries or frameworks.
  • Functional Model ArchitecturesDefines neural network components as pure mathematical transformations that process input tensors into output predictions without hidden side effects.
  • LLM Architecture TutorialsLearning the fundamental mechanics of large language models by building them from the ground up using accessible, educational code examples.
  • Machine Learning CurriculaA structured curriculum providing hands-on experience with the architecture, training, and implementation of large language models from the ground up.
  • Interactive NotebooksOrganizes complex technical concepts into sequential, executable code blocks that allow users to verify theoretical understanding through immediate practical implementation.
  • Technical Learning RepositoriesA collection of instructional materials and code examples designed to guide developers through the implementation of complex technical concepts.
  • Interactive Learning EnvironmentsUsing executable documents to experiment with algorithms and model components in a live development environment for deeper conceptual mastery.
  • Technical TutorialsExplore curated repositories containing instructional materials and documentation to learn specific technical concepts through guided examples and structured learning paths.
  • Video Courses[A 17-hour and 15-minute companion video course](https://www.manning.com/livevideo/master-and-build-large-language-models) where I code through each chapter of the book. The course is organized into chapters and sections
  • Low-Level Tensor LibrariesUtilizes low-level array manipulation libraries to implement neural network layers and mathematical operations without relying on high-level abstraction frameworks.
  • Modular ArchitecturesIsolates distinct stages of model development into independent directories to ensure clear separation of concerns and simplified dependency management.