# mlabonne/llm-course

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/mlabonne-llm-course).**

80,178 stars · 9,340 forks · Apache-2.0

## Links

- GitHub: https://github.com/mlabonne/llm-course
- Homepage: https://mlabonne.github.io/blog/
- awesome-repositories: https://awesome-repositories.com/repository/mlabonne-llm-course.md

## Topics

`course` `large-language-models` `llm` `machine-learning` `roadmap`

## Description

This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as well as the practical implementation of supervised instruction fine-tuning and preference-based model alignment.

The repository distinguishes itself by providing a deep dive into advanced model composition and optimization techniques. It details methodologies for weight-space model merging and mixture-of-experts strategies, alongside practical guidance on low-precision parameter quantization and inference optimization to manage hardware requirements. Furthermore, it explores the development of autonomous agentic systems capable of tool-use orchestration and the construction of retrieval-augmented generation pipelines to ground model outputs in external data.

The content spans the entire technical stack, from foundational deep learning concepts and neural network design to the complexities of deploying, evaluating, and securing models in production environments. It includes a curated collection of technical articles, blog posts, and interactive notebooks that track state-of-the-art research trends and experimental methodologies in generative artificial intelligence.

## Tags

### Artificial Intelligence & ML

- [Large Language Models](https://awesome-repositories.com/f/artificial-intelligence-ml/language-model-orchestration/large-language-models.md) — Serves as a comprehensive educational resource regarding the lifecycle and application of large language models. ([source](https://mlabonne.github.io/blog/))
- [Retrieval Augmented Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/language-model-orchestration/retrieval-augmented-generation.md) — Explains how to ground model responses in external data sources to improve factual accuracy.
- [Transformer](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/architectures/transformer.md) — Details the mechanics of stacked attention layers used to process sequences and capture long-range dependencies.
- [Fine-Tuning Strategies](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training/fine-tuning-and-alignment/fine-tuning-strategies.md) — Covers end-to-end processes for adapting pre-trained models through supervised learning and preference alignment.
- [Preference-Based Model Alignments](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training/fine-tuning-and-alignment/preference-based-model-alignments.md) — Outlines techniques for refining model behavior using human feedback and reward signals to improve safety.
- [Supervised Instruction Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training/fine-tuning-and-alignment/supervised-instruction-fine-tuning.md) — Demonstrates how to adapt base models to specific task formats using curated instruction datasets.
- [Supervised Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/supervised-fine-tuning.md) — Describes methods for refining pre-trained models on curated datasets to improve task-specific performance. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [RAG Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/language-model-orchestration/retrieval-augmented-generation/rag-pipelines.md) — Augments model outputs by dynamically retrieving and integrating relevant external documents from vector databases.
- [Inference Optimization Techniques](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/inference-optimization-and-tuning/inference-optimization-techniques.md) — Implements efficient attention mechanisms and optimization strategies to maximize inference throughput.
- [Machine Learning Training](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training.md) — Guides users through post-training pipelines to optimize model behavior and reduce toxicity. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Model Merging Strategies](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/architecture-and-operations/model-architecture/model-merging-strategies.md) — Explores strategies for combining multiple specialized model weights into a single unified architecture.
- [Neural Networks](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/machine-learning-concepts/network-architectures-and-layers/neural-networks.md) — Clarifies the design and mechanics of neural networks as the foundational architecture for deep learning. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Post-Training Datasets](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/machine-learning-datasets/post-training-datasets.md) — Organizes specialized datasets tailored for supervised fine-tuning and alignment processes to refine model behavior. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Quantization Methods](https://awesome-repositories.com/f/artificial-intelligence-ml/model-optimization/quantization/quantization-methods.md) — Details methods for reducing memory footprints by mapping high-precision weights to lower-bit integer representations. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Multi-Agent Orchestration Systems](https://awesome-repositories.com/f/artificial-intelligence-ml/multi-agent-orchestration-systems.md) — Facilitates autonomous task execution by teaching models to reason about environments and invoke external APIs.
- [Weight-Space Merging Techniques](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/architectures/instruction-tuned-language-models/weight-space-merging-techniques.md) — Demonstrates techniques for integrating multiple fine-tuned model checkpoints into a singular, unified architecture without additional training cycles.
- [Agentic Systems Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks.md) — Architects frameworks for building and managing autonomous systems capable of independent decision-making and tool interaction.
- [Agentic Reasoning Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/agent-orchestration-multi-agent/autonomous-agents/agentic-reasoning-frameworks.md) — Illustrates how models can autonomously reason and utilize external tools to execute complex, multi-step tasks within dynamic environments. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Large Language Model Training Resources](https://awesome-repositories.com/f/artificial-intelligence-ml/artificial-intelligence-research/large-language-model-training-resources.md) — Presents technical documentation and practical strategies for the fine-tuning, inference, and efficient training of large-scale models. ([source](https://mlabonne.github.io/blog/))

### Education & Learning Resources

- [LLM Engineering Guides](https://awesome-repositories.com/f/education-learning-resources/educational-resources/systems-applied-computing/machine-learning-education/llm-engineering-guides.md) — Offers practical documentation for building, fine-tuning, and deploying modern language models.
- [Model Fine-Tuning Guides](https://awesome-repositories.com/f/education-learning-resources/curricula-instructional-design/curricula-roadmaps/ai-machine-learning-roadmaps/generative-ai-curricula/model-fine-tuning-guides.md) — Supplies educational notebooks and guides explaining implementation techniques for fine-tuning machine learning models. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Machine Learning](https://awesome-repositories.com/f/education-learning-resources/developer-documentation-references/knowledge-bases/machine-learning.md) — Consolidates essential theoretical foundations, research insights, and technical methodologies required to master modern deep learning and language processing systems.
- [Educational Curriculum Repositories](https://awesome-repositories.com/f/education-learning-resources/educational-resources/courses-training-certifications/courses-structured-learning/computer-science-curricula/educational-curriculum-repositories.md) — Provides a structured collection of learning materials covering the entire lifecycle of modern language models.
- [Large Language Model Tutorials](https://awesome-repositories.com/f/education-learning-resources/educational-resources/reference-and-media/tutorials-media-curated-lists/technical-tutorials/machine-learning-ai/large-language-model-tutorials.md) — Curates technical tutorials covering the architecture, fine-tuning, and deployment of language models. ([source](https://mlabonne.github.io/blog/))
- [Local Model Execution](https://awesome-repositories.com/f/education-learning-resources/educational-resources/systems-applied-computing/machine-learning-education/llm-engineering-guides/local-model-execution.md) — Shares best practices for successfully executing large language models on local hardware. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Large Language Model Architectures](https://awesome-repositories.com/f/education-learning-resources/educational-resources/systems-applied-computing/machine-learning-education/large-language-model-architectures.md) — Breaks down the structural components and operational mechanics of transformer-based models, from tokenization to final output generation. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Machine Learning Mathematics](https://awesome-repositories.com/f/education-learning-resources/machine-learning-curricula/machine-learning-mathematics.md) — Explains the core mathematical principles, such as linear algebra, that underpin the functionality of modern machine learning algorithms. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
- [Retrieval Augmented Generation Guides](https://awesome-repositories.com/f/education-learning-resources/retrieval-augmented-generation-guides.md) — Walks through the implementation of complex retrieval pipelines that leverage databases and external APIs to enhance model responses. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))

### Repository Format

- [Awesome List](https://awesome-repositories.com/f/repository-format/awesome-list.md) — A community-curated directory that catalogs and links out to other open-source projects, rather than a standalone tool you run yourself.

### Software Engineering & Architecture

- [AI Research Repositories](https://awesome-repositories.com/f/software-engineering-architecture/project-management-governance/repository-maintenance/repository-identities/ai-research-repositories.md) — Synthesizes state-of-the-art methodologies and experimental trends for building and refining generative models.

### Part of an Awesome List

- [AI and Data Science](https://awesome-repositories.com/f/awesome-lists/ai/ai-and-data-science.md) — Practical course for LLM development.
- [Large Language Models](https://awesome-repositories.com/f/awesome-lists/ai/large-language-models.md) — Structured roadmap for learning LLM development from basics to advanced.
- [Educational Resources](https://awesome-repositories.com/f/awesome-lists/learning/educational-resources.md) — Comprehensive roadmap and notebooks for learning about language models.
- [Learning and Reference](https://awesome-repositories.com/f/awesome-lists/learning/learning-and-reference.md) — Comprehensive course on large language models.

### Security & Cryptography

- [LLM Security](https://awesome-repositories.com/f/security-cryptography/security/ai-and-machine-learning/llm-security.md) — Identifies critical vulnerabilities and mitigation strategies for common threats like prompt injection and adversarial attacks in language models. ([source](https://cdn.jsdelivr.net/gh/mlabonne/llm-course@main/README.md))
