# hiyouga/llamafactory

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/hiyouga-llamafactory).**

72,213 stars · 8,837 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/hiyouga/LlamaFactory
- Homepage: https://llamafactory.readthedocs.io
- awesome-repositories: https://awesome-repositories.com/repository/hiyouga-llamafactory.md

## Topics

`agent` `ai` `deepseek` `fine-tuning` `gemma` `gpt` `instruction-tuning` `large-language-models` `llama` `llama3` `llm` `lora` `moe` `nlp` `peft` `qlora` `quantization` `qwen` `rlhf` `transformers`

## Description

LlamaFactory is a unified framework for fine-tuning and adapting large language models. It provides a comprehensive platform that standardizes training workflows across diverse machine learning architectures, allowing users to execute both full-tuning and parameter-efficient methods through a single interface.

The project distinguishes itself by offering a low-code visual dashboard that enables users to configure experiments and monitor performance metrics in real time without writing extensive custom scripts. It also features a configuration-driven orchestration system that decouples experiment logic from the underlying execution engine, alongside an OpenAPI-compliant server that exposes trained models as standard network endpoints for integration with external software.

Beyond its core training capabilities, the platform supports real-time experiment tracking by streaming performance data to external monitoring services. This allows for the evaluation of model progress and the optimization of parameters throughout the development lifecycle. The software is designed to be installed and configured as a standalone environment for managing the end-to-end lifecycle of language model adaptation.

## Tags

### Artificial Intelligence & ML

- [Large Language Model Fine-Tuning Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/integrated-development-platforms/machine-learning-platforms/large-language-model-fine-tuning-frameworks.md) — Consolidates diverse optimization techniques into a single environment for end-to-end model adaptation and training.
- [Model Inference Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-inference-serving/engines-runtimes-servers/model-inference-servers.md) — Exposes trained models via standardized network protocols to facilitate scalable and reliable prediction services.
- [Language Model Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/fine-tuning-and-customization/language-model-fine-tuning.md) — Simplifies complex model refinement by offering a unified interface for both full-parameter and efficient training methods. ([source](https://cdn.jsdelivr.net/gh/hiyouga/LlamaFactory@main/README.md))
- [Experiment Tracking](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/training-monitoring-and-profiling/training-observability-systems/experiment-tracking.md) — Captures real-time performance metrics and training progress to assist in evaluating model quality during development. ([source](https://cdn.jsdelivr.net/gh/hiyouga/LlamaFactory@main/README.md))
- [Model Fine-Tuning and Adaptation](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/model-fine-tuning-adaptation.md) — Enables the customization of pre-trained models to specific domains or tasks through a streamlined training pipeline.
- [Multi-Backend Model Construction](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/model-construction/model-definition/multi-backend-model-construction.md) — Standardizes data loading and optimization logic across various hardware backends and model architectures.
- [Low-Code Machine Learning Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/integrated-development-platforms/machine-learning-platforms/low-code-machine-learning-tools.md) — Offers a visual interface that allows users to manage training workflows without writing extensive custom code.
- [Parameter Efficient Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/parameter-efficient-fine-tuning.md) — Reduces computational overhead by updating only a subset of model parameters during the adaptation process.
- [Integrated Development Platforms](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/integrated-development-platforms.md) — Bundles configuration, training, and monitoring tools into a single environment for the entire model lifecycle.
- [Machine Learning Training](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training.md) — Utilizes structured configuration files to decouple training parameters from execution logic, ensuring reproducible experiments.
- [Model Inference APIs](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/inference-servers-and-runtimes/model-inference-apis.md) — Deploys refined models as local API endpoints for immediate integration into external software applications.
- [Local Model Inference Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/local-and-on-device-inference/local-model-inference-servers.md) — Hosts models locally to serve low-latency predictions through standard network APIs. ([source](https://cdn.jsdelivr.net/gh/hiyouga/LlamaFactory@main/README.md))

### System Administration & Monitoring

- [Experiment Tracking Systems](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/ai-agent-observability/experiment-tracking-systems.md) — Streams training loop metrics to external systems for real-time visualization and comparative analysis.
