# paddlepaddle/paddleformers

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/paddlepaddle-paddleformers).**

12,981 stars · 2,192 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/PaddlePaddle/PaddleFormers
- awesome-repositories: https://awesome-repositories.com/repository/paddlepaddle-paddleformers.md

## Topics

`model`

## Description

PaddleFormers is a framework for the training, fine-tuning, and deployment of large language models. It provides a full lifecycle pipeline for executing large-scale model training and applying adaptation methods to align models with specialized tasks.

The project focuses on scaling model operations through distributed training and hardware accelerator integration. It employs pipeline parallelism and mixed-precision training to manage memory and increase throughput across multiple hardware devices.

The library includes a curated model zoo for serving pre-trained architectures and tools for production inference integration. It also provides data preparation utilities for chat templates and supports exporting model weights into standardized tensor formats for compatibility with external deployment engines.

## Tags

### Artificial Intelligence & ML

- [Language Model Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/fine-tuning-and-customization/language-model-fine-tuning.md) — Provides a comprehensive framework for training and fine-tuning large language models using specialized adaptation workflows.
- [Distributed Training](https://awesome-repositories.com/f/artificial-intelligence-ml/distributed-training-frameworks/distributed-training.md) — Scales model training across multiple hardware accelerators using data and model parallelism strategies.
- [Hardware Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/hardware-acceleration.md) — Integrates specialized hardware accelerators into training routines to improve computational throughput and memory efficiency. ([source](https://github.com/paddlepaddle/paddleformers#readme))
- [Large Language Model Serving](https://awesome-repositories.com/f/artificial-intelligence-ml/large-language-model-serving.md) — Enables the hosting and serving of large language models via APIs for text generation and inference tasks.
- [Hardware Acceleration](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-optimization-and-inference/hardware-and-acceleration/hardware-acceleration.md) — Interfaces with specialized hardware accelerators to optimize tensor operations and reduce model training time.
- [Large Language Model Training Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/training-frameworks/large-language-model-training-frameworks.md) — Provides a specialized framework for the pre-training and fine-tuning of transformer-based large language models. ([source](https://github.com/paddlepaddle/paddleformers#readme))
- [Parameter Efficient Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/parameter-efficient-fine-tuning.md) — Provides adaptation methods to update a small subset of model weights, aligning large models with specialized tasks efficiently.
- [Pipeline Parallelism Partitioners](https://awesome-repositories.com/f/artificial-intelligence-ml/pipeline-parallelism-partitioners.md) — Implements pipeline parallelism to partition large model tensors across multiple GPUs to manage memory and throughput.
- [Mixed Precision Training](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training/distributed-and-accelerated-compute/training-acceleration-tools/mixed-precision-training.md) — Employs mixed-precision training to reduce memory consumption and accelerate computation during the training process.
- [Model Export Formats](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-optimization-and-inference/serving-and-runtime/inference-optimization-utilities/model-export-formats.md) — Supports converting trained model weights into standardized industry formats for compatibility with external deployment engines.
- [Pre-trained Model Zoos](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/fine-tuning-and-customization/model-fine-tuning/pre-trained-model-zoos.md) — Maintains a curated model zoo of pre-trained architectures and weights for standardized loading and inference.
- [Model Weight Export Formats](https://awesome-repositories.com/f/artificial-intelligence-ml/model-weight-export-formats.md) — Saves trained model weights into standardized formats to ensure compatibility across different deep learning frameworks.

### Development Tools & Productivity

- [Inference Runtime Integrations](https://awesome-repositories.com/f/development-tools-productivity/third-party-integrations/inference-runtime-integrations.md) — Integrates with specialized inference engines to optimize the execution of models in production environments. ([source](https://github.com/paddlepaddle/paddleformers#readme))