# openlm-research/open_llama

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/openlm-research-open-llama).**

7,526 stars · 405 forks · Apache-2.0

## Links

- GitHub: https://github.com/openlm-research/open_llama
- awesome-repositories: https://awesome-repositories.com/repository/openlm-research-open-llama.md

## Description

Open Llama is an open source large language model and pre-trained transformer designed as a permissively licensed alternative to proprietary weights. It serves as a base model reproduction of the Llama architecture, providing a set of weights for a decoder-only transformer.

The project provides a transparently trained model based on the RedPajama dataset, supporting unrestricted commercial and research use. It includes systems for serving pre-trained weights in various sizes.

The project covers natural language processing research and performance benchmarking through text quality evaluation harnesses. It is designed for integration with deep learning frameworks and research pipelines.

## Tags

### Artificial Intelligence & ML

- [Transformer Architecture Implementation](https://awesome-repositories.com/f/artificial-intelligence-ml/transformer-architecture-implementation.md) — Provides a complete implementation of the Llama architecture as a permissively licensed alternative to proprietary weights.
- [Large Language Models](https://awesome-repositories.com/f/artificial-intelligence-ml/language-model-orchestration/large-language-models.md) — Ships a pre-trained base model designed for downstream research and fine-tuning applications.
- [Transformer Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/model-construction/transformer-architectures.md) — Implements a transformer-based architecture using stacked self-attention layers and feed-forward networks.
- [Natural Language Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing.md) — Serves as a reproducible baseline for developing and testing new natural language processing techniques.
- [Causal Language Modeling](https://awesome-repositories.com/f/artificial-intelligence-ml/text-generation-strategies/token-prediction/causal-language-modeling.md) — Implements the architectural objective of predicting the next token in a sequence by masking future tokens.
- [Deep Learning Framework Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-framework-implementations.md) — Ensures compatibility between model implementations and industry-standard libraries like PyTorch and JAX. ([source](https://github.com/openlm-research/open_llama/blob/main/README.md))
- [Deep Learning Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-pipelines.md) — Provides compatibility with deep learning pipelines to integrate the model into existing research workflows.
- [Text](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-content-apis/quality-evaluators/text.md) — Provides standardized harnesses to measure the accuracy and performance of generated text. ([source](https://github.com/openlm-research/open_llama#readme))
- [LLM Benchmarking](https://awesome-repositories.com/f/artificial-intelligence-ml/large-language-models/llm-benchmarking.md) — Includes standardized evaluation harnesses to measure text generation quality and accuracy against other models.
- [Rotary Positional Embeddings](https://awesome-repositories.com/f/artificial-intelligence-ml/positional-embedding-techniques/rotary-positional-embeddings.md) — Uses rotary positional embeddings to encode relative token positions through vector rotation in a complex plane.
- [RMS Normalizations](https://awesome-repositories.com/f/artificial-intelligence-ml/rms-normalizations.md) — Employs root mean square normalization to stabilize internal activations and improve training convergence.

### Part of an Awesome List

- [Data Curation](https://awesome-repositories.com/f/awesome-lists/ai/data-curation.md) — Utilizes the RedPajama dataset through rigorous filtering and refinement to ensure a high-quality training corpus.
- [Open Source Models](https://awesome-repositories.com/f/awesome-lists/ai/open-source-models.md) — Provides a transparently trained model that allows for unrestricted commercial and research use.
- [Pre-trained Models](https://awesome-repositories.com/f/awesome-lists/ai/pre-trained-models.md) — Supplies a set of pre-trained weights for a decoder-only transformer based on large-scale data.
- [KV Cache Management](https://awesome-repositories.com/f/awesome-lists/ai/kv-cache-management.md) — Includes strategies for optimizing the key-value cache to avoid redundant computations during text generation.
- [Large Language Model Deployments](https://awesome-repositories.com/f/awesome-lists/ai/local-model-deployment/large-language-model-deployments.md) — Includes systems for serving pre-trained weights in various sizes for deployment on private hardware. ([source](https://github.com/openlm-research/open_llama#readme))
- [Natural Language Processing](https://awesome-repositories.com/f/awesome-lists/ai/natural-language-processing.md) — Listed in the “Natural Language Processing” section of the FunNLP awesome list.
