# thudm/glm-130b

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/thudm-glm-130b).**

7,649 stars · 603 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/THUDM/GLM-130B
- awesome-repositories: https://awesome-repositories.com/repository/thudm-glm-130b.md

## Description

GLM-130B is a pre-trained foundation model and bilingual large language model designed for natural language processing tasks in both English and Chinese. It functions as an autoregressive language model and text generator capable of producing long-form content and predicting missing phrases.

The model utilizes an autoregressive blank-filling architecture and a bidirectional dense transformer to process text. This approach allows the system to transition between understanding context through masked language modeling and generating sequential text using specific mask tokens.

The project covers capabilities for bilingual text generation, high-performance model inference, and large language model evaluation. It supports hardware-specific quantization to reduce memory usage and increase inference speed, alongside a configuration-driven system for measuring performance across various datasets.

## Tags

### Artificial Intelligence & ML

- [Bilingual Language Models](https://awesome-repositories.com/f/artificial-intelligence-ml/bilingual-language-models.md) — Functions as a large-scale model trained for high proficiency and natural dialogue in English and Chinese.
- [Bilingual Text Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/bilingual-text-generation.md) — A system for producing long-form content and predicting missing phrases in English and Chinese.
- [Long-Form Text Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/long-form-text-generation.md) — Produces sequential, long-form text from left-to-right using generative mask triggers. ([source](https://github.com/thudm/glm-130b#readme))
- [Masked Language Modeling](https://awesome-repositories.com/f/artificial-intelligence-ml/masked-language-modeling.md) — Employs training techniques to predict randomly hidden tokens within a sequence to learn semantic relationships.
- [Natural Language Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing.md) — Performs comprehensive natural language processing tasks across both English and Chinese languages. ([source](https://github.com/thudm/glm-130b#readme))
- [Masked](https://awesome-repositories.com/f/artificial-intelligence-ml/text-generation-strategies/token-prediction/masked.md) — Predicts missing words or phrases within a sentence by identifying and replacing mask tokens. ([source](https://github.com/thudm/glm-130b#readme))
- [Transformer Language Models](https://awesome-repositories.com/f/artificial-intelligence-ml/transformer-language-models.md) — Implements a transformer-based language model for sequential text generation.
- [Bidirectional Processing Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/bidirectional-processing-architectures.md) — Utilizes a bidirectional processing architecture to analyze text sequences in both directions for deeper contextual understanding.
- [Bilingual Embeddings](https://awesome-repositories.com/f/artificial-intelligence-ml/bilingual-language-models/bilingual-tokenizers/bilingual-embeddings.md) — Maps English and Chinese characters into a shared high-dimensional vector space for cross-lingual semantic understanding.
- [High-Throughput Text Inference](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/inference-runtimes/high-performance-ai-inference/high-throughput-text-inference.md) — Optimizes model execution for high-volume text generation with low computational latency.
- [Inference Acceleration Techniques](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-inference-serving/inference-optimization/inference-acceleration-techniques.md) — Increases text generation speed through hardware-specific optimizations and quantization techniques. ([source](https://github.com/thudm/glm-130b#readme))
- [Open-Weights Models](https://awesome-repositories.com/f/artificial-intelligence-ml/open-weights-models.md) — Provides a pre-trained model with publicly available weights for deployment and custom NLP tasks.
- [Precision Quantization](https://awesome-repositories.com/f/artificial-intelligence-ml/precision-quantization.md) — Employs hardware-specific quantization to reduce model weight precision, lowering memory usage and accelerating inference.
- [Autoregressive Text Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/sequence-generation/autoregressive-text-generation.md) — Implements a text generation system that predicts tokens sequentially by feeding previous outputs back into the model.
- [Generative Mask Tokens](https://awesome-repositories.com/f/artificial-intelligence-ml/text-generation-strategies/token-prediction/masked/generative-mask-tokens.md) — Uses specific mask tokens to trigger the transition from blank-filling mode to sequential text generation.

### Part of an Awesome List

- [Pre-trained Models](https://awesome-repositories.com/f/awesome-lists/ai/pre-trained-models.md) — Serves as a large-scale pre-trained architecture ready for downstream fine-tuning and general-purpose text generation.
- [Decoder Models](https://awesome-repositories.com/f/awesome-lists/ai/decoder-models.md) — Bilingual open-source pre-trained language model.
- [Foundation Models](https://awesome-repositories.com/f/awesome-lists/ai/foundation-models.md) — Large-scale bilingual pre-trained language model.
- [Large Language Models](https://awesome-repositories.com/f/awesome-lists/ai/large-language-models.md) — High-performance model optimized for limited compute environments.
- [Open Source Models](https://awesome-repositories.com/f/awesome-lists/ai/open-source-models.md) — Open bilingual pre-trained model with high parameter count.
