# compvis/stable-diffusion

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/compvis-stable-diffusion).**

73,125 stars · 10,599 forks · Jupyter Notebook · NOASSERTION

## Links

- GitHub: https://github.com/CompVis/stable-diffusion
- Homepage: https://ommer-lab.com/research/latent-diffusion-models/
- awesome-repositories: https://awesome-repositories.com/repository/compvis-stable-diffusion.md

## Description

Stable Diffusion is a generative machine learning pipeline that synthesizes high-resolution visual content by performing iterative denoising within a compressed latent space. By mapping natural language embeddings into pixel outputs through conditioned probabilistic processes, the framework enables the generation of images from text prompts and the transformation of existing visual inputs based on semantic instructions.

The architecture utilizes a modular execution environment that decouples model loading, scheduler logic, and inference components to support diverse hardware configurations. It distinguishes itself through a symmetric encoder-decoder backbone that preserves spatial information during refinement, alongside integrated safety filters and invisible watermarking for generated outputs.

The system provides a comprehensive suite of tools for latent space generative modeling, including capabilities for inpainting, outpainting, and style transfer. These functions are exposed through standardized interfaces, allowing for the integration of advanced diffusion-based inference into broader software workflows.

## Tags

### Artificial Intelligence & ML

- [Cross-Attention Mechanisms](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-architectures/cross-attention-mechanisms.md) — Aligns generated visual features with semantic input prompts by integrating text-derived embeddings into neural network layers.
- [Image Synthesis Models](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-capabilities/image-synthesis-models.md) — Leverages denoising autoencoders within latent representations to synthesize detailed visual content efficiently. ([source](https://ommer-lab.com/research/latent-diffusion-models/))
- [Denoising Schedulers](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-models/denoising-schedulers.md) — Manages the progressive transformation of latent noise into coherent images through configurable step-wise variance reduction.
- [Latent Space Generative Models](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-models/latent-space-generative-models.md) — Manipulates compressed latent representations to perform complex generative tasks on standard consumer hardware.
- [Text-to-Image Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-image-generators.md) — Converts natural language embeddings into high-resolution pixel outputs through conditioned probabilistic diffusion processes.
- [Latent Diffusion Models](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-models/latent-diffusion-models.md) — Executes iterative denoising inside a compressed latent space to produce high-fidelity visual results.
- [Text-to-Image Synthesis](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/generative-ai/text-to-image-synthesis.md) — Transforms natural language prompts into high-resolution imagery using sophisticated generative pipelines.
- [Generative Media Models](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/architectures/generative-media-models.md) — Maps pixel data into compact latent spaces to facilitate the synthesis of new visual media.
- [Model Inference and Serving](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-inference-serving.md) — Coordinates model loading, hardware acceleration, and output processing to streamline production-ready inference.
- [Image Diffusion Models](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/image-diffusion-models.md) — Creates structured visual patterns by iteratively refining noise through a specialized generative machine learning pipeline. ([source](https://cdn.jsdelivr.net/gh/CompVis/stable-diffusion@main/README.md))
- [Generative Model Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/generative-ai/generative-model-integrations.md) — Exposes modular interfaces that allow developers to embed iterative denoising inference capabilities directly into custom software.

### Graphics & Multimedia

- [Generative Image Engines](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing-workflows/generative-visual-engines/generative-image-engines.md) — Applies guided noise injection and iterative refinement to generate high-resolution visual content.

### DevOps & Infrastructure

- [Modular](https://awesome-repositories.com/f/devops-infrastructure/cicd-pipeline-automation/cicd-pipeline-management/pipeline-orchestration/modular.md) — Decouples model loading, scheduler logic, and inference execution into interchangeable components for flexible workflow integration.

### Part of an Awesome List

- [Computer Vision](https://awesome-repositories.com/f/awesome-lists/ai/computer-vision.md) — Latent diffusion models for text-to-image generation.
- [Foundation Models](https://awesome-repositories.com/f/awesome-lists/ai/foundation-models.md) — Latent diffusion model for text-to-image generation.
- [Text to Image](https://awesome-repositories.com/f/awesome-lists/more/text-to-image.md) — Listed in the “Text to Image” section of the The Incredible Pytorch awesome list.
