# mikubill/sd-webui-controlnet

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/mikubill-sd-webui-controlnet).**

17,853 stars · 2,014 forks · Python · GPL-3.0

## Links

- GitHub: https://github.com/Mikubill/sd-webui-controlnet
- awesome-repositories: https://awesome-repositories.com/repository/mikubill-sd-webui-controlnet.md

## Description

This project is an extension for Stable Diffusion that provides an image-to-image control framework. It serves as a multi-control constraint manager and structural data preprocessor, allowing users to guide the layout and composition of generated images through spatial maps and structural constraints.

The system enables multi-constraint image generation by combining several different control inputs to enforce multiple stylistic or spatial rules within a single generation pass. It provides tools for visual image referencing and precise geometric or anatomical templating to ensure generated images follow a specific visual reference.

The framework covers automated image preprocessing to extract control maps, as well as tuning for guidance strength, timing, and weight balancing between text prompts and spatial references. It also includes capabilities for high-resolution image upscaling to maintain structural consistency.

Programmatic access is provided via API integration for managing control units, executing preprocessors, and triggering generation requests.

## Tags

### Artificial Intelligence & ML

- [Image Composition Controls](https://awesome-repositories.com/f/artificial-intelligence-ml/image-composition-controls.md) — Provides a comprehensive framework for managing spatial layout and composition using external maps and visual references.
- [Multi-Control Synthesis](https://awesome-repositories.com/f/artificial-intelligence-ml/multi-control-synthesis.md) — Manages multiple parallel control networks to apply simultaneous spatial and stylistic constraints in a single generation pass.
- [Attention Layer Injectors](https://awesome-repositories.com/f/artificial-intelligence-ml/attention-mechanisms/attention-layer-injectors.md) — Modifies internal attention layers during the denoising process to inject precise spatial control signals.
- [Control Map Extractors](https://awesome-repositories.com/f/artificial-intelligence-ml/control-map-extractors.md) — Extracts structural control maps and annotations from images to prepare them for guidance in diffusion models.
- [Diffusion Structural Control](https://awesome-repositories.com/f/artificial-intelligence-ml/diffusion-structural-control.md) — Extracts structural data and creates control maps to guide the spatial layout of AI-generated images.
- [Multi-Constraint Orchestration](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-configurations/generation-constraints/multi-constraint-orchestration.md) — Enables the combination of several different control inputs to enforce multiple stylistic or spatial rules simultaneously.
- [Multi-Constraint Integration](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-configurations/multi-constraint-integration.md) — Combines several different control inputs into a single generation to enforce multiple spatial and stylistic constraints. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Generation Controls](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/decoding-generation-controls/generation-controls.md) — Sets the influence strength and specific generation steps where spatial guidance is applied. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Precise Guidance Templates](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-models/diffusion-models/precise-guidance-templates.md) — Ensures generated images follow strict geometric or anatomical templates using visual references and preprocessors.
- [Guidance Weight Balancers](https://awesome-repositories.com/f/artificial-intelligence-ml/guidance-weight-balancers.md) — Adjusts the relative importance between text prompts and control maps to prioritize prompt adherence or structural guidance. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Guidance Weight Interpolators](https://awesome-repositories.com/f/artificial-intelligence-ml/model-weight-management/dynamic-weight-updates/guidance-weight-interpolators.md) — Implements dynamic weight blending to balance the influence of text prompts against structural constraints.
- [Denoising Step Controllers](https://awesome-repositories.com/f/artificial-intelligence-ml/step-based-schedulers/step-execution-engines/execution-step-controllers/denoising-step-controllers.md) — Applies control signals only during specific denoising steps to balance structural fidelity and creative freedom.
- [Structural Guidance](https://awesome-repositories.com/f/artificial-intelligence-ml/structural-guidance.md) — Provides precise spatial guidance and structural control over the image generation process using specialized models. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Structural Image Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/structural-image-generation.md) — Guides the layout and composition of generated images through the use of spatial maps and structural constraints.
- [Visual Reference Guidance](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-models/diffusion-models/visual-reference-guidance.md) — Provides a way to guide image generation using an image as a direct visual reference via attention layer linking. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Image-to-Image Diffusion Toolkits](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-image-generators/image-inpainting/image-to-image-diffusion-toolkits.md) — Offers a comprehensive toolkit for triggering image-to-image and text-to-image diffusion tasks programmatically. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Image Super Resolution Models](https://awesome-repositories.com/f/artificial-intelligence-ml/image-super-resolution-models.md) — Maintains structural consistency and detail when scaling images using combined base and high-resolution control passes.
- [Resolution Scaling](https://awesome-repositories.com/f/artificial-intelligence-ml/inference-scaling/resolution-scaling.md) — Generates scaled control images for base and high-resolution passes to ensure structural consistency during upscaling. ([source](https://github.com/mikubill/sd-webui-controlnet#readme))
- [Latent Scaling Mechanisms](https://awesome-repositories.com/f/artificial-intelligence-ml/inference-scaling/resolution-scaling/latent-scaling-mechanisms.md) — Aligns control map resolutions precisely with the latent space dimensions of the generative model.

### Graphics & Multimedia

- [Structural Map Generators](https://awesome-repositories.com/f/graphics-multimedia/structural-map-generators.md) — Transforms input images into simplified structural maps using computer vision models to guide generative networks.

### Software Engineering & Architecture

- [Image Preprocessor Registries](https://awesome-repositories.com/f/software-engineering-architecture/integration-extensibility/third-party-service-connectors/built-in-integration-nodes/modular-extension-registries/image-preprocessor-registries.md) — Executes specific image preprocessors to generate structural control maps independently of the full generation process. ([source](https://github.com/Mikubill/sd-webui-controlnet/wiki/API))
