ComfyUI GGUF | Awesome Repository

ComfyUI-GGUF is a model loader and memory optimizer for ComfyUI that runs large transformer-based generative models from quantized weights. It loads GGUF-formatted weights inside the node-based diffusion interface to reduce GPU memory consumption.
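As a rough illustration of what loading a GGUF file starts with, the sketch below reads the fixed-size header that opens a GGUF file (format versions 2 and 3): a 4-byte magic, a 32-bit version, and two 64-bit counts, all little-endian. The function name read_gguf_header and the synthetic in-memory file are assumptions made for this example; this is not the project's own loader code.

```python
import io
import struct

GGUF_MAGIC = b"GGUF"

def read_gguf_header(stream):
    """Read the fixed-size GGUF header (layout of format versions 2 and 3).

    Hypothetical helper for illustration; not part of ComfyUI-GGUF.
    """
    magic = stream.read(4)
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file: magic={magic!r}")
    # Little-endian: uint32 version, uint64 tensor count, uint64 metadata pair count.
    raw = stream.read(20)
    if len(raw) != 20:
        raise ValueError("truncated GGUF header")
    version, tensor_count, kv_count = struct.unpack("<IQQ", raw)
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }

# Synthetic header for demonstration; a real file would be opened with open(path, "rb").
fake_file = io.BytesIO(GGUF_MAGIC + struct.pack("<IQQ", 3, 2, 5))
header = read_gguf_header(fake_file)
print(header)
```

The metadata key/value pairs and tensor descriptors follow this header in the file; parsing them is omitted here to keep the sketch short.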

The project includes a quantization tool that converts standard model checkpoints into compressed binary formats, and a tensor fixer that restores missing keys and corrects architectures in binary model files. These utilities keep compressed models functional during inference on hardware with limited VRAM.
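To make the VRAM saving concrete, the following back-of-the-envelope sketch estimates weight storage for a few formats. The bits-per-weight figures follow from the GGUF block layouts (Q8_0 stores 32 int8 values plus one 16-bit scale, so 34 bytes per 32 weights; Q4_0 stores 32 4-bit values plus one 16-bit scale, so 18 bytes per 32 weights). The 12-billion parameter count is an arbitrary assumption, and the estimate ignores metadata and any tensors kept at higher precision.

```python
# Bits per weight implied by each storage format's block layout.
BITS_PER_WEIGHT = {
    "F16": 16.0,   # 2 bytes per weight
    "Q8_0": 8.5,   # 34 bytes per block of 32 weights
    "Q4_0": 4.5,   # 18 bytes per block of 32 weights
}

def bits_per_weight(fmt):
    return BITS_PER_WEIGHT[fmt]

def estimate_weight_bytes(n_params, fmt):
    """Approximate bytes needed to store n_params weights in the given format."""
    return n_params * bits_per_weight(fmt) / 8

# Hypothetical model size, chosen only for illustration.
N_PARAMS = 12_000_000_000

for fmt in BITS_PER_WEIGHT:
    gib = estimate_weight_bytes(N_PARAMS, fmt) / 2**30
    print(f"{fmt:5s} ~{gib:.1f} GiB")
```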

The framework supports low-memory inference by loading quantized diffusion models and text encoders. It handles on-the-fly precision recovery (dequantizing weights as they are needed) and weight mapping, so that performance is maintained while the total memory footprint shrinks.
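The idea of recovering precision on the fly can be sketched with a simplified Q8_0-style scheme: each block of 32 weights is stored as 32 small integers plus one half-precision scale, and the original values are approximated again by multiplying. This is a pure-Python illustration under stated assumptions, not the project's implementation; the function names are hypothetical, and the reference GGUF quantizer differs in details such as how rounding is performed.

```python
import math
import struct

BLOCK_SIZE = 32  # Q8_0 groups weights into blocks of 32

def to_fp16(x):
    """Round a Python float to the nearest half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

def quantize_block(values):
    """Quantize one block to (scale, int8-range list), Q8_0 style."""
    if len(values) != BLOCK_SIZE:
        raise ValueError("expected a block of 32 values")
    amax = max(abs(v) for v in values)
    scale = to_fp16(amax / 127.0)  # the scale is stored as fp16
    if scale == 0.0:
        return 0.0, [0] * BLOCK_SIZE
    # Clamp so that fp16 rounding of the scale cannot push a value past 127.
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return scale, q

def dequantize_block(scale, q):
    """Recover approximate weights: one multiply per stored integer."""
    return [scale * x for x in q]

# Demonstration on synthetic weights.
weights = [0.1 * math.sin(i) for i in range(BLOCK_SIZE)]
scale, q = quantize_block(weights)
restored = dequantize_block(scale, q)
max_err = max(abs(a - b) for a, b in zip(weights, restored))
print(f"scale={scale:.6f} max_abs_error={max_err:.6f}")
```

Because dequantization is a single multiply per weight, it can be performed just before a layer is used, which is what allows the weights to stay compressed in memory the rest of the time.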

Features

ComfyUI Custom Node Suites - Integrates compressed GGUF diffusion models and text encoders as custom nodes within ComfyUI workflows.
Diffusion Model Memory Optimizers - Provides a framework for running large transformer-based generative models using quantized weights.
Diffusion Models - Initializes and runs generative image synthesis models based on quantized diffusion architectures.
GGUF Execution - Implements optimized runtime execution and loading for models using the GGUF quantization format.
