Diffusers is a PyTorch-based library and generative AI framework used to build, train, and deploy diffusion pipelines for producing multi-modal media. It provides a suite of tools for generating images, video, and audio from natural language descriptions, as well as specialized systems for text-to-image generation. The project differentiates itself through a modular architecture that separates noise schedulers, pretrained model blocks, and pipeline compositions. This structure allows for the construction of custom generation workflows and the ability to swap individual components of the diffu
ComfyUI-nunchaku is a 4-bit diffusion inference engine and a set of nodes for running low-precision quantized diffusion models within ComfyUI visual workflows. It provides a backend that reduces memory overhead and increases generation speed for transformer models. The project includes specialized tools for identity-preserving generation and an image-to-image guidance toolkit that uses depth maps and reference images. It also features a multimodal visual question answering implementation and a utility for merging multiple quantized model files into single unified files. The engine covers a b
DiffusionBee is a Stable Diffusion desktop client for macOS that functions as an AI image generator and editor. It allows for the local generation of images from text prompts and the management of diffusion models without requiring external cloud services or technical setup. The application includes a local diffusion model manager for importing and switching between custom trained model files to achieve specific artistic styles. It also features a system for tracking generation history and uploading assets to a public gallery. The software covers several image synthesis and manipulation work
IF is a text-to-image diffusion system that translates natural language descriptions into visual imagery. The project provides a generative pipeline for creating images, an inpainting tool for modifying specific image sections, and a super-resolution upscaler to increase pixel density and clarity. The system includes a concept fine-tuning framework that allows for the teaching of new visual concepts by updating a small set of parameters. It also supports image style transfer to apply the aesthetic characteristics of a reference image to a new output.
This project is an integrated software framework designed to facilitate generative image synthesis and high-performance model inference on Intel processor and graphics hardware. It provides a specialized inference engine that executes latent diffusion models to transform natural language descriptions into visual outputs.
The main features of bes-dev/stable_diffusion.openvino are: Stable Diffusion Inference Engines, Text-to-Image Generators, OpenVINO Inference Acceleration, Diffusion Pipelines, Image Inpainting, Image-to-Image Translation, Hardware Acceleration Kernels, Model Intermediate Representations.
Open-source alternatives to bes-dev/stable_diffusion.openvino include: huggingface/diffusers — Diffusers is a PyTorch-based library and generative AI framework used to build, train, and deploy diffusion pipelines… nunchaku-ai/comfyui-nunchaku — ComfyUI-nunchaku is a 4-bit diffusion inference engine and a set of nodes for running low-precision quantized… divamgupta/diffusionbee-stable-diffusion-ui — DiffusionBee is a Stable Diffusion desktop client for macOS that functions as an AI image generator and editor. It… deep-floyd/if — IF is a text-to-image diffusion system that translates natural language descriptions into visual imagery. The project… kohya-ss/sd-scripts — sd-scripts is a suite of utilities designed for fine-tuning generative models, preprocessing datasets, and converting… lucidrains/imagen-pytorch — This is a PyTorch-based implementation of diffusion models for synthesizing photorealistic images and video. It…