What are the best open-source alternatives to Latent Diffusion?

30 open-source projects similar to compvis/latent-diffusion, ranked by shared features. Top picks: lucidrains/dalle2-pytorch, stability-ai/stablecascade, kwai-kolors/kolors, huggingface/diffusion-models-class, facebookresearch/dit, lucidrains/imagen-pytorch, luosiallen/latent-consistency-model, levihsu/ootdiffusion, haoheliu/audioldm, hlky/stable-diffusion-webui.

Is lucidrains/dalle2-pytorch a good alternative to Latent Diffusion?

This is a PyTorch implementation of a text-to-image model designed for synthesizing high-fidelity images from natural language descriptions. It utilizes a diffusion image generator to transform latent embeddings into visual data through an iterative denoising process. The system employs a two-stag…

Is stability-ai/stablecascade a good alternative to Latent Diffusion?

StableCascade is a generative AI system and latent diffusion framework designed for text-to-image synthesis and image-to-image transformations. It utilizes a multi-stage cascade architecture that encodes and decodes images via a latent space to produce high-fidelity visual imagery. The system incl…

Is kwai-kolors/kolors a good alternative to Latent Diffusion?

Kolors is a generative model implementation for synthesizing photorealistic images from natural language descriptions and visual references. It utilizes a latent diffusion model framework to produce high-fidelity imagery, operating within a compressed latent space to improve generation efficiency a…

Is huggingface/diffusion-models-class a good alternative to Latent Diffusion?

This project is an educational course and collection of training materials focused on generative diffusion models. It provides a curriculum and practical guides for training, fine-tuning, and deploying models capable of synthesizing images, audio, and video. The material covers specific implementa…

Is facebookresearch/dit a good alternative to Latent Diffusion?

DiT is a latent diffusion model and transformer-based generative AI framework implemented in PyTorch. It functions as a class-conditional image generator that replaces traditional convolutional backbones with a transformer architecture to synthesize high-fidelity images. The project utilizes patch…

Is lucidrains/imagen-pytorch a good alternative to Latent Diffusion?

This is a PyTorch-based implementation of diffusion models for synthesizing photorealistic images and video. It provides a framework for text-to-image and text-to-video generation, as well as unconditional image synthesis. The system utilizes a cascading diffusion pipeline to produce high-resoluti…

Is luosiallen/latent-consistency-model a good alternative to Latent Diffusion?

This project is a framework for training consistency models and performing diffusion model distillation. It functions as a few-step text-to-image generator and an image-to-image transformation tool designed to produce high-resolution visuals from text prompts or existing images. The system focuses…

Is levihsu/ootdiffusion a good alternative to Latent Diffusion?

OOTDiffusion is an AI virtual try-on system designed for controllable image synthesis. It generates images of people wearing specific clothing items by superimposing garments onto human figures for both half-body and full-body compositions. The project facilitates digital fashion prototyping and v…

Is haoheliu/audioldm a good alternative to Latent Diffusion?

AudioLDM is a latent diffusion framework for generating high-fidelity audio, music, and sound effects. It functions as a text-to-audio generator that converts natural language descriptions into synthetic audio signals with control over pitch and environment. The system provides specialized tools f…

Is hlky/stable-diffusion-webui a good alternative to Latent Diffusion?

Stable Diffusion Web UI is a browser-based interface for generating, editing, and upscaling images and videos using latent diffusion models. It functions as a text-to-image generator, an AI image editor, and a tool for increasing image resolution and clarity. The system includes capabilities for c…

Back to compvis/latent-diffusion

Open-source alternatives to Latent Diffusion

30 open-source projects similar to compvis/latent-diffusion, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Latent Diffusion alternative.

lucidrains/dalle2-pytorch
lucidrains/DALLE2-pytorch
11,310View on GitHub
This is a PyTorch implementation of a text-to-image model designed for synthesizing high-fidelity images from natural language descriptions. It utilizes a diffusion image generator to transform latent embeddings into visual data through an iterative denoising process. The system employs a two-stage latent mapping process, using a CLIP-based latent prior to map text embeddings to image embeddings before decoding them into pixels. It features a cascading diffusion decoder that produces high-resolution imagery by passing low-resolution outputs through a sequence of models at increasing scales.
Pythonartificial-intelligencedeep-learningtext-to-image
View on GitHub11,310
stability-ai/stablecascade
Stability-AI/StableCascade
6,548View on GitHub
StableCascade is a generative AI system and latent diffusion framework designed for text-to-image synthesis and image-to-image transformations. It utilizes a multi-stage cascade architecture that encodes and decodes images via a latent space to produce high-fidelity visual imagery. The system includes a cascade diffusion pipeline for controlling image structure through inpainting, outpainting, and super-resolution. It also provides a toolkit for image-to-image generation and the creation of image variations using embeddings. The framework supports model optimization through low-rank adaptati
Jupyter Notebook
View on GitHub6,548
kwai-kolors/kolors
Kwai-Kolors/Kolors
4,607View on GitHub
Kolors is a generative model implementation for synthesizing photorealistic images from natural language descriptions and visual references. It utilizes a latent diffusion model framework to produce high-fidelity imagery, operating within a compressed latent space to improve generation efficiency and quality. The system functions as a multilingual image generator, interpreting text prompts in multiple languages to produce semantically accurate visual outputs. It includes a custom model training pipeline that uses low-rank adaptation to teach the model specific subjects or artistic styles from
Python
View on GitHub4,607

Open-source alternatives to Latent Diffusion

lucidrains/DALLE2-pytorch

Stability-AI/StableCascade

Kwai-Kolors/Kolors

huggingface/diffusion-models-class

facebookresearch/DiT

lucidrains/imagen-pytorch

luosiallen/latent-consistency-model

levihsu/OOTDiffusion

haoheliu/AudioLDM

hlky/stable-diffusion-webui

ali-vilab/VACE

Stability-AI/generative-models

CompVis/stable-diffusion

XavierXiao/Dreambooth-Stable-Diffusion

yisol/IDM-VTON

hojonathanho/diffusion

huggingface/notebooks

CompVis/taming-transformers

timothybrooks/instruct-pix2pix

lucidrains/DALLE-pytorch

Wan-Video/Wan2.1

kohya-ss/sd-scripts

Sygil-Dev/sygil-webui

Tencent-Hunyuan/HunyuanDiT

bytedance/LatentSync

huggingface/diffusers

leejet/stable-diffusion.cpp

openai/shap-e

Picsart-AI-Research/Text2Video-Zero

zai-org/CogVideo