What are the best open-source alternatives to Stable Diffusion?

30 open-source projects similar to compvis/stable-diffusion, ranked by shared features. Top picks: lucidrains/dalle2-pytorch, hpcaitech/open-sora, nvlabs/sana, stability-ai/generative-models, huggingface/diffusers, timothybrooks/instruct-pix2pix, wan-video/wan2.1, lucidrains/dalle-pytorch, sgl-project/sglang, salesforce/lavis.

Is lucidrains/dalle2-pytorch a good alternative to Stable Diffusion?

This is a PyTorch implementation of a text-to-image model designed for synthesizing high-fidelity images from natural language descriptions. It utilizes a diffusion image generator to transform latent embeddings into visual data through an iterative denoising process. The system employs a two-stag…

Is hpcaitech/open-sora a good alternative to Stable Diffusion?

Open-Sora is a video generation framework designed to produce cinematic sequences from text prompts and images. It functions as a generative system that transforms written descriptions or reference images into video content featuring realistic textures and lighting. The project includes a dedicate…

Is nvlabs/sana a good alternative to Stable Diffusion?

Sana is a framework for high-resolution image and video synthesis based on a linear diffusion transformer. It provides a toolkit for the training, fine-tuning, and execution of text-to-image and text-to-video models, as well as a video generative world model capable of simulating physical environme…

Is stability-ai/generative-models a good alternative to Stable Diffusion?

This is a framework for training and sampling diffusion models to generate high-fidelity images, video, and 4D assets. It provides a modular environment for managing generative AI training pipelines, including the handling of datasets, noise sampling, and loss weighting to stabilize the creation of…

Is huggingface/diffusers a good alternative to Stable Diffusion?

Diffusers is a PyTorch-based library and generative AI framework used to build, train, and deploy diffusion pipelines for producing multi-modal media. It provides a suite of tools for generating images, video, and audio from natural language descriptions, as well as specialized systems for text-to-…

Is timothybrooks/instruct-pix2pix a good alternative to Stable Diffusion?

Instruct-pix2pix is an instruction-based image model and PyTorch library designed to modify visual content by following natural language directions. It functions as a diffusion model image editor that applies human-written instructions to existing pictures rather than using traditional text-to-imag…

Is wan-video/wan2.1 a good alternative to Stable Diffusion?

Wan2.1 is a generative video synthesis framework that provides foundation models for creating high-fidelity video sequences and static images from descriptive text prompts. The system utilizes a unified architecture trained on both static and dynamic datasets, allowing it to function as a comprehen…

Is lucidrains/dalle-pytorch a good alternative to Stable Diffusion?

This project is a PyTorch implementation of a text-to-image transformer. It is a generative AI model designed to map discrete text tokens to image pixels using a transformer network to create visual content from textual descriptions. The system utilizes a discrete VAE image encoder to compress vis…

Is sgl-project/sglang a good alternative to Stable Diffusion?

Sglang is a high-performance inference engine and serving system designed for large language and multimodal models. It provides a programmable interface for orchestrating complex generation workflows, enabling developers to coordinate multi-turn dialogues, tool invocations, and reasoning chains thr…

Is salesforce/lavis a good alternative to Stable Diffusion?

LAVIS is a multimodal large language model framework and vision-language model library. It provides tools for training and evaluating models that integrate visual, textual, and audio data, serving as a cross-modal feature extractor and a zero-shot visual reasoning engine. The framework distinguish…

Back to compvis/stable-diffusion

Open-source alternatives to Stable Diffusion

30 open-source projects similar to compvis/stable-diffusion, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Stable Diffusion alternative.

lucidrains/dalle2-pytorch
lucidrains/DALLE2-pytorch
11,310View on GitHub
This is a PyTorch implementation of a text-to-image model designed for synthesizing high-fidelity images from natural language descriptions. It utilizes a diffusion image generator to transform latent embeddings into visual data through an iterative denoising process. The system employs a two-stage latent mapping process, using a CLIP-based latent prior to map text embeddings to image embeddings before decoding them into pixels. It features a cascading diffusion decoder that produces high-resolution imagery by passing low-resolution outputs through a sequence of models at increasing scales.
Pythonartificial-intelligencedeep-learningtext-to-image
View on GitHub11,310
hpcaitech/open-sora
hpcaitech/Open-Sora
29,101View on GitHub
Open-Sora is a video generation framework designed to produce cinematic sequences from text prompts and images. It functions as a generative system that transforms written descriptions or reference images into video content featuring realistic textures and lighting. The project includes a dedicated prompt engineering tool that uses large language models to expand simple user inputs into detailed descriptions. It also features a motion controller for adjusting movement intensity in generated sequences and evaluating motion levels in existing video files. The framework incorporates text-to-vid
Python
View on GitHub29,101
nvlabs/sana
NVlabs/Sana
8,310View on GitHub
Sana is a framework for high-resolution image and video synthesis based on a linear diffusion transformer. It provides a toolkit for the training, fine-tuning, and execution of text-to-image and text-to-video models, as well as a video generative world model capable of simulating physical environments with precise spatial control. The project is distinguished by its use of linear complexity layers to handle high resolutions and its support for long-form, minute-length video generation in real time. It implements a two-stage inference paradigm that separates structural generation from visual t
Python
View on GitHub8,310

Open-source alternatives to Stable Diffusion

lucidrains/DALLE2-pytorch

hpcaitech/Open-Sora

NVlabs/Sana

Stability-AI/generative-models

huggingface/diffusers

timothybrooks/instruct-pix2pix

Wan-Video/Wan2.1

lucidrains/DALLE-pytorch

sgl-project/sglang

salesforce/LAVIS

hlky/stable-diffusion-webui

levihsu/OOTDiffusion

CompVis/latent-diffusion

Stability-AI/StableCascade

Sygil-Dev/sygil-webui

borisdayma/dalle-mini

haoheliu/AudioLDM

Tencent-Hunyuan/HunyuanImage-3.0

XavierXiao/Dreambooth-Stable-Diffusion

huggingface/notebooks

leejet/stable-diffusion.cpp

deep-floyd/IF

lllyasviel/IC-Light

comfyanonymous/ComfyUI

zhaochenyang20/Awesome-ML-SYS-Tutorial

Wan-Video/Wan2.2

opencv/opencv

ml-explore/mlx-examples

microsoft/unilm

vercel/vercel