What are the best open-source alternatives to ComfyUI WanVideoWrapper?

30 open-source projects similar to kijai/comfyui-wanvideowrapper, ranked by shared features. Top picks: thudm/cogvideo, tencent-hunyuan/hunyuanvideo-1.5, pku-yuangroup/open-sora-plan, hvision-nku/storydiffusion, guoyww/animatediff, zai-org/cogvideo, nvlabs/sana, skyworkai/skyreels-v2, hpcaitech/open-sora, ailab-cvc/videocrafter.

Is thudm/cogvideo a good alternative to ComfyUI WanVideoWrapper?

CogVideo is a generative video framework that uses diffusion models and transformer-based architectures to synthesize high-resolution video clips. It functions as both a text-to-video and image-to-video generator, converting textual descriptions or static images into temporal visual sequences. The…

Is tencent-hunyuan/hunyuanvideo-1.5 a good alternative to ComfyUI WanVideoWrapper?

HunyuanVideo-1.5 is a video generation foundation model and text-to-video diffusion framework. It utilizes a latent video diffusion model and a spatio-temporal transformer architecture to generate high-definition video sequences from text descriptions and images. The project enables cinematic came…

Is pku-yuangroup/open-sora-plan a good alternative to ComfyUI WanVideoWrapper?

Open-Sora-Plan is a text-to-video framework and distributed video training system. It utilizes a diffusion transformer architecture and large language model components to transform written descriptions or image prompts into high-quality video sequences. The system features a distributed infrastruc…

Is hvision-nku/storydiffusion a good alternative to ComfyUI WanVideoWrapper?

StoryDiffusion is a generative AI system designed for consistent character image and video generation. It utilizes a pluggable cross-attention module to inject shared character representations into pretrained diffusion models, allowing for visual identity stability across multiple images and scenes…

Is guoyww/animatediff a good alternative to ComfyUI WanVideoWrapper?

AnimateDiff is a latent diffusion video generator and text-to-video diffusion framework. It converts existing text-to-image diffusion models into animation generators by applying specialized motion modules, allowing for the creation of video sequences without modifying the original base model. The…

Is zai-org/cogvideo a good alternative to ComfyUI WanVideoWrapper?

CogVideo is a video generation framework and large language model architecture designed for synthesizing high-resolution video clips from natural language descriptions and images. It functions as a text-to-video and image-to-video generator, while also providing a model for video captioning to anal…

Is nvlabs/sana a good alternative to ComfyUI WanVideoWrapper?

Sana is a framework for high-resolution image and video synthesis based on a linear diffusion transformer. It provides a toolkit for the training, fine-tuning, and execution of text-to-image and text-to-video models, as well as a video generative world model capable of simulating physical environme…

Is skyworkai/skyreels-v2 a good alternative to ComfyUI WanVideoWrapper?

SkyReels-V2 is a video generation system that creates, extends, and refines video clips from text descriptions, images, or both. It operates as a diffusion-based video generation model that can produce videos of any duration by denoising frames sequentially, with each new frame conditioned on the o…

Is hpcaitech/open-sora a good alternative to ComfyUI WanVideoWrapper?

Open-Sora is a video generation framework designed to produce cinematic sequences from text prompts and images. It functions as a generative system that transforms written descriptions or reference images into video content featuring realistic textures and lighting. The project includes a dedicate…

Is ailab-cvc/videocrafter a good alternative to ComfyUI WanVideoWrapper?

VideoCrafter is a latent diffusion model designed for AI video synthesis. It functions as both a text-to-video and image-to-video generation system, synthesizing high-quality video sequences from descriptive text prompts or static image inputs. The model utilizes a diffusion-based neural network t…

Open-source alternatives to ComfyUI WanVideoWrapper

30 open-source projects similar to kijai/comfyui-wanvideowrapper, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best ComfyUI WanVideoWrapper alternative.
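
The listing does not say how "features in common" is counted. A minimal sketch, assuming each project carries a set of feature tags and the score is simply the size of the overlap with the target project's tags. The function name, the tags and the project names below are all hypothetical and are not taken from the listing:

```python
def rank_by_shared_features(target_features, candidates):
    """Return (name, shared_count) pairs, most shared features first."""
    target = set(target_features)
    scored = [
        (name, len(target & set(features)))
        for name, features in candidates.items()
    ]
    # Sort by shared count (descending), then by name for a stable tie-break.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical feature tags, for illustration only.
target = ["text-to-video", "image-to-video", "video-generation"]
candidates = {
    "project-a": ["text-to-video", "image-to-video", "video-generation"],
    "project-b": ["text-to-video", "video-captioning"],
    "project-c": ["image-upscaling"],
}
ranking = rank_by_shared_features(target, candidates)
```

A plain overlap count favors projects with many tags; a real ranking might normalize by tag count, but nothing in the listing confirms either choice.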

THUDM/CogVideo
12,792 stars
CogVideo is a generative video framework that uses diffusion models and transformer-based architectures to synthesize high-resolution video clips. It functions as both a text-to-video and image-to-video generator, converting textual descriptions or static images into temporal visual sequences. The system integrates large language model capabilities to expand short user prompts into detailed descriptions for better visual alignment. It supports the animation of static images through latent seeding and provides the ability to extend the length of existing video sequences. The project includes…
Python
Tencent-Hunyuan/HunyuanVideo-1.5
4,440 stars
HunyuanVideo-1.5 is a video generation foundation model and text-to-video diffusion framework. It utilizes a latent video diffusion model and a spatio-temporal transformer architecture to generate high-definition video sequences from text descriptions and images. The project enables cinematic camera control for directing pans and tilts and provides image-to-video animation capabilities. It supports visual style adaptation through low-rank adaptation tuning and uses a language model for prompt refinement to improve visual alignment. The model covers high-resolution video upscaling via a super…
Python, image-to-video, text-to-video, video-generation
PKU-YuanGroup/Open-Sora-Plan
12,163 stars
Open-Sora-Plan is a text-to-video framework and distributed video training system. It utilizes a diffusion transformer architecture and large language model components to transform written descriptions or image prompts into high-quality video sequences. The system features a distributed infrastructure designed for large-scale video training and inference. It employs sequence parallelism to split high-resolution or long-duration video samples across multiple GPUs and uses a sparse attention mechanism to increase processing speed. The project includes capabilities for both text-to-video and im…
Python

Open-source alternatives to ComfyUI WanVideoWrapper

THUDM/CogVideo

Tencent-Hunyuan/HunyuanVideo-1.5

PKU-YuanGroup/Open-Sora-Plan

HVision-NKU/StoryDiffusion

guoyww/AnimateDiff

zai-org/CogVideo

NVlabs/Sana

SkyworkAI/SkyReels-V2

hpcaitech/Open-Sora

ailab-cvc/videocrafter

thu-ml/TurboDiffusion

ml-explore/mlx-examples

firebase/genkit

HumanAIGC/AnimateAnyone

showlab/Tune-A-Video

meituan-longcat/LongCat-Video

Wan-Video/Wan2.1

Wan-Video/Wan2.2

Comfy-Org/ComfyUI

Tencent-Hunyuan/HunyuanVideo

AIDC-AI/Pixelle-Video

antgroup/echomimic

lucidrains/imagen-pytorch

lucidrains/video-diffusion-pytorch

open-mmlab/mmagic

Picsart-AI-Research/Text2Video-Zero

Sygil-Dev/sygil-webui

QuantumNous/new-api

steven2358/awesome-generative-ai

lllyasviel/Paints-UNDO