ComfyUI WanVideoWrapper

Open-source alternatives to ComfyUI WanVideoWrapper

Similar open-source projects, ranked by how many features they share with ComfyUI WanVideoWrapper.

THUDM/CogVideo
12,792 stars
CogVideo is a generative video framework that uses diffusion models and transformer-based architectures to synthesize high-resolution video clips. It functions as both a text-to-video and image-to-video generator, converting textual descriptions or static images into temporal visual sequences. The system integrates large language model capabilities to expand short user prompts into detailed descriptions for better visual alignment. It supports the animation of static images through latent seeding and provides the ability to extend the length of existing video sequences. The project includes …
Python
HVision-NKU/StoryDiffusion
6,430 stars
StoryDiffusion is a generative AI system designed for consistent character image and video generation. It utilizes a pluggable cross-attention module to inject shared character representations into pretrained diffusion models, allowing for visual identity stability across multiple images and scenes without retraining the base model. The project features a video generation pipeline that produces temporally coherent sequences from text prompts or condition images. It employs a latent space motion interpolator to predict intermediate frames and semantic motion, enabling long-range video generati…
Jupyter Notebook
guoyww/AnimateDiff
12,144 stars
AnimateDiff is a latent diffusion video generator and text-to-video diffusion framework. It converts existing text-to-image diffusion models into animation generators by applying specialized motion modules, allowing for the creation of video sequences without modifying the original base model. The project provides an image-to-video animation framework that uses sparse RGB images, sketches, or structural keyframe constraints to guide generation. It further distinguishes itself with a motion adapter system that injects cinematic camera movements, such as zooming, panning, and tilting, into anim…
Python
PKU-YuanGroup/Open-Sora-Plan
12,163 stars
Open-Sora-Plan is a text-to-video framework and distributed video training system. It utilizes a diffusion transformer architecture and large language model components to transform written descriptions or image prompts into high-quality video sequences. The system features a distributed infrastructure designed for large-scale video training and inference. It employs sequence parallelism to split high-resolution or long-duration video samples across multiple GPUs and uses a sparse attention mechanism to increase processing speed. The project includes capabilities for both text-to-video and im…
Python

See all 30 alternatives to ComfyUI WanVideoWrapper

kijai/ComfyUI-WanVideoWrapper
