# ali-vilab/vace

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/ali-vilab-vace).**

3,645 stars · 250 forks · Python · apache-2.0

## Links

- GitHub: https://github.com/ali-vilab/VACE
- Homepage: https://ali-vilab.github.io/VACE-Page/
- awesome-repositories: https://awesome-repositories.com/repository/ali-vilab-vace.md

## Topics

`video-editing` `video-generation`

## Description

VACE is a set of software tools and frameworks for reference-guided video generation, diffusion-based editing, and video-to-video translation. It provides utilities to produce new video content and modify existing sequences by using reference materials to guide visual style, subject matter, and composition.

The framework enables video-to-video translation and synthesis, allowing for the update of visual styles and depth. It also functions as a video editor for modifying properties and content through reference-guided transformations.

The system covers localized video editing and inpainting, which allows for the replacement or modification of specific objects and areas using masks or bounding boxes. It also includes capabilities for general video content transformation and visual structure generation.

## Tags

### Artificial Intelligence & ML

- [Latent Diffusion Models](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-models/latent-diffusion-models.md) — Utilizes latent diffusion models as the primary generative framework for denoising compressed video representations.
- [Spatio-Temporal Attention](https://awesome-repositories.com/f/artificial-intelligence-ml/attention-mechanisms/spatio-temporal-attention.md) — Employs spatio-temporal attention mechanisms to ensure visual stability and consistency across sequential video frames.
- [Cross-Attention Conditioning](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-video-generators/cross-attention-conditioning.md) — Implements cross-attention conditioning to map reference images and masks to visual features in the latent space.
- [Video-to-Video Synthesis](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-tasks/video-to-video-synthesis.md) — Performs video-to-video synthesis by injecting original structural information into a diffusion process to maintain consistency.
- [Reference-Guided Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/reference-guided-generation.md) — Injects external style and subject information into the model to guide the appearance of the final video output.
- [Reference-Based Video Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/reference-based-video-generators.md) — Synthesizes new video content that follows the visual style and composition of provided reference materials.
- [Diffusion-Based Editors](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/prompt-based-video-editors/diffusion-based-editors.md) — Provides a video editor that modifies properties and content using diffusion models and reference-guided transformations.

### Graphics & Multimedia

- [Latent Inpainting Masks](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/face-portrait-manipulation/image-masking/face-mask-generation/latent-inpainting-masks.md) — Provides latent inpainting masks to replace specific video regions by blending generated content into the latent space.
- [Generative Video Editing](https://awesome-repositories.com/f/graphics-multimedia/ai-video-editing-automation/generative-video-editing.md) — Enables generative video editing to replace or inpaint objects within sequences using masks. ([source](https://cdn.jsdelivr.net/gh/ali-vilab/vace@main/README.md))
- [Video Inpainting Systems](https://awesome-repositories.com/f/graphics-multimedia/area-filling-and-clearing/video-inpainting-systems.md) — Implements video inpainting systems to replace or modify specific objects using masks and bounding boxes.
- [Video Transformation and Enhancement](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-manipulation/media-processing-workflows/video-transformation-enhancement.md) — Provides tools for high-end aesthetic refinement and stylistic video manipulation using reference materials.

### Part of an Awesome List

- [Video Generation](https://awesome-repositories.com/f/awesome-lists/ai/video-generation.md) — All-in-one video creation and editing framework.
