# skyworkai/skyreels-v2

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/skyworkai-skyreels-v2).**

6,356 stars · 1,306 forks · Python · other

## Links

- GitHub: https://github.com/SkyworkAI/SkyReels-V2
- Homepage: https://platform.skyreels.ai
- awesome-repositories: https://awesome-repositories.com/repository/skyworkai-skyreels-v2.md

## Description

SkyReels-V2 is a video generation system that creates, extends, and refines video clips from text descriptions, images, or both. It operates as a diffusion-based video generation model that can produce videos of any duration by denoising frames sequentially, with each new frame conditioned on the ones that came before it. The system supports generating videos from scratch using text prompts, starting from a single image and producing subsequent frames, or constraining both the first and last frames to match user-provided images.

What distinguishes SkyReels-V2 is its combination of infinite-length video generation, frame-level control, and motion quality refinement through reinforcement learning. The system can extend videos indefinitely by denoising tokens at independent noise levels per frame, enabling seamless continuation of footage beyond typical length limits. It also applies direct preference optimization on preference pairs to train the model toward physically plausible, large-motion sequences, improving temporal coherence and motion quality. A prompt expansion language model automatically expands brief text descriptions into more detailed prompts, while a vision-language captioning model generates detailed textual descriptions of video content including shot types and camera movements.

The system includes multi-GPU pipeline parallelism that distributes frame batches across multiple GPUs to reduce end-to-end inference time for large-scale outputs. It also supports video extension, appending new frames to an existing clip by conditioning on its last frames for seamless continuation.

## Tags

### Artificial Intelligence & ML

- [Video Clip Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/video-clip-generators.md) — Creates video clips from text descriptions, images, or both, using a diffusion-based denoising process.
- [Video Motion Refiners](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/agentic-workflows/iterative-refinement-workflows/reinforcement-learning-policy-improvement/video-motion-refiners.md) — Improves motion quality in video generation by applying direct preference optimization on preference pairs.
- [Preference-Based Motion Refiners](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-video-generators/video-motion-controllers/preference-based-motion-refiners.md) — Refines the model's ability to generate physically plausible, large-motion sequences by training on preference pairs and applying direct preference optimization. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))
- [Text-to-Video Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-video-generators.md) — Synthesizes dynamic video content from descriptive text prompts using a diffusion-based denoising process. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))
- [Image-Conditioned Video Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/image-generation-models/conditional-image-generation/image-conditioned-video-generators.md) — Generates video frames conditioned on a starting image, using the image as the first frame and producing subsequent frames from a text prompt.
- [Per-Frame Noise Level Schedulers](https://awesome-repositories.com/f/artificial-intelligence-ml/noise-level-sampling-strategies/per-frame-noise-level-schedulers.md) — Denoises tokens at different noise levels independently per frame, enabling infinite-length video generation by extending sequences frame-by-frame.
- [Motion Quality Refiners](https://awesome-repositories.com/f/artificial-intelligence-ml/preference-optimization/motion-quality-refiners.md) — Applies direct preference optimization on preference pairs to train the model toward physically plausible, large-motion sequences.
- [Image-to-Video Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/image-to-video-generation.md) — Produces a video starting from a single input image, using the image as the first frame and generating subsequent frames based on a text prompt. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))
- [Infinite-Length Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/infinite-length-generators.md) — Creates videos of any duration by extending a sequence frame-by-frame with a model that denoises tokens at independent noise levels. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))
- [Video Continuation Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/video-clip-generators/video-continuation-tools.md) — A model that appends new frames to an existing video clip by conditioning on its last frames, enabling seamless continuation of footage. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))
- [Prompt Expansion](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-workflows/text-to-video-generation/prompt-expansion.md) — Expands brief text descriptions into detailed prompts using a language model for better video alignment. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))
- [Multi-GPU Video Inference Accelerators](https://awesome-repositories.com/f/artificial-intelligence-ml/gpu-accelerated-inference/multi-gpu-video-inference-accelerators.md) — Distributes video generation workloads across multiple GPUs to reduce inference time for large-scale outputs.
- [Vision-Language Models](https://awesome-repositories.com/f/artificial-intelligence-ml/on-device-models/vision-language-models.md) — Uses a vision-language model to automatically generate detailed textual descriptions of video content, including shot types and camera movements.
- [Video Captioning](https://awesome-repositories.com/f/artificial-intelligence-ml/video-captioning.md) — Generates detailed text descriptions of video content, including shot types, camera movements, and subject actions, using a vision-language model. ([source](https://cdn.jsdelivr.net/gh/skyworkai/skyreels-v2@main/README.md))

### Graphics & Multimedia

- [Autoregressive Frame Denoisers](https://awesome-repositories.com/f/graphics-multimedia/frame-buffer-snapshots/sequential-frame-buffers/temporal-frame-interpolation/autoregressive-frame-denoisers.md) — Generates video by denoising each frame sequentially, conditioning each new frame on the previous ones to produce coherent temporal sequences.
- [Frame-Controlled Generators](https://awesome-repositories.com/f/graphics-multimedia/video-frame-capture/frame-controlled-generators.md) — A tool that produces videos with user-specified start and end frames, ensuring the output begins and ends with specific visuals.
- [Frame Constraint Encoders](https://awesome-repositories.com/f/graphics-multimedia/video-encoding-and-decoding/frame-constraint-encoders.md) — Encodes both the first and last user-provided images as constraints, ensuring the generated video begins and ends with specific visuals.

### Web Development

- [Frame-Batch Distributors](https://awesome-repositories.com/f/web-development/performance-optimizations/computational-parallelization/parallel-gpu-schedulers/frame-batch-distributors.md) — Splits the video generation workload across multiple GPUs by distributing frame batches, reducing end-to-end inference time through parallel processing.