# aidc-ai/pixelle-video

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/aidc-ai-pixelle-video).**

23,403 stars · 3,354 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/AIDC-AI/Pixelle-Video
- Homepage: https://aidc-ai.github.io/Pixelle-Video/zh
- awesome-repositories: https://awesome-repositories.com/repository/aidc-ai-pixelle-video.md

## Topics

`aigc` `comfyui` `image-generation` `tts` `video-generation`

## Description

Pixelle-Video is a text-to-video automation platform and generation engine that converts text topics into complete videos with synchronized narration, images, and music. It functions as a modular system for producing short-form content, utilizing large language models to automate script composition, visual asset generation, and voiceover production.

The platform features a node-based workflow orchestrator that allows the composition of custom generation pipelines by linking different AI models. It includes a dynamic video layout designer that uses HTML templates to define aspect ratios and visual arrangements, as well as a voice cloning system that creates synthetic speech by analyzing uploaded audio reference files.

The system covers broad capabilities in audio-visual styling, including the application of aesthetic themes and global style configurations. It manages content production through AI-driven script synchronization, motion video generation, and multi-engine audio mixing.

Infrastructure and backend management include the configuration of AI model endpoints, cloud compute resource management for GPU memory and concurrency, and the integration of custom JSON-defined workflows.

## Tags

### Artificial Intelligence & ML

- [Node-Based Generative Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/workflow-execution-backends/node-based-generative-pipelines.md) — Uses a modular, node-based visual graph system to orchestrate multi-stage generative AI video pipelines.
- [Short-Form Video Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/short-form-video-generation.md) — Automates the end-to-end creation of short videos, including scripts and visual assets, from text prompts.
- [AI Video Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-video-generators.md) — Implements custom node-based pipelines to coordinate multiple AI models for video synthesis.
- [Automated Script Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/automated-script-generation.md) — Automatically generates narration and commentary scripts from a given theme using LLMs. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/))
- [Automated Video Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/automated-video-generators.md) — Coordinates scripts, voiceovers, and visuals into finished short videos from a single text prompt. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/))
- [Text-to-Video Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-video-generators.md) — Converts a single text topic into a complete video with synchronized narration, images, and music.
- [Script-Visual Synchronization](https://awesome-repositories.com/f/artificial-intelligence-ml/script-visual-synchronization.md) — Creates AI-generated images or clips that correspond specifically to individual lines of the video script. ([source](https://aidc-ai.github.io/Pixelle-Video/zh))
- [Voice Cloning Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-synthesis-models/voice-cloning-engines.md) — Creates synthetic voiceovers from text using reference audio samples for voice cloning.
- [Text-to-Speech Synthesis](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-speech-synthesis.md) — Converts written script text into spoken audio narration using AI synthesis engines. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/user-guide/workflows/))
- [Narrated Video Creators](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-speech/narrated-video-creators.md) — Converts written topics into complete narrated videos by combining text-to-speech with AI-generated visuals.
- [Multimodal Asset Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/video-generation/video-clip-generators/multimodal-asset-generation.md) — Generates accompanying images or dynamic video clips using configurable workflows and style prompts. ([source](https://cdn.jsdelivr.net/gh/aidc-ai/pixelle-video@main/README.md))
- [Voice Cloning](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-cloning.md) — Replicates specific human vocal characteristics by analyzing uploaded audio samples.
- [Asynchronous Video Job Managers](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/diffusion-visual-models/generative-ai-pipelines/text-to-video-generators/asynchronous-video-job-managers.md) — Manages the lifecycle of video generation tasks through synchronous execution or asynchronous tracking. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/reference/api-overview/))
- [LLM Provider Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/llm-provider-integrations.md) — Provides connectivity and credential management for integrating external large language model providers. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/reference/config-schema/))
- [Cloud GPU Orchestration](https://awesome-repositories.com/f/artificial-intelligence-ml/local-ai-runtimes/gpu-resource-management/cloud-gpu-orchestration.md) — Controls external cloud compute instances by specifying GPU memory and managing concurrency limits for rendering.
- [Model Provider Plugins](https://awesome-repositories.com/f/artificial-intelligence-ml/model-provider-plugins.md) — Provides an extensible system for registering third-party AI model providers via API endpoints.

### Development Tools & Productivity

- [Custom AI Workflows](https://awesome-repositories.com/f/development-tools-productivity/workflow-automation-tools/workflow-collections/ai-automation-workflows/custom-ai-workflows.md) — Combines atomic AI capabilities into tailored, user-defined generation pipelines. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/))

### Graphics & Multimedia

- [Audio Layering](https://awesome-repositories.com/f/graphics-multimedia/multi-track-audio-visual-composition/audio-layering.md) — Blends synthetic voice narration with background music tracks into a synchronized audio layer.
- [Social Media Content Creation](https://awesome-repositories.com/f/graphics-multimedia/social-media-content-creation.md) — Produces short-form video assets with customizable layouts and aspect ratios tailored for social media platforms.
- [Video Frame Templates](https://awesome-repositories.com/f/graphics-multimedia/video-frame-templates.md) — Formats video output using predefined HTML templates supporting various social media aspect ratios. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/user-guide/web-ui/))

### User Interface & Experience

- [Frame Layout Templates](https://awesome-repositories.com/f/user-interface-experience/layout-utilities/presentation-engines/template-engines/server-side-rendering-engines/html-template-renderers/frame-layout-templates.md) — Defines visual layouts and aspect ratios by rendering video frames using customizable HTML and CSS variables.
- [Template-Based Layout Designers](https://awesome-repositories.com/f/user-interface-experience/html-content-processing/html-content-processing/html-to-image-converters/html-to-video-converters/template-based-layout-designers.md) — Uses HTML templates to define visual arrangements and aspect ratios for social media video content.
- [Generative Aesthetic Themes](https://awesome-repositories.com/f/user-interface-experience/visual-styling-systems/visual-style-references/generative-aesthetic-themes.md) — Applies preset aesthetic themes like cinematic or minimalist to match the tone of generated videos. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/user-guide/templates/))
- [Global Output Configurations](https://awesome-repositories.com/f/user-interface-experience/visual-styling-systems/visual-style-references/global-output-configurations.md) — Provides global configurations for text-to-speech engines, voice cloning references, and output dimensions. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/getting-started/quick-start/))

### Part of an Awesome List

- [AI Motion Video Synthesis](https://awesome-repositories.com/f/awesome-lists/ai/video-and-motion-synthesis/ai-motion-video-synthesis.md) — Produces motion video and animated backgrounds from text prompts via local or cloud APIs. ([source](https://aidc-ai.github.io/Pixelle-Video/zh/user-guide/workflows/))
- [AI Tools](https://awesome-repositories.com/f/awesome-lists/ai/ai-tools.md) — Automated short video generation engine.

### Software Engineering & Architecture

- [Asynchronous Task Queues](https://awesome-repositories.com/f/software-engineering-architecture/asynchronous-task-queues.md) — Implements background job execution to offload long-running video generation processes.
