ChatTTS-ui is a web-based interface and API wrapper for the ChatTTS model, designed to convert written text and mixed language input into spoken audio. It functions as an AI speech synthesis dashboard and a programmatic generator for creating naturalistic voice output. The project focuses on custom voice profiling and speech nuance control. It allows for the maintenance of consistent speaker characteristics using seed values and data files, while providing controls for tone, laughter, and pauses through behavioral prompts and sampling parameters. The system includes a client-server architect
This Python SDK provides a comprehensive toolkit for synthetic audio generation, voice cloning, and the development of conversational AI agents. It enables the creation of lifelike spoken audio from text, the replication of human voices through custom cloning, and the deployment of real-time voice agents capable of interacting with external large language models. The library distinguishes itself through deep integration of conversational AI capabilities, including the design of agent personas and the execution of real-time actions via APIs. It supports professional-grade audio production thro
Pixelle-Video is a text-to-video automation platform and generation engine that converts text topics into complete videos with synchronized narration, images, and music. It functions as a modular system for producing short-form content, utilizing large language models to automate script composition, visual asset generation, and voiceover production. The platform features a node-based workflow orchestrator that allows the composition of custom generation pipelines by linking different AI models. It includes a dynamic video layout designer that uses HTML templates to define aspect ratios and vi
ShortGPT is an automated short-form video creation framework that combines large language model-driven scripting with neural voice synthesis, visual asset retrieval, and programmatic video editing. The project provides a modular pipeline architecture that chains script generation, voiceover synthesis, caption rendering, and video assembly into automated workflows, enabling the production of complete short videos from a topic prompt. The framework distinguishes itself through an LLM-oriented editing language that controls video assembly and rendering tasks programmatically, and a multilingual