TRELLIS is a 3D generative AI model and latent diffusion framework designed to transform natural language descriptions or reference images into textured 3D assets. It operates as a text-to-3D asset generator that utilizes structured latent representations to produce high-quality 3D meshes, Gaussians, and Radiance Fields.
The system functions as a multi-format 3D decoder, converting internal representations into standard exchange formats such as GLB and PLY. It also serves as a 3D asset editing tool, enabling the modification of specific regions of generated objects through targeted text or image-based prompts.
The framework covers a broad range of capabilities including cross-modal conditioning and diffusion-based latent generation. It supports large-scale model training across single or multi-node GPU configurations and provides workflows for creating visual variations of existing assets.