This project is a framework for training and sampling generative models designed to produce high-quality images in few steps. It provides implementations for image generation models that transform random noise into structured visual data through an optimized sampling process.
The system specializes in accelerating image generation through consistency distillation and consistency training. It includes tools to transform pre-trained diffusion models into faster versions by distilling knowledge from a teacher model into a student model, as well as methods to train consistency models from scratch.
The project covers a broad surface of generative AI development, including text-to-image sampling and image dataset preparation. It also features an evaluation suite for benchmarking generative quality using metrics such as Fréchet Inception Distance, Precision, Recall, and Inception Score.