CVAT is an open-source computer vision annotation tool and visual dataset management platform. It provides a self-hosted interface for labeling images, videos, and 3D data to create datasets for vision AI models.
The platform features AI-assisted data labeling to automate the creation of masks and bounding boxes, utilizing a plug-in system to connect external machine learning models. It includes a consensus-based quality assurance system that verifies label accuracy by comparing independent annotations.
The system covers collaborative team management, project organization through task decomposition, and remote cloud storage integration. It also provides a REST API for programmatic workflow control and the import and export of data in industry-standard formats.