Labelme is a Python-based image annotation tool used to create computer vision datasets. It serves as a visual editor for semantic segmentation, allowing users to define object boundaries using polygons, rectangles, points, and circles. The application also functions as a multispectral image annotator, supporting the high-bit-depth TIFF files used in satellite and scientific imagery.
The tool incorporates AI-assisted labeling capabilities to automate the creation of masks and polygons. Shapes can be generated from text prompts or from interactive point selection, in which the tool proposes a boundary based on positive and negative points placed by the user.
The software covers a broad range of data management and annotation tasks, including the creation of dense pixel masks, rotated bounding boxes, and video frame sequencing. It includes a pipeline for converting the JSON files in which it saves annotations into standard dataset formats such as COCO and Pascal VOC. Additional capabilities include image-level classification flags, geometry refinement tools, and batch image importing.
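
To make the JSON-to-COCO conversion concrete, here is a minimal sketch of the idea in Python. It is not Labelme's own exporter. The function names `polygon_area` and `labelme_to_coco` are hypothetical, and the input field names (`shapes`, `label`, `points`, `shape_type`, `imagePath`, `imageHeight`, `imageWidth`) are assumptions about the layout of a Labelme annotation file. The sketch handles polygon shapes only and skips everything else.

```python
from typing import Any


def polygon_area(points: list[list[float]]) -> float:
    """Absolute area of a simple polygon via the shoelace formula."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def labelme_to_coco(record: dict[str, Any], image_id: int = 1) -> dict[str, Any]:
    """Convert one annotation record (assumed layout) to a COCO-style dict."""
    categories: dict[str, int] = {}  # label name -> category id
    annotations = []
    for shape in record["shapes"]:
        # This sketch only converts polygons; other shape types are skipped.
        if shape.get("shape_type", "polygon") != "polygon":
            continue
        points = shape["points"]
        if len(points) < 3:
            continue  # not a valid polygon
        # Assign category ids in order of first appearance, starting at 1.
        category_id = categories.setdefault(shape["label"], len(categories) + 1)
        xs = [p[0] for p in points]
        ys = [p[1] for p in points]
        x_min, y_min = min(xs), min(ys)
        annotations.append({
            "id": len(annotations) + 1,
            "image_id": image_id,
            "category_id": category_id,
            # COCO stores a polygon as a flat [x1, y1, x2, y2, ...] list.
            "segmentation": [[coord for p in points for coord in p]],
            # COCO bounding boxes are [x, y, width, height].
            "bbox": [x_min, y_min, max(xs) - x_min, max(ys) - y_min],
            "area": polygon_area(points),
            "iscrowd": 0,
        })
    return {
        "images": [{
            "id": image_id,
            "file_name": record["imagePath"],
            "height": record["imageHeight"],
            "width": record["imageWidth"],
        }],
        "annotations": annotations,
        "categories": [{"id": cid, "name": name} for name, cid in categories.items()],
    }


# Hypothetical record with one polygon and one point shape.
record = {
    "imagePath": "example.jpg",
    "imageHeight": 100,
    "imageWidth": 200,
    "shapes": [
        {"label": "cat", "shape_type": "polygon",
         "points": [[10, 10], [50, 10], [50, 40], [10, 40]]},
        {"label": "marker", "shape_type": "point", "points": [[5, 5]]},
    ],
}
coco = labelme_to_coco(record)
```

In this sketch the point shape is dropped and only the polygon becomes a COCO annotation. A real exporter would also need to handle rectangles, circles, masks, and multi-image datasets with a shared category list.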