Albumentations is an image augmentation library and computer vision preprocessing tool designed to expand datasets for deep learning models. It provides a collection of transformations that modify pixel values and spatial geometry to increase the diversity of training samples and improve model generalization. The library supports both 2D image augmentation and 3D volumetric data augmentation. It handles a variety of labels alongside images, ensuring that bounding boxes, keypoints, and segmentation masks remain accurately aligned when spatial transformations are applied. The tool incorporates
PointNet is a deep learning architecture designed to process and classify raw 3D point clouds directly without voxelization. It provides a system for 3D object classification, semantic segmentation frameworks for partitioning clouds into categories, and tools for visualizing 3D shapes. The project utilizes a transform network to align point clouds into a canonical coordinate space and employs symmetric-function-based aggregation to condense point-wise features into global vectors regardless of point order. It also features a multi-scale grouping architecture to extract hierarchical geometric
Created by Charles R. Qi , Li (Eric) Yi , Hao Su , Leonidas J. Guibas from Stanford University.