sam-3d-body is a machine learning framework for 3D human mesh recovery and pose estimation. It utilizes a 3D human mesh recovery model to reconstruct full-body meshes, including the body, hands, and feet, from a single image.
The project implements a specialized extension of the Segment Anything Model to guide the extraction and refinement of human body shapes. This integration allows for prompt-guided mesh recovery, where 2D masks and keypoints constrain the inference of 3D pose and shape parameters.
The system covers a range of computer vision capabilities, including 3D spatial alignment to place human meshes and objects into a shared coordinate frame and digital human modeling for virtual environments. It also includes tools for differentiable mesh rendering and visualization via reconstruction result overlays on original 2D images.