# xuebinqin/u-2-net

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/xuebinqin-u-2-net).**

9,773 stars · 1,614 forks · Python · Apache-2.0

## Links

- GitHub: https://github.com/xuebinqin/U-2-Net
- awesome-repositories: https://awesome-repositories.com/repository/xuebinqin-u-2-net.md

## Topics

`computer-vision` `deep-learning` `image-background-removal` `image-processing` `image-segmentation` `u-2-net` `u2net`

## Description

U-2-Net is a PyTorch image segmentation framework and computer vision saliency model designed to generate high-resolution foreground-background masks. It functions as an AI background removal tool that identifies and isolates the most visually prominent objects within an image.

The model utilizes a nested U-structure design to detect salient objects, creating precise cutouts by predicting saliency maps. These capabilities enable the separation of main subjects from their surroundings to create transparent images.

The framework covers several image processing workflows, including automatic background removal, salient object detection, and human portrait extraction for isolating head and shoulder regions. It employs a symmetric encoder-decoder path and multi-scale feature aggregation to localize object boundaries.

## Tags

### DevOps & Infrastructure

- [Background Removal Tools](https://awesome-repositories.com/f/devops-infrastructure/background-processing/background-removal-tools.md) — Provides an AI-powered tool for isolating subjects from image backgrounds to create transparent cutouts.
- [Portrait Masking](https://awesome-repositories.com/f/devops-infrastructure/background-processing/background-removal-tools/portrait-masking.md) — Extracts human head and shoulder regions using face detection and cropping to create high-quality masks. ([source](https://github.com/xuebinqin/u-2-net#readme))

### Artificial Intelligence & ML

- [Saliency Models](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-models/saliency-models.md) — Implements a research-based model that detects visually significant regions for portrait extraction and masking.
- [Salient](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/object-detection-tracking/object-detection/salient.md) — Isolates visually prominent objects within an image to create a map for background removal. ([source](https://github.com/xuebinqin/u-2-net#readme))
- [Image Segmentation](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/image-segmentation.md) — Partitions images into distinct foreground and background regions for downstream computer vision tasks.
- [Salient](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/architectures/computer-vision-segmentation-models/object-detection-models/salient.md) — Uses a nested U-structure architecture to identify and isolate the most prominent objects in an image.
- [Nested U-Structures](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-architectures/u-net-architectures/nested-u-structures.md) — Utilizes a nested U-structure design to extract deep features through repeated downsampling and upsampling paths.
- [PyTorch Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/pytorch-implementations.md) — Provides a complete model implementation using the PyTorch framework for image segmentation research.
- [PyTorch Semantic Segmentation Libraries](https://awesome-repositories.com/f/artificial-intelligence-ml/pytorch-semantic-segmentation-libraries.md) — Implements a PyTorch framework for generating high-resolution binary masks for image segmentation.
- [Saliency-Driven Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/saliency-mapping/saliency-driven-learning.md) — Trains the network to produce probability maps that isolate the most visually prominent objects from the background.
- [Symmetric Encoder-Decoders](https://awesome-repositories.com/f/artificial-intelligence-ml/encoder-decoder-architectures/symmetric-encoder-decoders.md) — Employs a symmetric encoder-decoder path to recover spatial resolution for precise object boundary localization.
- [Multi-Scale Feature Aggregation](https://awesome-repositories.com/f/artificial-intelligence-ml/inference-scaling/resolution-scaling/hierarchical-feature-pyramids/multi-scale-feature-pyramids/multi-scale-feature-aggregation.md) — Implements multi-scale feature aggregation to improve the accuracy of object masks across various sizes.

### Graphics & Multimedia

- [Image Background Removal](https://awesome-repositories.com/f/graphics-multimedia/image-background-removal.md) — Separates the main object from its surroundings to create transparent cutouts. ([source](https://xuebinqin.github.io/dis/index.html))
