This project is a deep learning image matting framework and computer vision tool designed to separate people from their backgrounds. It functions as a real-time video matting engine and a trainable foreground isolation model that generates per-pixel alpha mattes to isolate subjects from photos and videos. The system utilizes reference-based alpha matting, incorporating a specific background image to simulate green screen effects without a physical screen. This approach allows for the removal and replacement of backgrounds in high-resolution footage, including live video streams. The framewor
BackgroundMattingV2 is a deep learning background matting tool and real-time image segmentation framework. It provides a system for isolating foreground subjects from high-resolution images and video feeds in real time. The project includes a deep learning model trainer for optimizing matting models through base convergence and end-to-end refinement. It also functions as a cross-runtime model exporter, converting trained neural networks into interchangeable formats for deployment across different software environments and hardware runtimes. The framework supports streaming processed webcam f
YOLOv6 is a single-stage deep learning framework designed for industrial object detection. It serves as a computer vision model trainer for identifying and locating objects within images, as well as an instance segmentation tool that delineates precise object boundaries using masks. The project includes a specialized mobile inference optimizer and a model quantization toolkit. These components focus on reducing model size and resolution to improve execution speed on ARM-based chipsets and converting models to low-precision formats to decrease file size. The framework covers a broad range of
RobustVideoMatting is a deep learning video matting tool and PyTorch library designed to remove backgrounds from videos and extract human subjects. It utilizes a temporal video segmentation model to ensure consistent matting and reduce flickering across video frames. The project includes a cross-platform model exporter that converts trained neural networks into various runtime formats. This allows for model deployment across multiple environments, including web and mobile applications. The framework provides capabilities for temporal video background removal and AI video post-production with