30 open-source projects similar to plemeri/transparent-background, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Transparent Background alternative.
This project is a deep learning image matting framework and computer vision tool designed to separate people from their backgrounds. It functions as a real-time video matting engine and a trainable foreground isolation model that generates per-pixel alpha mattes to isolate subjects from photos and videos. The system utilizes reference-based alpha matting, incorporating a specific background image to simulate green screen effects without a physical screen. This approach allows for the removal and replacement of backgrounds in high-resolution footage, including live video streams. The framewor
Backgroundremover is an AI-powered tool that removes backgrounds from both images and videos, accessible through a command-line interface and a Python API. At its core, it uses a pre-trained deep learning model to classify each pixel as foreground or background, producing a binary mask for removal. The tool distinguishes itself through multiple integration methods and output capabilities. It can process images and videos via Unix pipeline data streams, operate as an HTTP API server, or be called programmatically within Python scripts. Users can choose among different AI models to balance proc
Perfect Green Screen Keys
Real-Time-Person-Removal is a web-based computer vision application designed to identify and remove human figures from live video streams. Using TensorFlow.js, the tool functions as a real-time background subtraction system that analyzes scene composition to isolate static backgrounds from moving people. The project enables browser-based computer vision by processing webcam video feeds directly in the client. It utilizes machine learning to differentiate between dynamic scene elements and the background, allowing for the real-time removal of people from the visual field.
RobustVideoMatting is a deep learning video matting tool and PyTorch library designed to remove backgrounds from videos and extract human subjects. It utilizes a temporal video segmentation model to ensure consistent matting and reduce flickering across video frames. The project includes a cross-platform model exporter that converts trained neural networks into various runtime formats. This allows for model deployment across multiple environments, including web and mobile applications. The framework provides capabilities for temporal video background removal and AI video post-production with
U-2-Net is a PyTorch image segmentation framework and computer vision saliency model designed to generate high-resolution foreground-background masks. It functions as an AI background removal tool that identifies and isolates the most visually prominent objects within an image. The model utilizes a nested U-structure design to detect salient objects, creating precise cutouts by predicting saliency maps. These capabilities enable the separation of main subjects from their surroundings to create transparent images. The framework covers several image processing workflows, including automatic ba
Rembg is a machine learning-based toolkit designed for automated image background removal and subject segmentation. It functions as a versatile engine that identifies and extracts subjects from images, supporting diverse input methods including individual files, directory-based batch processing, and live binary data streams. The project distinguishes itself through its flexible integration options, offering a command-line interface for local automation, a library for programmatic access, and an HTTP service for remote requests. It utilizes deep learning architectures to classify pixels and ge
MODNet is a deep learning image segmenter and portrait matting model designed to isolate human subjects from backgrounds. It generates high-quality alpha mattes for images and video using only standard RGB input, removing the requirement for manual trimap guides. The framework is optimized for real-time inference and provides utilities to export pre-trained model weights into specialized formats for deployment on target hardware. The project covers the full workflow for portrait isolation, including supervised matting model training on labeled datasets, real-time video background removal, an
This project is a plugin for OBS Studio that uses neural networks to isolate subjects from backgrounds in real-time video streams. It functions as an AI video segmentation tool that predicts portrait masks to create virtual green-screen effects without the need for physical hardware. The software includes a real-time depth estimation filter that identifies scene depth to produce a blurred background while keeping the foreground subject in focus. It also provides low-light video enhancement to improve visibility and visual quality for portrait video captured in poorly lit environments. The pl
IOPaint is an AI image editor and Stable Diffusion inpainting tool providing a web interface for removing objects and replacing image content. It utilizes latent diffusion image processing to synthesize high-resolution replacements for erased sections of an image. The project features a specialized AI background remover for isolating subjects and an AI image upscaler that employs super-resolution models for general photos and anime artwork. The software covers a broad range of capabilities including image segmentation for object isolation, face restoration for improving facial details, and t
VDO.Ninja is a low-latency peer-to-peer media routing service and video streaming platform designed to integrate remote audio and video feeds into professional production workflows. It functions as a WebRTC broadcast integration tool and studio controller, allowing for the direct transmission of high-definition media between publishers and viewers with minimal delay. The platform distinguishes itself through extensive protocol bridging, converting between WebRTC, WHIP, WHEP, SRT, and RTMP to ensure compatibility across diverse network environments and professional studio software. It includes
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. 📷
YASB is a customizable status bar framework and desktop shell component for Windows. It provides a toolkit for building personalized information bars using a modular class-based widget architecture and CSS-based styling. The framework distinguishes itself through deep integration with Windows tiling window managers, allowing users to display active workspaces, tiling layouts, and window focus states. It also features automated visual consistency by generating system color schemes based on the current desktop wallpaper. The project covers a wide range of capabilities, including real-time syst
ar-cutpaste is an augmented reality asset extraction tool and prototype designed to isolate objects from a live camera feed and transfer them into image editing software. It functions as a mobile-to-desktop bridge that uses machine learning to remove backgrounds from live images, creating digital cutouts for use in image composition. The system establishes a local server connection to transmit image data and spatial coordinates from a mobile device to a design application. This bridge uses a remote socket mechanism and a secure password to inject captured assets directly into a desktop worksp
This project is a system-level utility for Linux that intercepts, modifies, and presents live webcam feeds as standard virtual video devices. By creating a bridge between physical hardware and user-space applications, it allows video conferencing software to consume processed streams as if they were native camera inputs. The software distinguishes itself through its ability to manage the lifecycle of video processing tasks as persistent background services. It monitors virtual device activity to dynamically allocate resources, ensuring that image processing and hardware usage are suspended wh
BackgroundMattingV2 is a deep learning background matting tool and real-time image segmentation framework. It provides a system for isolating foreground subjects from high-resolution images and video feeds in real time. The project includes a deep learning model trainer for optimizing matting models through base convergence and end-to-end refinement. It also functions as a cross-runtime model exporter, converting trained neural networks into interchangeable formats for deployment across different software environments and hardware runtimes. The framework supports streaming processed webcam f
PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic differentiation system that allows for flexible, non-static graph execution. The framework is designed for deep integration with Python, enabling natural usage alongside standard scientific computing ecosystems. It distinguishes itself through a comprehensive distributed training sui
PicGo is a cross-platform desktop utility designed to automate image hosting and asset management. It functions as a pipeline-based engine that processes image inputs—such as local files, base64 strings, or clipboard data—through a configurable sequence of transformations and uploads to various cloud storage providers. The application distinguishes itself through a modular, plugin-based architecture that allows users to extend core functionality without modifying the main binary. By utilizing a lifecycle hook system, developers can register custom logic to intercept and modify data at specifi
Wox is a keyboard-driven command interface designed to centralize desktop productivity, system navigation, and workflow automation. It functions as a launcher that enables users to open applications, manage files, and execute system commands through a unified, text-based overlay. By integrating a modular plugin framework, the application allows for the expansion of its core capabilities to include specialized search providers and custom system utilities. The platform distinguishes itself through the integration of language models, which enable natural language search and the automation of tex
CameraView is a high-level Android camera library and hardware wrapper designed for capturing photos and videos. It provides an abstraction layer for managing camera hardware and a media capture API for recording high-resolution video and RAW photos with configurable bitrates and resolutions. The project features a real-time camera filter framework and a preview manager. These systems allow for the application of custom shaders and visual effects to live camera streams and the rendering of previews with customizable aspect ratios, overlays, and composition grids. The library covers a wide ra
This project is a remote desktop software suite and administration tool designed for controlling remote devices via web browsers or desktop applications across different operating systems. It functions as a secure remote access gateway and device manager, providing a centralized backend for auditing sessions and deploying private infrastructure to target machines. The system distinguishes itself through the use of GPU-accelerated video streaming and hardware encoding to reduce latency. It enables multi-device monitoring via a screen wall and supports the creation of virtual display emulations
Deep Painterly Harmonization is a deep learning image harmonization tool and convolutional neural network framework. It is designed to automate the process of blending the visual appearance of a foreground object into a background image to make composite images look natural. The system functions as a computer vision blending tool that adjusts colors and lighting of inserted objects. It ensures that these elements match the global illumination and color palette of the target background environment. The project covers digital image compositing and visual content integration by resolving color
Chainer is an open-source deep learning framework built around define-by-run automatic differentiation, where computation graphs are constructed dynamically during forward execution. This imperative approach allows networks to be built using standard Python control flow, with gradients computed automatically through reverse-mode differentiation on the dynamically recorded graph. The framework supports GPU acceleration through a NumPy-compatible array backend with CUDA and cuDNN support, and provides a pluggable device abstraction that lets users switch between CPU and GPU computation without c
JavaCV provides a Java-based interface for native computer vision and video processing libraries. It functions as a wrapper for native vision libraries, allowing Java applications to perform image analysis, object detection, and video stream processing. The project integrates comprehensive computer vision capabilities, including facial recognition, image segmentation, and optical flow analysis for motion tracking. It also provides tools for hardware geometry calibration and projector-camera alignment to ensure accurate spatial representation. The system covers high-performance media renderin
LFLiveKit is an iOS live streaming SDK designed for capturing, encoding, and transmitting real-time audio and video streams over the RTMP protocol. The toolkit includes a hardware media encoder that uses H264 and AAC acceleration to compress streams, as well as a GPU-accelerated video filter for applying real-time beauty effects and watermarks. It also features an adaptive bitrate streamer that dynamically adjusts transmission rates and manages frame drops to maintain stability during network fluctuations. The system supports media capture from external peripheral devices and screen recordin
Darknet is a high-performance C-based inference engine and computer vision library designed for real-time object identification and localization. It serves as a neural network framework for training and deploying detection models using the YOLO architecture, providing a toolset for deep learning training and deployment. The project differentiates itself through a C and CUDA implementation that enables hardware acceleration for matrix multiplication and inference speed optimization. It provides a shared library interface for embedding detection capabilities into external applications and suppo
DeepDream is a deep learning image processor and convolutional neural network art generator designed to synthesize psychedelic imagery and visualize how neural networks interpret visual data. It functions as a tool for generating generative AI art by amplifying patterns recognized by a pre-trained model to produce dream-like effects. The project utilizes a TensorFlow image visualizer to explore how different layers of a neural network perceive images. This is achieved through algorithmic image manipulation and deep learning visualization techniques that transform standard photographs into sty
FreeRDP is a full software implementation of the Remote Desktop Protocol, providing both client and server capabilities for remote session management. It functions as an RDP client library and a standalone remote desktop client, enabling remote connectivity and interoperability across different operating systems. The project includes a dedicated network device redirector and an RDP gateway client to handle authentication and proxy routing. It allows developers to integrate remote desktop functionality into third-party software applications via its client library. The software covers a wide r
This project is a biomedical image segmentation framework and PyTorch computer vision library. It provides a deep learning pipeline for isolating specific anatomical structures within medical imagery using pixel-level binary classification. The system utilizes an encoder-decoder neural architecture combined with attention-based feature refinement to highlight relevant anatomical regions and suppress background noise. The toolkit covers a full training workflow, including stochastic data augmentation for biomedical datasets, hyperparameter optimization, and model persistence for restoring pre