30 open-source projects similar to nadermx/backgroundremover, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Backgroundremover alternative.
ar-cutpaste is an augmented reality asset extraction tool and prototype designed to isolate objects from a live camera feed and transfer them into image editing software. It functions as a mobile-to-desktop bridge that uses machine learning to remove backgrounds from live images, creating digital cutouts for use in image composition. The system establishes a local server connection to transmit image data and spatial coordinates from a mobile device to a design application. This bridge uses a remote socket mechanism and a secure password to inject captured assets directly into a desktop worksp
Perfect Green Screen Keys
BackgroundMattingV2 is a deep learning background matting tool and real-time image segmentation framework. It provides a system for isolating foreground subjects from high-resolution images and video feeds in real time. The project includes a deep learning model trainer for optimizing matting models through base convergence and end-to-end refinement. It also functions as a cross-runtime model exporter, converting trained neural networks into interchangeable formats for deployment across different software environments and hardware runtimes. The framework supports streaming processed webcam f
This project is a plugin for OBS Studio that uses neural networks to isolate subjects from backgrounds in real-time video streams. It functions as an AI video segmentation tool that predicts portrait masks to create virtual green-screen effects without the need for physical hardware. The software includes a real-time depth estimation filter that identifies scene depth to produce a blurred background while keeping the foreground subject in focus. It also provides low-light video enhancement to improve visibility and visual quality for portrait video captured in poorly lit environments. The pl
Background Removal JS is a client-side neural matting library that runs a lightweight AI model directly in the browser to compute alpha mattes, removing image backgrounds without uploading any data to a server. It functions as a browser-based image background removal SDK and client-side image matting library, keeping all processing on the user's device to eliminate server costs and preserve privacy. The project provides a cross-platform creative editing engine that produces identical image and video output across web, mobile, desktop, and server environments from a single codebase. It offers
chaiNNer is a GPU-accelerated AI image upscaling application that uses a visual node-based interface for constructing image processing pipelines. At its core, it provides a node-based visual programming environment where users connect processing nodes in a directed acyclic graph, with a graph execution scheduler that traverses the pipeline in topological order. The application includes an iterator-based batch processing system that automatically applies the same pipeline to multiple files, and a model format conversion pipeline that transforms neural network models between PyTorch, ONNX, and N
This project is a collection of optional, community-contributed algorithms and specialized vision tools that extend the core OpenCV framework. It serves as a comprehensive library of extra modules for computer vision research, providing advanced toolsets for image processing, visual data analysis, and object detection. The library includes specialized frameworks for augmented reality tracking, biometric face recognition, and three-dimensional pose estimation. It provides distinct capabilities for identifying AR markers, tracking 3D object silhouettes, and performing neural network vulnerabili
Rembg is a machine learning-based toolkit designed for automated image background removal and subject segmentation. It functions as a versatile engine that identifies and extracts subjects from images, supporting diverse input methods including individual files, directory-based batch processing, and live binary data streams. The project distinguishes itself through its flexible integration options, offering a command-line interface for local automation, a library for programmatic access, and an HTTP service for remote requests. It utilizes deep learning architectures to classify pixels and ge
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. 📷
Real-Time-Person-Removal is a web-based computer vision application designed to identify and remove human figures from live video streams. Using TensorFlow.js, the tool functions as a real-time background subtraction system that analyzes scene composition to isolate static backgrounds from moving people. The project enables browser-based computer vision by processing webcam video feeds directly in the client. It utilizes machine learning to differentiate between dynamic scene elements and the background, allowing for the real-time removal of people from the visual field.
RobustVideoMatting is a deep learning video matting tool and PyTorch library designed to remove backgrounds from videos and extract human subjects. It utilizes a temporal video segmentation model to ensure consistent matting and reduce flickering across video frames. The project includes a cross-platform model exporter that converts trained neural networks into various runtime formats. This allows for model deployment across multiple environments, including web and mobile applications. The framework provides capabilities for temporal video background removal and AI video post-production with
Human is a TensorFlow.js computer vision library used for face, body, and hand tracking within the browser or Node.js. It provides a framework for human pose and gesture tracking, facial recognition, and biometric liveness detection to verify a live human presence. The project distinguishes itself through a full suite of identity and motion tools, including a facial recognition framework that generates embeddings for similarity matching and a background segmenter for separating humans from their environment. It incorporates a liveness detector to prevent spoofing during facial analysis. The
U-2-Net is a PyTorch image segmentation framework and computer vision saliency model designed to generate high-resolution foreground-background masks. It functions as an AI background removal tool that identifies and isolates the most visually prominent objects within an image. The model utilizes a nested U-structure design to detect salient objects, creating precise cutouts by predicting saliency maps. These capabilities enable the separation of main subjects from their surroundings to create transparent images. The framework covers several image processing workflows, including automatic ba
Natron is an open-source node-based video compositor and motion graphics software. It functions as a visual effects tool that combines images and video using a node graph and a linear color pipeline to create complex effects and set extensions. The software includes a 2D motion tracking suite for stabilizing footage and tracking pixel movement across frames. It is a Python-scriptable environment, allowing for custom automation and user interface extensions through an embedded Python interpreter. The platform provides a comprehensive range of compositing capabilities, including chroma keying,
Photopea is a browser-based image editor that runs entirely in the browser without requiring any downloads or installations, capable of editing both raster and vector graphics. It functions as a multi-format image converter, a PSD file editor, a RAW photo processor, and a vector graphics editor, all within a single web application. The editor distinguishes itself through its extensive file format support, opening and saving industry-standard formats including PSD, AI, Sketch, Figma, and XD, as well as native files from Affinity, InDesign, and Krita. It also supports RAW camera files from Niko
GPAC is an open-source multimedia framework built around a pluggable filter graph pipeline, where modular processing units called filters connect into a directed graph to handle media workflows. At its core, the framework centers all media packaging and manipulation on the ISO Base Media File Format (ISOBMFF), with specialized tools for reading, writing, fragmenting, and encrypting MP4 and related containers. It also provides a declarative scene graph composition system for describing interactive multimedia scenes using MPEG-4 BIFS, X3D, SVG, or VRML syntax, alongside a hardware-accelerated re
Tailspin is a regex-based text colorizer and terminal log viewer designed to transform plain text streams into colorized output. It functions as a command line log highlighter and tailer that applies syntax highlighting to logs using regular expressions. The tool distinguishes itself through its ability to monitor files in real time and pipe live output through a highlighter. It recognizes and colors common data types such as IP addresses, UUIDs, HTTP methods, JSON objects, dates, and memory pointers. Users can define custom highlight styles and regex patterns to assign specific colors to uni
Translate-shell is a command-line translation tool and terminal dictionary client. It allows for the translation of words, phrases, and sentences between multiple languages and provides dictionary definition retrieval and language metadata display directly within the terminal. The tool functions as a shell-based text translator that can process input from standard streams, local files, or URLs. It includes text-to-speech capabilities to play audio pronunciations of source and translated text and can automatically detect the source language of a given string. The system supports interactive s
OpenCVSharp is a .NET library that wraps native OpenCV functions, providing C# developers with access to OpenCV's computer vision capabilities through an API that mirrors the native C/C++ style. It serves as a managed wrapper for image processing, feature detection, object detection, and image manipulation tasks, while also handling automatic disposal of unmanaged OpenCV resources like Mat objects to prevent memory leaks in .NET applications. The library enables keypoint detection and descriptor extraction using algorithms such as AKAZE, BRISK, or FAST, with brute-force or FLANN-based matchin
fzf-lua is a fuzzy finder integration for Neovim that utilizes fzf to search files, buffers, and project symbols. It serves as a code navigation framework providing a dynamic result generator that populates search windows using real-time shell commands or custom Lua functions. The project distinguishes itself through specialized integration tools for Git and Language Server Protocols. It includes a Git search interface for navigating commits, branches, stashes, and diffs, alongside an LSP integration tool that bridges language server providers to locate definitions and references across a cod
This project is a keyboard-driven menu framework and application launcher for X11 and Wayland desktop environments. It serves as a customizable system control interface for building themed popup menus and applets. The framework allows for extensive visual style customization and the creation of themed launchers through configuration files and style presets. It enables the construction of personalized Linux desktop interfaces by separating functional applets into a modular script architecture. The system covers broad management capabilities, including hardware control for audio and brightness
MoviePy is a Python video editing library and automated video processor designed for programmatically cutting, concatenating, and manipulating video and audio files. It serves as a non-linear video editor and an interface for FFmpeg to handle the reading, writing, and conversion of diverse media formats and codecs. The library enables automated video composition through the layering of multiple video and audio streams using transparency and coordinate-based positioning. It supports dynamic content generation by inserting text overlays and performing custom video frame processing where raw fra
Skim is an interactive text filter and terminal selection tool written in Rust. It functions as a command line interface utility that processes input streams to isolate specific entries through real-time user queries and sorting. The tool differentiates itself through ANSI compatibility, allowing it to parse color codes and maintain text formatting during the search process. It supports multiple matching strategies, including configurable fuzzy matching algorithms and regular expression integration. The application covers a broad range of capabilities including field-specific filtering, resu
Deep Painterly Harmonization is a deep learning image harmonization tool and convolutional neural network framework. It is designed to automate the process of blending the visual appearance of a foreground object into a background image to make composite images look natural. The system functions as a computer vision blending tool that adjusts colors and lighting of inserted objects. It ensures that these elements match the global illumination and color palette of the target background environment. The project covers digital image compositing and visual content integration by resolving color
Shell GPT is an AI-powered command-line interface that generates shell commands and source code from natural language prompts. It serves as a terminal-based tool for automating technical tasks, producing executable commands, and generating code snippets directly within the shell. The tool distinguishes itself through a read-eval-print loop for interactive chatting and the ability to maintain stateful conversational history via named sessions. It supports flexible backend routing, allowing users to connect to cloud-based APIs or local language model hosts for offline operation and data privacy
Civitai is a platform for generative media creation and AI model distribution. It provides a centralized service for producing images, videos, audio, and music, while serving as a repository where users can share, discover, and browse custom model weights and fine-tuned adaptations. The platform distinguishes itself through a provider-agnostic orchestration layer that manages multi-step generation pipelines and complex workflows across different backends. It integrates with autonomous AI agents and editors via the Model Context Protocol, allowing external tools to access generation pipelines
RapidRAW is a non-destructive RAW photo editor and digital asset manager designed for decoding manufacturer RAW formats and applying tonal and color adjustments. It functions as a professional image processor that ensures original source data remains unmodified by saving all edits, masks, and crops to sidecar files. The software features a specialized color grading suite using 3D LUTs, color wheels, and HSL mixers, alongside AI-powered utilities for subject isolation, automatic masking, and generative inpainting for object removal. It distinguishes itself with AI-assisted photo retouching and
Inpaint-Anything is a diffusion-based image editor and inpainting tool designed to remove or replace objects in images, videos, and 3D scenes. It functions as a text-guided manipulator that uses natural language descriptions and mask-based filling to modify visual content. The system provides specialized capabilities for multi-view 3D scene editing and video object removal. It tracks selected objects across multiple frames or perspectives to synthesize consistent backgrounds and maintain spatial coherence after an element is removed. The tool covers a range of image manipulation tasks, inclu
Polybar is a customizable status bar for X11 window managers that functions as a desktop shell interface for displaying real-time system metrics and application status. It serves as a shell script runner and system resource monitor, rendering a persistent graphical overlay on the desktop to track CPU load, memory usage, battery levels, and network connection details. The project distinguishes itself by acting as a system tray host to manage background application icons and as a window manager interface that communicates via protocol messages to show active workspace indicators and window titl