28 open-source projects similar to luanfujun/deep-painterly-harmonization, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Deep Painterly Harmonization alternative.
vibrant.js is a JavaScript color extraction library used to identify dominant color palettes from images based on the Android Palette algorithm. It functions as an image palette generator and a color processing tool that converts extracted data between RGB, HSL, and Hexadecimal formats. The library analyzes image pixels to categorize colors into specific profiles, including vibrant, muted, dark, and light. It also includes an accessible text color calculator that determines high-contrast hex colors for text overlays based on a selected background color. The toolset covers automated palette g
This is a jQuery plugin that extracts the dominant color from an image or CSS background image and applies it as a background color on a target element. It uses canvas-based pixel analysis with an RGB quantization algorithm to identify the most prominent color, then injects that color as an inline CSS background-color style. The plugin automatically normalizes text contrast by calculating the relative luminance of the extracted color and toggling between dark and light text to maintain readability. It includes an image preloading pipeline to ensure pixel data is available synchronously from t
Chameleon is a color framework for Swift and Objective-C applications. It provides systems for programmatic color palette generation, global theme orchestration, hexadecimal conversion, and the extraction of visual data from images. The library includes a dynamic theme engine for applying global visual styles and luminance adjustments across an interface. It features a palette generator for creating analogous, complementary, and triadic color schemes based on a seed color, and an image color extractor to derive average colors or palettes from images. The framework covers a range of color man
Chameleon is a color framework for Swift and Objective-C applications, providing a toolkit for managing dynamic palettes, gradient libraries, and hexadecimal conversions. It functions as a system for creating harmonious color schemes and calculating contrasting text colors based on background luminance. The project includes an image color extractor that analyzes images to generate matching color schemes or calculate average colors for user interfaces. It also features a gradient color library for creating and applying smooth transitions to backgrounds and text elements. The framework covers
This project is an AI upscaling framework and deep learning image restorer designed to estimate original source pixels from low-resolution inputs. It functions as a super-resolution reconstruction system that transforms pixelated images into high-resolution versions by restoring high-frequency details and sharpening edges. The system utilizes a convolutional neural network pipeline to analyze pixel data and perform digital image restoration. It employs pixel-shuffle upsampling to rearrange channel dimensions into spatial dimensions, which increases resolution while reducing checkerboard artif
Clarity-upscaler is an AI image upscaler and enhancement tool that uses deep learning models to increase image resolution and restore visual detail. It functions as a super-resolution inference engine that employs neural networks to predict missing pixels and synthesize high-frequency details from low-resolution sources. The project is delivered as a programmable API, allowing the integration of automated high-resolution image processing and sharpening into external applications and workflows. This interface enables the programmatic upscaling of images to create high-resolution assets. The s
Pywal is an image-based theme engine and dynamic color scheme generator that extracts dominant colors from images to create coordinated system-wide color palettes. It functions as a cross-application theme synchronizer and terminal color palette manager, updating interface colors and environment configurations in real-time. The system synchronizes generated palettes across third-party software, window managers, and supported hardware, including RGB backlight controllers for keyboards and laptops. It integrates wallpaper management by applying a source image as the system background while simu
Color-thief is a color quantization library and image color palette extractor designed to identify the most prominent colors in visual media. It functions as a semantic color classifier and color space converter, providing tools to extract dominant colors and generate representative palettes from images, videos, and canvas elements. The project utilizes a WebAssembly color processor and background workers to perform high-performance pixel analysis. It implements a WCAG contrast analyzer to calculate color contrast ratios and determine accessible foreground text colors based on accessibility s
DeepDream is a deep learning image processor and convolutional neural network art generator designed to synthesize psychedelic imagery and visualize how neural networks interpret visual data. It functions as a tool for generating generative AI art by amplifying patterns recognized by a pre-trained model to produce dream-like effects. The project utilizes a TensorFlow image visualizer to explore how different layers of a neural network perceive images. This is achieved through algorithmic image manipulation and deep learning visualization techniques that transform standard photographs into sty
This project is a TensorFlow-based neural style transfer tool and deep learning image processor. It uses convolutional neural networks to apply the artistic style of one image to the content of another through neural image synthesis. The system supports multi-style blending to combine artistic characteristics from several different images into a single output. It also includes color-preserving stylization, which maintains the original color palette of the source image by merging source color data with the luminance of the stylized result. The tool provides capabilities for style abstraction
DeOldify is a deep learning system and a set of pre-trained computer vision models designed to apply realistic colors to grayscale photographs and video footage. It functions as a neural media restoration tool that uses trained networks to estimate original hues for black-and-white media and remove glitches and artifacts from aged images and film. The project employs a NoGAN colorization technique that removes the GAN discriminator during training to prevent artifacts and avoid over-saturation of pixels. For cinematic sequences, it applies temporal frame consistency to maintain color stabilit
Style2paints is a deep learning image processor designed for the automated colorization of grayscale line art. It functions as a generative style transfer engine that maps artistic color palettes and textures onto monochrome sketches, allowing users to transform black and white drawings into finished illustrations through neural network inference. The system distinguishes itself by incorporating user-provided color guidance and style references to influence the final output. It utilizes coordinate-mapped color points and hint-driven optimization to ensure that specific colors are applied prec
PRNet is a Python library for 3D facial reconstruction. It uses a deep learning regression model to predict 3D facial geometry and vertex colors from a single 2D input image to generate a textured mesh. The project provides tools for digital face swapping, allowing the replacement of a target face with a new image and blending textures to match the original pose. It also includes a framework for face texture swapping and blending to fit specific 3D poses. Additional capabilities cover facial analysis, including the detection and alignment of facial landmarks and the estimation of head pose a
DenseNet is a computer vision model and convolutional neural network implementation designed for image recognition and classification tasks. It utilizes a densely connected network architecture where each layer is connected to every other layer to improve feature propagation. The implementation reduces the number of parameters while maintaining accuracy through a dense-connectivity pattern and layer-aggregation concatenation. It supports model construction using both standard and bottleneck-compressed architectures, with configurable network depth and growth rates to balance inference time an
This software is a computer vision utility designed for automated subject isolation and background removal. It provides a graphical desktop interface that allows users to extract foreground subjects from static images, video files, and live webcam streams without requiring command-line interaction. The application leverages deep learning models to generate high-fidelity alpha masks, enabling the creation of transparent backgrounds or the application of custom replacements. By utilizing hardware-accelerated tensor processing, the system performs real-time segmentation on live camera feeds and
Backgroundremover is an AI-powered tool that removes backgrounds from both images and videos, accessible through a command-line interface and a Python API. At its core, it uses a pre-trained deep learning model to classify each pixel as foreground or background, producing a binary mask for removal. The tool distinguishes itself through multiple integration methods and output capabilities. It can process images and videos via Unix pipeline data streams, operate as an HTTP API server, or be called programmatically within Python scripts. Users can choose among different AI models to balance proc
This project is a collection of optional, community-contributed algorithms and specialized vision tools that extend the core OpenCV framework. It serves as a comprehensive library of extra modules for computer vision research, providing advanced toolsets for image processing, visual data analysis, and object detection. The library includes specialized frameworks for augmented reality tracking, biometric face recognition, and three-dimensional pose estimation. It provides distinct capabilities for identifying AR markers, tracking 3D object silhouettes, and performing neural network vulnerabili
Gluon-CV is an MXNet computer vision library that provides a comprehensive collection of pre-implemented vision architectures and training pipelines. It serves as a deep learning research toolkit and a model zoo containing state-of-the-art pre-trained weights for image and video analysis. The project includes a specialized human pose estimation library and a model compression toolkit. These tools allow for the pruning and quantization of deep learning models to increase inference speed and facilitate deployment on constrained edge hardware. The library covers a broad range of vision capabili
This project is a PyTorch implementation of a research architecture designed for high-resolution representation learning. It serves as a computer vision framework focused on precise keypoint detection, human pose estimation, and semantic image segmentation. The implementation provides specialized tools for identifying anatomical landmarks on the human body and predicting facial keypoint coordinates to analyze orientation and alignment. It utilizes a system of multi-resolution parallel streams and repeated multi-scale fusion to maintain high-resolution representations throughout the network.
This project is a QML desktop shell designed for desktop environment orchestration and interface customization. It functions as a system status dashboard and a declarative user interface for managing system hardware, window metadata, and user sessions. The shell features a dynamic theme generator that extracts dominant colors from wallpapers to automatically synchronize the global visual color palette. It utilizes an inter-process communication system to orchestrate shell functions and a hierarchical JSON configuration framework to manage global and per-monitor interface layouts. The system
This project is a modular image manipulation framework and processing pipeline library designed for Ruby applications. It provides a programmable interface for executing complex image transformations, including resizing, cropping, rotating, and compositing, while managing file input and output parameters. The library distinguishes itself through a fluent, chainable interface that allows developers to construct sophisticated processing workflows by linking modular operations together. It functions as an image transformation engine that delegates heavy lifting to external native binary tools, e
This project is a modular PyTorch framework for training and evaluating object detection and instance segmentation models. It serves as a computer vision research tool and a deep learning inference engine designed to identify object locations, classes, and pixel-level masks within images. The framework implements a two-stage inference pipeline that utilizes region proposal networks and a symmetric mask-head architecture. It provides specialized capabilities for instance segmentation, object bounding box detection, and human pose estimation via anatomical keypoint detection. The system includ
UGATIT is an unsupervised generative adversarial network and image-to-image translation model implemented in TensorFlow. It serves as the official research implementation of an ICLR 2020 paper, providing a framework for converting images between different visual styles without requiring paired training examples. The system utilizes an unsupervised generative attentional network and attention maps to deform geometric shapes and modify textures during the translation process. It employs a cycle-consistent framework to ensure translation quality by requiring images to return to their original st
This project is a deep learning computer vision implementation focused on low-light image restoration. It uses a neural network to process raw sensor data, mapping underexposed images to well-exposed versions to improve visibility and restore natural colors. The implementation is based on CVPR 2018 research and utilizes TensorFlow to execute the computational graph. It employs a convolutional neural network and pixel-wise regression to reconstruct scene lighting directly from unprocessed raw image data. The project includes a framework for supervised pair learning, where models are trained u
Magick.NET is a C# image processing library that serves as a .NET wrapper for ImageMagick. It provides a raster graphics engine for rendering text, drawing graphics, and manipulating images using a native interface. The library handles the conversion of vector files, such as PDF, EPS, and PostScript, into raster formats. It also includes tools for extracting image metadata, such as EXIF data and raw thumbnails. The system covers a wide range of image manipulation capabilities, including resizing, format conversion, watermarking, and the merging of multiple images into static or animated file
TagSpaces is an offline-first file tagging and organization platform that lets you manage local files with portable metadata stored directly in filenames or sidecar JSON files, eliminating the need for a central database. It functions as a full-text file search engine, a Kanban board file organizer, a local AI file assistant, an S3-compatible cloud file manager, and a web clipper and bookmark manager, all within a single application. The project distinguishes itself through a local-first architecture where all file operations, indexing, and AI processing run entirely on the device, with cloud
This is an image segmentation framework and masking toolkit for constructing binary and multi-class neural network architectures. It serves as a deep learning encoder wrapper that integrates pre-trained convolutional neural network architectures into semantic segmentation models. The library enables the use of pre-trained backbones to isolate complex patterns and leverages transfer learning to accelerate training. It provides a collection of overlap-based loss functions and precision metrics specifically designed to evaluate and refine the accuracy of image masks. The toolkit covers the full
This project is a computer vision pipeline and volumetric rendering system used to transform photos and videos into high-fidelity 3D models. It implements a deformable neural radiance field framework that optimizes deformation fields to represent non-rigid moving subjects in three dimensions. The system utilizes volumetric deformation fields to map 3D coordinates from a static canonical space to a deformed state. This allows for the reconstruction of photorealistic scenes and the synthesis of high-fidelity images from camera perspectives not present in the original input data. The framework