30 open-source projects similar to jantic/deoldify, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best DeOldify alternative.
This project is a comprehensive computer vision library for the PyTorch ecosystem, providing a standardized collection of neural network architectures, datasets, and high-performance transformation utilities. It serves as a foundational framework for building, training, and deploying deep learning models, offering a centralized model registry that allows developers to instantiate architectures with pre-trained weights for tasks such as image classification, object detection, and semantic segmentation. The library distinguishes itself through its modular approach to data and compute management
Yolact is a computer vision framework and real-time instance segmentation model. It utilizes a fully convolutional neural network to detect objects and generate pixel-level masks for images and video feeds. The system employs prototypical mask generation to create global mask prototypes that are linearly combined for instance-specific results. It incorporates deformable convolutional layers and deformable region-of-interest pooling to adapt spatial sampling to the irregular shapes of objects. The framework covers the full model development lifecycle, including training on custom datasets, ac
Style2paints is a deep learning image processor designed for the automated colorization of grayscale line art. It functions as a generative style transfer engine that maps artistic color palettes and textures onto monochrome sketches, allowing users to transform black and white drawings into finished illustrations through neural network inference. The system distinguishes itself by incorporating user-provided color guidance and style references to influence the final output. It utilizes coordinate-mapped color points and hint-driven optimization to ensure that specific colors are applied prec
This project is an AI upscaling framework and deep learning image restorer designed to estimate original source pixels from low-resolution inputs. It functions as a super-resolution reconstruction system that transforms pixelated images into high-resolution versions by restoring high-frequency details and sharpening edges. The system utilizes a convolutional neural network pipeline to analyze pixel data and perform digital image restoration. It employs pixel-shuffle upsampling to rearrange channel dimensions into spatial dimensions, which increases resolution while reducing checkerboard artif
This project is a collection of educational Jupyter Notebooks providing tutorials on neural network construction and tensor operations using the TensorFlow framework. It serves as a machine learning educational repository and implementation guide for deep learning students. The suite focuses on specific advanced architectures, including convolutional networks for image classification, residual networks with skip connections for training stability, and variational autoencoders for generative modeling and data synthesis. It also includes guides for building denoising and deep autoencoders to pe
This project is a machine learning educational repository providing a collection of implementations and guides for machine learning and deep learning algorithms. It serves as a deep learning model library and a reference for training workflows, covering foundational machine learning, convolutional, recurrent, and transformer architectures. The collection includes a generative adversarial network suite for synthesizing realistic images and performing image-to-image translation. It also functions as a computer vision implementation guide for object detection and semantic segmentation, alongside
This project is a collection of educational examples and code for implementing deep learning architectures using the PyTorch framework. It serves as a tutorial and implementation guide for building various neural network architectures for machine learning tasks. The project provides practical implementations for computer vision, including image classification and neural style transfer, as well as natural language processing examples for building sequence models and language predictors. It also covers generative models using adversarial and variational networks to synthesize or transform visua
StyleGAN is a TensorFlow-based generative adversarial network framework designed for the synthesis of high-resolution synthetic imagery. It utilizes a style-based generator architecture to create realistic visual assets from latent vectors, focusing on the production of high-fidelity images. The system incorporates style mixing and stochastic noise injection to control visual attributes and fine-grained details. It uses adaptive instance normalization and progressive resolution upsampling to manage image quality and variety across different resolutions. The framework covers the full lifecycl
This project is a library of pretrained computer vision architectures and backbones for image classification and feature extraction. It serves as a comprehensive model zoo and collection of standardized image encoders, including ResNet, Vision Transformers, and EfficientNet, for use in visual analysis and as backbones for object detection and image segmentation. The library provides a framework for distributed training and evaluation of image models using advanced data augmentation and optimization scripts. It includes a dedicated toolset for converting trained PyTorch vision models into the
PyTorch-GAN is a research-oriented framework providing a collection of modular implementations for generative adversarial network architectures. It serves as a toolkit for training and evaluating models that utilize adversarial minimax optimization to produce synthetic data, offering a structured environment for exploring complex generative tasks within the PyTorch ecosystem. The library distinguishes itself through a comprehensive suite of image synthesis and manipulation capabilities, including super-resolution, inpainting, and cross-domain style translation. It supports advanced training m
This project is a comprehensive library of state-of-the-art neural network architectures designed for image classification and feature extraction. It provides a complete deep learning training framework that supports distributed execution, allowing users to build, train, and fine-tune vision models using optimized schedulers and pre-configured training recipes. The library distinguishes itself through a modular backbone architecture that treats neural networks as decoupled feature extractors, enabling the retrieval of multi-scale outputs for downstream tasks like object detection and segmenta
This project is a generative adversarial network designed for image animation and motion transfer. It functions as a computer vision framework that synthesizes video sequences by applying motion patterns extracted from a driving video onto a static source image. The model distinguishes itself by using a keypoint-based representation to decouple object appearance from temporal movement. By tracking structural deformations through learned latent coordinates, it performs motion retargeting and synthetic media production without requiring manual annotations or object-specific training data. The
This project is a comprehensive machine learning educational resource and tutorial series delivered as a collection of interactive Jupyter Notebooks. It provides practical Python implementations for the end-to-end machine learning lifecycle, covering supervised and unsupervised learning, deep learning, and reinforcement learning. The resource distinguishes itself by providing detailed implementation guides for complex architectures, including transformers, generative adversarial networks, and convolutional neural networks. It also features specialized courseware for developing reinforcement l
Paper2gui is a multi-modal AI toolkit and model GUI wrapper designed to deploy and run various artificial intelligence models through a visual interface. Its primary purpose is to provide a way to execute complex AI research papers and models without requiring manual software installation or coding. The project distinguishes itself by using a wrapper-based model interface that abstracts command line arguments into visual input fields, utilizing template-driven UI generation to create parameter sliders and forms based on the specific requirements of the underlying model. It includes a centrali
This is a PyTorch implementation of EfficientNet convolutional neural networks. It serves as a computer vision model library providing architectures for image classification and high-level feature extraction, including pre-trained weights for immediate image categorization. The library supports transfer learning by allowing the modification of model architectures and output layers to accommodate a custom number of classes for new datasets. It also includes a model exporter to convert trained PyTorch weights into the ONNX format for production inference. The system covers broader computer vis
This project is a TensorFlow-based neural style transfer framework designed to apply the artistic textures and colors of a painting to images and videos. It utilizes a feed-forward image stylizer that transforms visual appearance in a single pass, avoiding the need for iterative optimization. The system includes a deep learning training pipeline that teaches convolutional neural networks to replicate specific styles using perceptual loss functions. It also features a video frame processor that decomposes video files into individual images for sequential stylization and reassembly. The softwa
This project is a PyTorch implementation of a research architecture designed for high-resolution representation learning. It serves as a computer vision framework focused on precise keypoint detection, human pose estimation, and semantic image segmentation. The implementation provides specialized tools for identifying anatomical landmarks on the human body and predicting facial keypoint coordinates to analyze orientation and alignment. It utilizes a system of multi-resolution parallel streams and repeated multi-scale fusion to maintain high-resolution representations throughout the network.
YOLOv9 is a real-time computer vision framework and deep learning model designed for image classification, object detection, and instance segmentation. It functions as both a vision model and a trainer, allowing for the optimization of neural network weights on custom datasets using single or multiple GPUs. The framework utilizes programmable gradient information to perform high-speed identification and location of multiple objects within images and video streams. It extends beyond bounding box detection to provide instance segmentation and panoptic segmentation, which labels every pixel in a
Neural Doodle is a collection of neural network tools designed for image upscaling, texture synthesis, and semantic-guided style transfer between visual inputs. It provides a semantic style transfer engine and an example-based image upscaler that increase image resolution by referencing visual details from a target style example. The project includes a neural texture synthesizer for creating seamless bitmap textures and repeating patterns from a single input style image. It also functions as an image generation tool capable of transforming simple sketches and photos into detailed artwork. Th
StyleGAN2 is a TensorFlow generative adversarial network and image synthesis model designed to produce high-resolution synthetic visual content. It functions as a deep learning architecture that learns patterns from image datasets to synthesize new images. The project includes a latent space projection tool for mapping existing images to latent vectors to analyze their representation within a generative model. It also provides an image quality evaluation framework to measure the visual fidelity and diversity of synthetic outputs. The system covers the full generative pipeline, including imag
Code release for ConvNeXt model
This project provides a deep residual network framework and pre-trained PyTorch models designed for high-accuracy image recognition. It implements a neural network architecture that utilizes skip connections to enable the training of very deep models without gradient degradation. The system is designed for computer vision tasks, including image classification, object detection, and visual data segmentation. It includes weights trained on ImageNet to support transfer learning and the fine-tuning of models on custom image datasets. The architectural design focuses on residual learning blocks,
Faceai is a computer vision toolkit designed for facial analysis, identity recognition, and image processing. It provides integrated engines for detecting human faces in static images and live video streams, matching facial encodings against identity databases, and mapping facial landmarks to understand geometric structure and alignment. The project enables real-time augmented reality applications, such as applying virtual makeup and digital accessories by scaling assets to detected facial coordinates. It also includes a suite for digital image restoration capable of removing noise, erasing w
Openface is a deep learning toolkit designed for facial recognition and identity verification. It provides a comprehensive pipeline for detecting faces, aligning landmarks, and transforming facial images into compact numerical vectors. By utilizing these embeddings, the system enables identity classification and similarity comparison through geometric distance calculations. The project distinguishes itself by integrating research-oriented diagnostic tools alongside its core recognition capabilities. It includes utilities for visualizing high-dimensional feature clusters, inspecting internal c
CodeFormer is a deep learning framework designed for the restoration and enhancement of facial images and video sequences. It functions as a comprehensive processing engine capable of reconstructing high-quality facial features from degraded, blurry, or damaged inputs, while also providing tools for image upscaling and generative inpainting to fill missing or corrupted regions. The system distinguishes itself by utilizing a codebook-based quantization approach that maps input patches to high-quality facial representations, supported by transformer-based global modeling to ensure structural co
Albumentations is a computer vision image augmentation library designed to increase training data diversity for deep learning models. It provides a toolset for applying geometric and color transformations to images and annotations, including a specialized collection of 3D operations for volumetric data used in medical and scientific imaging. The library functions as an image mask and bounding box transformer, automatically updating masks, bounding boxes, and keypoints when images undergo geometric changes. This ensures that spatial alterations remain synchronized across images and their assoc
pix2pixHD is a conditional generative adversarial network designed to transform semantic label maps into high-resolution photorealistic images. It functions as a high-resolution image synthesizer and an image-to-image translation model capable of producing synthetic images at 2048x1024 resolution. The system includes a semantic image editor that allows for the modification of high-resolution visuals by updating the underlying semantic label maps. This enables interactive image editing and the generation of photorealistic images based on source images or discrete label maps. The framework pro
OpenPose is a real-time pose estimation engine designed to detect and track human body, face, hand, and foot landmarks. It functions as a multi-person motion tracker, identifying the spatial coordinates of multiple individuals simultaneously within video streams or static images. Beyond two-dimensional detection, the software acts as a three-dimensional kinematics processor, reconstructing spatial movement data from single or multiple synchronized camera perspectives. The system distinguishes itself through a bottom-up approach that utilizes part-affinity fields to associate body parts across
pytorch implementation of fast-neural-style