30 open-source projects similar to systemerrorwang/white-box-cartoonization, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best White Box Cartoonization alternative.
This project is a PyTorch implementation of AnimeGANv2, a generative adversarial network and image-to-image translation model designed to transform real-world photographs into stylized anime imagery. The repository includes a model weight converter that enables the translation of checkpoints across different runtime environments. This utility performs weight key remapping and tensor dimension permutation to ensure compatibility between framework implementations. The system supports AI photo stylization through pre-trained weight loading and provides configurable upsampling alignment to maint
photo2cartoon is a vision-based software tool and training framework designed to convert real human portrait photographs into stylized cartoon images. It utilizes generative adversarial networks to translate images from a real-world domain to a cartoon style. The project includes a training framework for these models that supports paired-data supervision and multi-GPU distributed training. It employs identity-preserving loss functions to ensure that the resulting cartoon outputs retain the original facial features of the subject. The system incorporates a full preprocessing pipeline that han
AnimeGANv2 is a generative adversarial network training framework and image stylization tool designed to convert real-world photographs and videos into anime-style imagery. It functions as an anime style generator that transforms real-world scenes into animation through supervised style transfer. The project provides a system for training style models and extracting specific generator weight parameters from deep learning checkpoints to create lightweight models for inference. It focuses on landscape image stylization and the ability to mimic specific artistic styles from provided datasets. T
This repository is a deep learning educational resource and a neural network project suite. It provides a collection of practical TensorFlow implementations and coding projects designed to demonstrate the application of various neural network architectures to real-world data. The project includes specific samples for generative adversarial networks, focusing on synthetic image generation and style translation. It also provides examples of deep learning model construction across different learning paradigms. The codebase covers a broad range of capabilities, including computer vision for imag
This PyTorch-based image super-resolution tool provides a deep learning pipeline for upscaling low-resolution images. It utilizes generative adversarial networks to increase pixel density and reconstruct high-resolution image details. The system includes a GAN-based image upscaler and a training pipeline that optimizes neural network weights using paired datasets and custom loss functions. To manage hardware resources, a patch-based image processor splits high-resolution files into smaller segments to prevent memory allocation errors and system crashes. Additional capabilities include the ap
Tensorpack is a high-level TensorFlow neural network framework and research library designed for building and training deep learning models. It provides a collection of reproducible neural network architectures for computer vision, generative tasks, reinforcement learning, and natural language processing. The project distinguishes itself through a specialized deep learning data pipeline that uses pure Python for parallel data loading and streaming. It includes a multi-GPU training orchestrator for distributing workloads via data-parallel strategies and a dedicated interpretability toolkit for
AnimeGAN is a generative adversarial network and image translator developed with TensorFlow. It is designed for photo-to-anime style transfer, utilizing a deep learning system to transform real-world photographs and video frames into anime-style imagery. The system includes a video-to-anime converter that applies consistent visual transformations across sequential frames. It supports both the training of generative networks on artistic datasets to replicate specific styles and the extraction of generator weights from checkpoints for efficient inference. The project provides utilities for ima
ESRGAN is a deep learning image restoration framework designed for image super-resolution. It uses a generative adversarial network system to upscale low-resolution images into high-quality versions with sharp visual details and recovered fine textures. The framework implements a perceptual super-resolution model that optimizes the trade-off between perceived visual quality and pixel-level signal-to-noise ratio. It includes weight-interpolation blending to allow for the adjustment of visual sharpness and signal-to-noise ratios by mixing weights from different trained models. The system cover
This is a library of generative model architectures built using the TensorFlow framework. It provides implementations for producing synthetic data and realistic images, specifically focusing on Variational Autoencoders and various Generative Adversarial Network variants. The collection includes specific GAN architectures such as WGAN-GP, LSGAN, InfoGAN, and EBGAN. It also features Variational Autoencoders designed to learn latent representations and synthesize new samples from learned distributions. The project covers image processing pipelines for normalizing and cropping data, as well as a
Srez is a deep learning image super-resolution framework designed to upscale low-resolution images into sharp, high-resolution visual features. It functions as a neural network training tool that employs generative adversarial networks to synthesize realistic image details. The project includes a model evolution visualizer that generates animations and image batches to track visual improvements during the training process. It utilizes a combination of adversarial and L1 loss functions to optimize model weights and supports periodic state checkpointing for recovery and deployment. The system
PaddleGAN is a generative AI framework and deep learning computer vision library built on the PaddlePaddle framework. It serves as a toolkit for image and video synthesis, providing a collection of generative adversarial network implementations for creating synthetic visual content. The library focuses on advanced synthesis capabilities, including the generation of talking heads through lip motion synchronization and the creation of synthetic videos via motion transfer from driving sequences. It provides tools for domain-to-domain translation, allowing for image style transfer and the transfo
This project is a TensorFlow implementation of an image-to-image translation framework based on conditional generative adversarial networks. It provides the tools to train models that map input images to output images based on learned visual patterns, as well as a server for processing image translation requests and serving trained model checkpoints to web clients. The framework includes a system for converting trained model weights into a portable format for browser-based inference. It also features a validation process that generates comparative reports by analyzing input, output, and targe
This project is a collection of TensorFlow 2.x machine learning tutorials and practical code examples. It serves as a deep learning implementation guide for constructing diverse neural network architectures, including convolutional, recurrent, and generative networks. The repository provides templates and examples for several specialized domains, including computer vision for image classification and object detection, natural language processing for text generation and language understanding, and generative AI for synthesizing data using adversarial networks and autoencoders. It also includes
This project is a machine learning educational repository providing a collection of implementations and guides for machine learning and deep learning algorithms. It serves as a deep learning model library and a reference for training workflows, covering foundational machine learning, convolutional, recurrent, and transformer architectures. The collection includes a generative adversarial network suite for synthesizing realistic images and performing image-to-image translation. It also functions as a computer vision implementation guide for object detection and semantic segmentation, alongside
RestorePhotos is an AI face restoration tool and deep learning image upscaler designed to remove blur and reconstruct lost details in degraded facial photographs. It functions as a face photo enhancer and a generative adversarial network image processor that transforms low-quality pixels into high-resolution facial features. The system utilizes a GPU-accelerated inference engine to run machine learning models for real-time image restoration. This hardware acceleration supports the heavy matrix multiplications and tensor-based operations required to sharpen facial images and improve visual fid
This project is a generative adversarial network implementation and research framework. It provides the tools and hyperparameters necessary to train and evaluate generative models across various datasets, specifically designed to reproduce results from academic research. The framework includes a Parzen density likelihood estimator to calculate model log likelihood. This allows for the quantitative evaluation of generative distributions and the measurement of overall model performance. The codebase covers machine learning research capabilities, focusing on the training of adversarial networks
TecoGAN is a generative adversarial network designed for video super-resolution. It functions as a spatio-temporal video upscaler that increases the resolution of video sequences while reconstructing high-quality imagery from lower-resolution inputs. The system utilizes a temporal coherence framework to ensure visual stability and reduce flickering in generated frames. It achieves this by employing spatio-temporal discriminators that evaluate both individual frame quality and movement consistency. The project covers the training and optimization of generative adversarial networks, specifical
DragGAN is an interactive generative image editor and manipulator that allows users to reshape visual output by moving handle points on a generative network manifold. It functions as a tool for point-based image editing, mapping user-defined coordinate shifts to a generative model's latent space to deform images. The system includes a generative image inversion tool that converts real photographs into latent representations. This process enables the interactive manipulation of non-generated content by bringing real-world images into a compatible format for the generative adversarial network.
This is a generative AI model library containing a collection of PyTorch and TensorFlow implementations for creating synthetic data and modeling complex probability distributions. It serves as a multi-framework repository of deep learning models designed for learning and replicating data patterns. The project provides specialized implementation suites for several generative architectures. This includes Generative Adversarial Networks using competing generator and discriminator models, Variational Autoencoder frameworks that map data to a latent space, and Restricted Boltzmann Machine and Deep
This project provides Rust bindings for the TensorFlow C API, serving as a tensor computation interface and machine learning library. It enables the construction and execution of machine learning models and neural networks by bridging a systems language to high-performance backends. The framework supports GPU-accelerated computing to increase the speed of model training and inference by offloading mathematical operations to graphics processing units. It offers both graph-based computation for defining static network architectures and an eager execution mode for immediate operation calls durin
This is an open-source research repository providing a collection of machine learning implementations designed to reproduce results from published academic papers. It serves as a public archive of code and datasets used to validate scientific claims within the field of artificial intelligence. The repository contains neural network code implemented using both JAX and PyTorch to support scalable research and experimentation. The codebase covers a range of research and development activities, including the implementation of specific AI models, the validation of deep learning benchmarks, and th
This project is an AI research implementation library and machine learning research repository. It provides a collection of reference code, illustrative implementations, and open-source research datasets used to verify hypotheses and build upon existing models in artificial intelligence. The repository focuses on scientific research reproduction by translating theoretical findings from published papers into executable code. It includes specialized scientific simulation environments designed to test the behavior of autonomous agents and models within controlled settings. The project covers AI
This is a quantitative finance library built on TensorFlow for financial engineering, asset pricing, and risk management. It serves as a financial derivative pricing engine, a model calibration tool, and a hardware-accelerated math library for numerical tasks. The library provides specialized capabilities for pricing financial assets using standard models and American option logic, as well as calibrating pricing models to market data through local volatility. It includes tools for constructing yield curves via bootstrapping algorithms and monotone convex interpolation. The framework covers a
This project is a collection of deep learning research implementations and a reproduction kit designed to translate theoretical AI papers into working code. It provides a library of neural network architectures and reference implementations for reproducing seminal research concepts through interactive notebooks. The repository distinguishes itself through the implementation of AI theory and scaling laws, covering complexity dynamics, information theory, and the simulation of universal AI agents. It also includes a benchmarking suite for synthetic reasoning, allowing for the evaluation of mode
This project is a deep learning library built for single-image super-resolution and visual enhancement. It provides a framework for training and deploying neural network architectures designed to reconstruct high-resolution images from low-resolution sources, effectively recovering fine details and removing artifacts caused by downscaling or compression. The library distinguishes itself through the implementation of generative adversarial networks and residual block architectures, which work together to improve the realism and clarity of upscaled outputs. It supports training through both pix
VITS-fast-fine-tuning is a pipeline for adapting speech synthesis models to specific target voices using small audio datasets. It functions as a fast speaker adaptation tool and a multilingual speech synthesizer capable of generating spoken audio across different languages. The system provides a framework for many-to-many voice conversion, transforming the identity of one speaker into another while preserving the original linguistic content. It allows for the adaptation of a voice for text-to-speech by fine-tuning a pre-trained model with audio clips or video sources. The project covers end-
This project is an end-to-end text-to-speech engine and deep learning voice synthesizer. It functions as a neural speech synthesis framework that converts written text directly into audio waveforms using a single neural network. The system implements an adversarial framework and a conditional variational autoencoder to generate high-fidelity artificial speech. It utilizes a generative adversarial network to ensure synthesized audio is indistinguishable from real human speech. The toolkit provides capabilities for neural speech synthesis, text-to-audio generation, and the training of custom v
This repository is a collection of guided tutorials for building and training machine learning models using the TensorFlow framework. It provides practical walkthroughs and examples for implementing a variety of model architectures to solve data prediction and analysis problems. The guides cover the construction of feedforward, convolutional, and recurrent neural networks to analyze complex data patterns. It includes specific tutorials for unsupervised learning, such as denoising autoencoders and word-to-vec embeddings, as well as examples for training generative adversarial networks to synth
graph_nets is a graph-structured deep learning framework and library for building message-passing neural networks. It provides tools for designing architectures that operate on nodes and edges to process and reason about data structured as graphs using TensorFlow. The framework implements a message-passing paradigm for iterative information exchange between nodes. This approach enables the development of models that can reason about complex graph-structured inputs for tasks such as path-finding and sorting, or serve as a predictor for the future states and trajectories of physical systems.
This is a TensorFlow implementation of the Deep Convolutional Generative Adversarial Network (DCGAN) architecture, providing a framework for training generative models that produce synthetic images from random noise vectors. The project implements the core DCGAN design, using transposed convolutions for upsampling, batch normalization for training stability, and leaky ReLU activations in the discriminator, all executed as static TensorFlow computation graphs. The implementation supports training on custom image datasets by accepting user-supplied image folders without requiring a predefined f