30 open-source projects similar to thunil/tecogan, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best TecoGAN alternative.
ComfyUI-SeedVR2_VideoUpscaler is an AI video upscaling tool that uses diffusion models to increase the resolution of videos and images while maintaining visual consistency across frames. The project implements distributed video rendering by splitting datasets into chunks for parallel processing across multiple GPUs. It utilizes model compilation and specialized attention backends to reduce inference latency and increase throughput. Additional capabilities include video color correction using wavelet and LAB matching methods to preserve color fidelity. Hardware memory is managed through block
This project is a machine learning educational repository providing a collection of implementations and guides for machine learning and deep learning algorithms. It serves as a deep learning model library and a reference for training workflows, covering foundational machine learning, convolutional, recurrent, and transformer architectures. The collection includes a generative adversarial network suite for synthesizing realistic images and performing image-to-image translation. It also functions as a computer vision implementation guide for object detection and semantic segmentation, alongside
This is a generative AI model library containing a collection of PyTorch and TensorFlow implementations for creating synthetic data and modeling complex probability distributions. It serves as a multi-framework repository of deep learning models designed for learning and replicating data patterns. The project provides specialized implementation suites for several generative architectures. This includes Generative Adversarial Networks using competing generator and discriminator models, Variational Autoencoder frameworks that map data to a latent space, and Restricted Boltzmann Machine and Deep
This project is a generative adversarial network implementation and research framework. It provides the tools and hyperparameters necessary to train and evaluate generative models across various datasets, specifically designed to reproduce results from academic research. The framework includes a Parzen density likelihood estimator to calculate model log likelihood. This allows for the quantitative evaluation of generative distributions and the measurement of overall model performance. The codebase covers machine learning research capabilities, focusing on the training of adversarial networks
PaddleGAN is a generative AI framework and deep learning computer vision library built on the PaddlePaddle framework. It serves as a toolkit for image and video synthesis, providing a collection of generative adversarial network implementations for creating synthetic visual content. The library focuses on advanced synthesis capabilities, including the generation of talking heads through lip motion synchronization and the creation of synthetic videos via motion transfer from driving sequences. It provides tools for domain-to-domain translation, allowing for image style transfer and the transfo
This PyTorch-based image super-resolution tool provides a deep learning pipeline for upscaling low-resolution images. It utilizes generative adversarial networks to increase pixel density and reconstruct high-resolution image details. The system includes a GAN-based image upscaler and a training pipeline that optimizes neural network weights using paired datasets and custom loss functions. To manage hardware resources, a patch-based image processor splits high-resolution files into smaller segments to prevent memory allocation errors and system crashes. Additional capabilities include the ap
Srez is a deep learning image super-resolution framework designed to upscale low-resolution images into sharp, high-resolution visual features. It functions as a neural network training tool that employs generative adversarial networks to synthesize realistic image details. The project includes a model evolution visualizer that generates animations and image batches to track visual improvements during the training process. It utilizes a combination of adversarial and L1 loss functions to optimize model weights and supports periodic state checkpointing for recovery and deployment. The system
mmagic is a multimodal training pipeline and framework for generative AI, focusing on visual synthesis and restoration. It provides the infrastructure to build and train models for tasks such as text-to-image and text-to-video generation, 3D-aware content synthesis, and high-fidelity image translation using diffusion models and generative adversarial networks. The project distinguishes itself through specialized capabilities for generative model personalization, including techniques for fine-tuning subjects and styles. It also supports advanced visual manipulations such as latent space interp
This project is a generative adversarial network designed for image animation and motion transfer. It functions as a computer vision framework that synthesizes video sequences by applying motion patterns extracted from a driving video onto a static source image. The model distinguishes itself by using a keypoint-based representation to decouple object appearance from temporal movement. By tracking structural deformations through learned latent coordinates, it performs motion retargeting and synthetic media production without requiring manual annotations or object-specific training data. The
Fastai is a high-level deep learning library built on PyTorch that provides a unified interface for managing the entire machine learning lifecycle. It functions as a comprehensive training toolkit, abstracting hardware management and automating complex training loops to simplify the construction and execution of neural network models. The framework is distinguished by its notebook-centric development environment and a type-dispatching data pipeline that automatically applies transformations based on input data formats. It emphasizes transfer learning through discriminative layer-wise optimiza
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
Waifu2x-Extension-GUI is a desktop application designed for high-fidelity media restoration and enhancement. It functions as a graphical interface that orchestrates specialized deep learning engines to upscale, denoise, and interpolate images and videos, improving visual clarity and motion smoothness. The software distinguishes itself through its ability to manage complex, automated media processing pipelines. Users can chain multiple tasks—such as format conversion, scene detection, and frame rate interpolation—into sequential workflows that execute without manual intervention. It provides g
Video2x is a modular processing framework designed for AI-enhanced video upscaling and frame rate conversion. It functions as a comprehensive toolset for increasing the resolution and visual clarity of media files while generating intermediate frames to improve motion smoothness. The system is built to handle intensive media transformation tasks by leveraging hardware acceleration and custom encoding pipelines. The project distinguishes itself through a plugin-based architecture that allows for the integration of custom machine learning models and specialized algorithms. It utilizes a modular
Awesome-GANs is a curated resource list and research repository focused on the development and evaluation of generative adversarial networks. It serves as a structured index for academic literature and open-source implementations dedicated to the creation of synthetic data generators. The project provides a framework for training competing neural networks to produce outputs that mimic the statistical properties of original datasets. It emphasizes the use of configuration-driven pipelines to manage model hyperparameters and dataset paths, facilitating reproducible research workflows and standa
PyTorch-GAN is a research-oriented framework providing a collection of modular implementations for generative adversarial network architectures. It serves as a toolkit for training and evaluating models that utilize adversarial minimax optimization to produce synthetic data, offering a structured environment for exploring complex generative tasks within the PyTorch ecosystem. The library distinguishes itself through a comprehensive suite of image synthesis and manipulation capabilities, including super-resolution, inpainting, and cross-domain style translation. It supports advanced training m
This is a library of generative model architectures built using the TensorFlow framework. It provides implementations for producing synthetic data and realistic images, specifically focusing on Variational Autoencoders and various Generative Adversarial Network variants. The collection includes specific GAN architectures such as WGAN-GP, LSGAN, InfoGAN, and EBGAN. It also features Variational Autoencoders designed to learn latent representations and synthesize new samples from learned distributions. The project covers image processing pipelines for normalizing and cropping data, as well as a
AnimeGANv2 is a generative adversarial network training framework and image stylization tool designed to convert real-world photographs and videos into anime-style imagery. It functions as an anime style generator that transforms real-world scenes into animation through supervised style transfer. The project provides a system for training style models and extracting specific generator weight parameters from deep learning checkpoints to create lightweight models for inference. It focuses on landscape image stylization and the ability to mimic specific artistic styles from provided datasets. T
This project is a PyTorch-based generative framework and implementation template for building Generative Adversarial Networks. It provides a collection of foundational toolkits and architectural patterns designed to synthesize high-quality artificial data while focusing on the stability of adversarial neural networks. The framework distinguishes itself through a specialized toolkit for conditional image generation, which integrates discrete labels and auxiliary classification into the training process. It utilizes specific mechanisms to guide the generative process toward target classes by co
ESRGAN is a deep learning image restoration framework designed for image super-resolution. It uses a generative adversarial network system to upscale low-resolution images into high-quality versions with sharp visual details and recovered fine textures. The framework implements a perceptual super-resolution model that optimizes the trade-off between perceived visual quality and pixel-level signal-to-noise ratio. It includes weight-interpolation blending to allow for the adjustment of visual sharpness and signal-to-noise ratios by mixing weights from different trained models. The system cover
Keras-GAN is a collection of generative adversarial network implementations built with Keras for synthetic data generation and image manipulation. It provides frameworks for image-to-image translation, image inpainting, and neural image super-resolution. The library includes tools for learning disentangled latent space representations to control specific attributes of synthetic outputs. It also features capabilities for image domain translation using paired or unpaired data and the ability to fill corrupted or missing image parts by analyzing surrounding visual context. The project covers ge
This project is a deep learning framework designed for training and deploying image-to-image translation models. It serves as a research platform for experimenting with neural network architectures that transform visual content between distinct stylistic domains, supporting both paired and unpaired training data. The framework distinguishes itself through its support for cycle-consistency constraints, which allow for image translation between domains without requiring corresponding paired examples. It provides a structured pipeline that utilizes adversarial loss optimization, where generator
This project is a collection of interactive graphical tools designed for monitoring neural network training, latent space mappings, and the internal mechanisms of transformers. It functions as a visual learning environment for understanding how large language models process tokens and an educational tool for analyzing the interactions between generators and discriminators within adversarial networks. The system provides a browser-based transformer architecture visualizer to show the mathematical operations used for token prediction in real time. It also includes a generative adversarial netwo
StarGAN is a PyTorch image-to-image translation framework designed to synthesize visual styles and attributes across multiple domains. It implements a generative adversarial network that serves as a deep learning image translator for modifying specific visual characteristics within an image dataset. The framework uses a single unified model to handle translations between multiple image domains rather than requiring separate pairs of models. It is a research implementation that learns mappings between different image attributes without the need for paired training data. The project covers the
This repository is a deep learning educational resource and a neural network project suite. It provides a collection of practical TensorFlow implementations and coding projects designed to demonstrate the application of various neural network architectures to real-world data. The project includes specific samples for generative adversarial networks, focusing on synthetic image generation and style translation. It also provides examples of deep learning model construction across different learning paradigms. The codebase covers a broad range of capabilities, including computer vision for imag
This project is a TensorFlow implementation of an image-to-image translation framework based on conditional generative adversarial networks. It provides the tools to train models that map input images to output images based on learned visual patterns, as well as a server for processing image translation requests and serving trained model checkpoints to web clients. The framework includes a system for converting trained model weights into a portable format for browser-based inference. It also features a validation process that generates comparative reports by analyzing input, output, and targe
StyleGAN2 is a TensorFlow generative adversarial network and image synthesis model designed to produce high-resolution synthetic visual content. It functions as a deep learning architecture that learns patterns from image datasets to synthesize new images. The project includes a latent space projection tool for mapping existing images to latent vectors to analyze their representation within a generative model. It also provides an image quality evaluation framework to measure the visual fidelity and diversity of synthetic outputs. The system covers the full generative pipeline, including imag
RestorePhotos is an AI face restoration tool and deep learning image upscaler designed to remove blur and reconstruct lost details in degraded facial photographs. It functions as a face photo enhancer and a generative adversarial network image processor that transforms low-quality pixels into high-resolution facial features. The system utilizes a GPU-accelerated inference engine to run machine learning models for real-time image restoration. This hardware acceleration supports the heavy matrix multiplications and tensor-based operations required to sharpen facial images and improve visual fid
StyleGAN3 is a PyTorch implementation of a generative adversarial network designed for high-fidelity image synthesis. It functions as an image synthesis model and a deep learning research tool used to train and deploy networks that generate realistic synthetic imagery from custom datasets. The project is specifically an alias-free generative model, utilizing an architecture that eliminates jagged artifacts to produce smooth translational and rotational image sequences. This enables the creation of alias-free videos and the generation of high-resolution photos without visual distortions. The
This project is a collection of TensorFlow 2.x machine learning tutorials and practical code examples. It serves as a deep learning implementation guide for constructing diverse neural network architectures, including convolutional, recurrent, and generative networks. The repository provides templates and examples for several specialized domains, including computer vision for image classification and object detection, natural language processing for text generation and language understanding, and generative AI for synthesizing data using adversarial networks and autoencoders. It also includes