30 open-source projects similar to jezen/is-thirteen, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Is Thirteen alternative.
This project is a PyTorch-based computer vision library and deep learning image processing framework. It provides a collection of neural network architectures designed for visual analysis tasks, specifically focusing on image classification, object detection, and semantic segmentation. The toolset implements diverse methodologies for visual recognition, including anchor-free object detection, regional proposal networks, and heatmap-based keypoint estimation. It utilizes both convolutional neural networks for spatial feature extraction and transformer-based self-attention mechanisms to compute
This project is a deep learning image restoration tool designed to remove scratches, fading, and noise from aged photographs and film. It utilizes generative adversarial networks for image translation, alongside specialized networks for face enhancement and video colorization. The system distinguishes itself through a combination of latent-space domain mapping and progressive face enhancement to recover blurred or missing high-frequency facial details. For video content, it employs a colorization framework that uses optical flow and temporal guidance to propagate color from selected keyframes
PaddleGAN is a generative AI framework and deep learning computer vision library built on the PaddlePaddle framework. It serves as a toolkit for image and video synthesis, providing a collection of generative adversarial network implementations for creating synthetic visual content. The library focuses on advanced synthesis capabilities, including the generation of talking heads through lip motion synchronization and the creation of synthetic videos via motion transfer from driving sequences. It provides tools for domain-to-domain translation, allowing for image style transfer and the transfo
ImageAI is a Python computer vision library providing a suite of tools for image classification, object detection, and video analytics. It functions as an integrated framework for locating and labeling objects in static images and video streams, utilizing deep learning models for identification and categorization. The project includes a model training toolkit that allows for the creation of custom classifiers and detectors through scratch training or transfer learning. It features a GPU-accelerated inference engine to increase processing speed for vision tasks and includes specialized utiliti
PaddleClas is a toolkit for image classification and recognition built on PaddlePaddle. It provides a suite of tools for training deep learning models and a framework for implementing visual search and retrieval systems. The project includes a computer vision model optimization suite and tools for cross-platform deployment. It enables the export of trained models to servers, mobile devices, and edge hardware to achieve high-performance inference across different programming languages. The toolkit covers model compression and optimization through pruning, quantization, and knowledge distillat
This project is a collection of deep learning tools for image classification and audio tagging, providing a repository of pre-trained model weights and architectures. It serves as a Keras model zoo that enables the immediate use of established neural networks for inference and transfer learning. The library includes a music tagging framework that classifies audio recordings using convolutional recurrent neural networks and mel-spectrograms. For visual data, it provides implementations of architectures such as ResNet, VGG, and Xception, alongside a repository of weights trained on large datase
This is a PyTorch implementation of EfficientNet convolutional neural networks. It serves as a computer vision model library providing architectures for image classification and high-level feature extraction, including pre-trained weights for immediate image categorization. The library supports transfer learning by allowing the modification of model architectures and output layers to accommodate a custom number of classes for new datasets. It also includes a model exporter to convert trained PyTorch weights into the ONNX format for production inference. The system covers broader computer vis
Awesome-Backbones is a modular deep learning framework designed for the end-to-end lifecycle of computer vision models. It provides an integrated platform for training, benchmarking, and deploying convolutional and transformer-based neural network architectures for image classification tasks. The framework distinguishes itself through a configuration-driven approach to model assembly, allowing users to define backbone, neck, and head components externally. It includes a specialized toolkit for model interpretability, utilizing gradient-based visualization techniques to generate class activati
This project is a PyTorch implementation of the Faster R-CNN architecture for object detection. It provides a framework for identifying multiple object classes and their corresponding bounding boxes within images using a deep learning system. The implementation includes a training pipeline for optimizing models on custom datasets and a utility for converting pretrained weights from external formats into a compatible structure for model initialization. The system covers a two-stage detection pipeline comprising a region proposal network and an ROI pooling layer. It incorporates multi-task los
NSFW detection on the client-side via TensorFlow.js
This project is a deep learning library built for single-image super-resolution and visual enhancement. It provides a framework for training and deploying neural network architectures designed to reconstruct high-resolution images from low-resolution sources, effectively recovering fine details and removing artifacts caused by downscaling or compression. The library distinguishes itself through the implementation of generative adversarial networks and residual block architectures, which work together to improve the realism and clarity of upscaled outputs. It supports training through both pix
This project is a collection of PyTorch learning resources and educational guides designed to teach the construction and training of neural networks. It serves as a comprehensive deep learning tutorial covering various model architectures and practical implementation strategies. The resources provide specific guidance on implementing computer vision tasks, such as image classification and synthetic imagery generation, as well as reinforcement learning agents using value networks and experience replay. It also covers sequential data modeling through recurrent networks and generative modeling u
This project is a neural network image classifier and a set of tools for building and training convolutional neural networks to recognize and categorize images. It serves as a machine learning educational guide, providing a practical resource for learning neural network fundamentals through an onboarding process. The system includes a dedicated workflow for pretrained model fine-tuning, allowing existing network weights to be adapted to new image categories. This is supported by a transfer learning pipeline that replaces final classification layers and adjusts weights through targeted retrain
ccv is a computer vision library written in C designed for high-performance visual analysis. It serves as a framework for image classification, object detection, and the identification of faces, pedestrians, and vehicles. The library distinguishes itself through hardware-accelerated vision and deep learning inference optimizations. It utilizes a quantized tensor processor to transform floating-point data into eight-bit integers and implements integer-quantized attention mechanisms to reduce memory bandwidth and increase data throughput. The project covers a broad range of capabilities, inclu
This is a comprehensive deep learning course delivered entirely through Jupyter Notebooks, designed to teach neural network construction using TensorFlow 2.x. The curriculum follows a sequential-model-first pedagogy, introducing the Sequential API before moving to functional and subclassing approaches, and covers the full spectrum of model building from regression and classification through convolutional neural networks, natural language processing, and time series forecasting. The course is structured around a checkpoint-based training workflow that saves the best model weights during traini
Gluon-CV is an MXNet computer vision library that provides a comprehensive collection of pre-implemented vision architectures and training pipelines. It serves as a deep learning research toolkit and a model zoo containing state-of-the-art pre-trained weights for image and video analysis. The project includes a specialized human pose estimation library and a model compression toolkit. These tools allow for the pruning and quantization of deep learning models to increase inference speed and facilitate deployment on constrained edge hardware. The library covers a broad range of vision capabili
geemap is a Python library and toolkit for interactive geospatial analysis, visualization, and satellite imagery analysis using Google Earth Engine data and cloud computing. It provides a mapping tool for displaying geospatial datasets within Jupyter notebooks and a suite of tools for classifying imagery and calculating zonal statistics. The project includes a utility to convert geospatial analysis scripts from JavaScript into Python code to facilitate data manipulation. It also enables the generation of timelapse animations and time-series visualizations from satellite imagery catalogs. The
This project provides a deep residual network framework and pre-trained PyTorch models designed for high-accuracy image recognition. It implements a neural network architecture that utilizes skip connections to enable the training of very deep models without gradient degradation. The system is designed for computer vision tasks, including image classification, object detection, and visual data segmentation. It includes weights trained on ImageNet to support transfer learning and the fine-tuning of models on custom image datasets. The architectural design focuses on residual learning blocks,
This project is a collection of educational resources and implementation frameworks providing deep learning model recipes, code samples, and step-by-step guides for computer vision tasks. It organizes complex workflows into modular recipes and implementation guides to facilitate the building of image and video analysis models. The framework focuses on specialized vision capabilities, including an image similarity framework for fast retrieval and re-ranking, human pose estimation, and video action recognition. It also provides specific tools for crowd density estimation and document image clea
This project is a pretrained model library for PyTorch, providing a collection of convolutional neural network architectures and weights. It serves as a computer vision model zoo for image classification and feature extraction, offering a framework for transfer learning where pretrained networks are adapted for custom image recognition tasks. The library focuses on transforming images into high-level numerical representations and calculating class probability scores. It includes utilities for downloading and initializing standard architectures such as ResNet, Inception, and Xception. Capabil
This project is a collection of TensorFlow machine learning examples providing reference implementations for various neural network paradigms. It covers supervised, unsupervised, reinforcement, and sequential learning models. The repository includes implementations for convolutional neural networks focused on image classification and ranking, as well as recurrent neural networks for time-series forecasting and sequence-to-sequence translation. It further provides examples of reinforcement learning agents trained via reward optimization and unsupervised learning techniques such as autoencoders
ConvNetJS is a JavaScript deep learning library and neural network training engine designed for client-side machine learning. It functions as a framework for building, training, and running convolutional neural networks directly within a web browser without the need for a backend server. The library specializes in image recognition and pattern analysis using convolutional and pooling layers. It enables the creation of models for classification and regression tasks, as well as the development of reinforcement learning agents that optimize behavior through trial and error in simulated environme
Immutable.js is a library of persistent data structures and a functional state management toolkit. It provides a collection of immutable objects and arrays that prevent direct mutation to ensure predictable state management in JavaScript applications. The library utilizes structural sharing to efficiently create new versions of data without full copying and implements lazy sequence processing to chain data transformations that execute only when values are requested. It also supports batch mutation processing, allowing multiple changes to be applied to a temporary mutable copy before returning
Donut is an OCR-free document transformer and end-to-end document parser. It functions as a neural network that converts unstructured document images directly into structured data or text without the use of an external optical character recognition engine. The project includes a synthetic document generator to create artificial images and ground-truth labels for training. It employs a transformer model to perform visual question answering and document image classification based on visual layout and text. The system covers several document understanding capabilities, including structured info
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
Anti-Anti-Spider is an automated web scraping toolkit and CAPTCHA bypass framework. It uses convolutional neural networks to recognize characters and digits in image-based security challenges, enabling programmatic access to protected web content. The project functions as an image recognition model trainer, providing a workflow to preprocess labeled image datasets and train custom neural networks. Users can configure model architectures and hyperparameters to align the recognition system with the visual style of specific target websites. The toolkit covers capabilities for image data preproc
This project is an automated machine learning framework and toolkit designed for training and tuning custom models for classification, regression, and recommendations. It functions as a multimodal machine learning toolkit capable of processing and training models using a combination of text, image, audio, and sensor data. The framework distinguishes itself as a multimodal data processor that can handle and visualize large datasets on a single machine using column-oriented disk storage. It includes a core machine learning model generator that converts trained models into formats compatible wit
DeepLearningZeroToAll is a comprehensive educational resource and implementation collection focused on deep learning and machine learning. It provides a structured learning path using TensorFlow to move from foundational linear models to complex neural network architectures. The project is distinguished by its practical implementations of various network types, including multilayer perceptrons for logic problems, convolutional neural networks for spatial data and image recognition, and recurrent neural networks using LSTM cells for time-series forecasting and character sequence prediction. It
This PyTorch-based image super-resolution tool provides a deep learning pipeline for upscaling low-resolution images. It utilizes generative adversarial networks to increase pixel density and reconstruct high-resolution image details. The system includes a GAN-based image upscaler and a training pipeline that optimizes neural network weights using paired datasets and custom loss functions. To manage hardware resources, a patch-based image processor splits high-resolution files into smaller segments to prevent memory allocation errors and system crashes. Additional capabilities include the ap
ml5-library is a JavaScript machine learning library that functions as a browser-based inference engine. It provides a high-level wrapper for implementing neural networks and data models, allowing users to execute machine learning predictions directly on the client side. The library simplifies the integration of machine learning into web applications and creative coding projects by removing the requirement for deep mathematical expertise. It specifically enables web-based image classification through the use of pretrained deep learning models to identify and label objects within images. The