30 open-source projects similar to nvidia/semantic-segmentation, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Semantic Segmentation alternative.
Pixel-wise segmentation on the VOC2012dataset dataset using pytorchpytorch.
Detectron2 is a PyTorch computer vision framework and visual recognition platform designed for training and deploying models for object detection, image segmentation, and visual recognition. It provides a research-oriented environment for training complex vision models with multi-GPU acceleration. The project includes a specialized object detection library for identifying and locating multiple objects via bounding boxes, as well as an image segmentation toolkit for creating pixel-level masks through instance, semantic, and panoptic segmentation. Additionally, it features a human pose estimati
PyTorch module to use OpenFace's nn4.small2.v1.t7 model
This project is a library of pretrained computer vision architectures and backbones for image classification and feature extraction. It serves as a comprehensive model zoo and collection of standardized image encoders, including ResNet, Vision Transformers, and EfficientNet, for use in visual analysis and as backbones for object detection and image segmentation. The library provides a framework for distributed training and evaluation of image models using advanced data augmentation and optimization scripts. It includes a dedicated toolset for converting trained PyTorch vision models into the
About PyTorch 1.2.0 Now the master branch supports PyTorch 1.2.0 by default. Due to the serious version problem (especially torch.utils.data.dataloader), MDSR functions are temporarily disabled. If you have to train/evaluate the MDSR model, please use legacy branches.
This is a PyTorch-based computer vision library for detecting 2D and 3D facial landmark coordinates. It functions as a facial landmark detector and reconstruction tool, utilizing deep learning to identify precise geometric points on human faces from image datasets. The library allows for the selection of specific detection backends to balance accuracy and processing speed. It supports the integration of precomputed bounding box files, which enables the system to bypass the initial detection phase and proceed directly to landmark extraction. The toolkit includes capabilities for batch image p
This project is a modular PyTorch framework for training and evaluating object detection and instance segmentation models. It serves as a computer vision research tool and a deep learning inference engine designed to identify object locations, classes, and pixel-level masks within images. The framework implements a two-stage inference pipeline that utilizes region proposal networks and a symmetric mask-head architecture. It provides specialized capabilities for instance segmentation, object bounding box detection, and human pose estimation via anatomical keypoint detection. The system includ
(Disclaimer: this is work in progress and does not feature all the functionalities of detectron. Currently only inference and evaluation are supported -- no training) (News: Now supporting FPN and ResNet-101!)
This repo is used to research convolutional networks primarily for computer vision tasks. For this purpose, the repo contains (re)implementations of various classification, segmentation, detection, and pose estimation models and scripts for training/evaluating/converting.
I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
Neural Style and MSG-Net
This repository contains some models for semantic segmentation and the pipeline of training and testing models, implemented in PyTorch
IJCAI2020 & IJCV2021 :city_sunrise: Unsupervised Scene Adaptation with Memory Regularization in vivo
OpenPose is a real-time pose estimation engine designed to detect and track human body, face, hand, and foot landmarks. It functions as a multi-person motion tracker, identifying the spatial coordinates of multiple individuals simultaneously within video streams or static images. Beyond two-dimensional detection, the software acts as a three-dimensional kinematics processor, reconstructing spatial movement data from single or multiple synchronized camera perspectives. The system distinguishes itself through a bottom-up approach that utilizes part-affinity fields to associate body parts across
InsightFace is a comprehensive deep learning framework designed for face recognition, biometric identity verification, and feature extraction. It provides a specialized engine for one-to-one verification and one-to-many identification tasks, utilizing convolutional neural networks to transform raw image pixels into high-dimensional vector embeddings. The project includes a complete toolkit for detecting, aligning, and processing facial data to ensure consistent identity discrimination. Beyond core recognition, the platform distinguishes itself through an extensive model management and optimiz
High-level batteries-included neural network training library for Pytorch
PyTorch implementation of neural style transfer algorithm
This is a PyTorch object detection framework that implements the Single Shot MultiBox Detector for identifying and localizing multiple objects within images and video. The project provides a neural network architecture designed for single-shot object detection, which predicts bounding boxes and class labels in one pass. The implementation includes a real-time object detector capable of processing live video streams to track and label objects across sequential frames. It also features a complete computer vision training pipeline for preparing image datasets and training model weights. The fra
This repository serves as a comprehensive research platform and toolkit for advancing machine learning, quantum computing, and large-scale scientific data analysis. It provides foundational frameworks for developing complex algorithmic systems, offering the necessary infrastructure for distributed training, computational graph execution, and high-performance model development. The project distinguishes itself by integrating specialized research domains with robust, privacy-preserving methodologies. It supports diverse scientific discovery through tools for quantum simulation, physics-informed
YOLOv2 in PyTorch
STEAL - Learning Semantic Boundaries from Noisy Annotations (CVPR 2019)
This repository provides structured code examples and project templates designed for classroom instruction in machine learning and neural networks. It offers reference implementations of deep learning models for both computer vision and natural language processing tasks, built using PyTorch as the core framework. The codebase is organized as a modular project template with separate directories for data handling, model definitions, and training scripts, promoting reusability and clarity. It includes predefined pipelines for image classification and text processing, along with a command-line in
Darknet is a high-performance C-based inference engine and computer vision library designed for real-time object identification and localization. It serves as a neural network framework for training and deploying detection models using the YOLO architecture, providing a toolset for deep learning training and deployment. The project differentiates itself through a C and CUDA implementation that enables hardware acceleration for matrix multiplication and inference speed optimization. It provides a shared library interface for embedding detection capabilities into external applications and suppo
TCS humAIn This is a Flutter application that is used to locate the license plate out of a picture given to the application. Cue the Drum Rolls for what I am about to disclose. With the help of Sayak Paul the tensorflow model that was 255mb was cut short to a 2mb file. TFLite did the trick for…
WACV'21: Do We Really Need Gold Samples for Sample Weighting Under Label Noise?