Computervision Recipes

Features

Computer Vision Tutorials - Offers a comprehensive collection of code samples and best practices for building deep learning image and video analysis models.
Implementation Recipes - Organizes complex computer vision tasks into modular recipes and guided workflows for reproducible model building.
Object Detection and Tracking - Provides implementations for identifying objects via bounding boxes and tracking their movement across video frames.
Object Detection - Locates and identifies items within an image by generating bounding boxes and class labels.
Image Segmentation - Provides techniques for partitioning images into precise object boundaries and masks.
Segmentation Model Training - Develops segmentation systems by fine-tuning pre-trained backbones on custom annotated datasets.
Image Classification - Implements supervised machine learning techniques to assign category labels to images.
Classification Training - Provides implementations for building single or multi-label image classification models.
Fine-Tuning Pipelines - Implements workflows for adapting pre-trained neural network backbones to custom datasets.
Vision Workflows - Provides Python-based guides for training, fine-tuning, and validating neural networks for visual data processing.
Action Recognition Training - Builds and evaluates models for activity classification using custom datasets or benchmark fine-tuning.
Model Recipes - Provides modular, step-by-step implementations for common vision tasks like object detection and action recognition.
Notebook Execution Environments - Uses notebook-driven prototyping to iteratively develop and validate computer vision algorithms.
Image Similarity Estimation - Implements a framework for computing visual similarity and executing fast image retrieval.
Model Deployment - Provides a workflow for packaging trained vision models into containers for scalable inference.
Pose Estimation - Locates anatomical keypoints on the human body to determine posture and orientation.
Representation Learning - Trains deep neural networks to compute image representations for identifying similar images.
Crowd Counting - Provides tools to estimate human density and count individuals within varied scene environments.
Human Activity Recognition - Analyzes video sequences to identify, categorize, and timestamp specific human activities.
Image Retrieval Systems - Executes rapid searches for similar images using nearest neighbor search algorithms.
Keypoint Detection - Identifies points of interest on objects using models that detect both the object and its keypoints.
Training Pipelines - Builds custom models that localize specific points of interest on objects using a mask-based framework.
Model Fine-Tuning - Adjusts pre-trained models using custom datasets to improve accuracy for specific tasks.
Vision Model Training - Trains high-accuracy models for identifying and locating objects in custom datasets.
Hyperparameter Optimization - Utilizes grid search and parallel sweeping to find optimal model parameters for accuracy and speed.
Retrieval Re-ranking - Implements k-reciprocal re-ranking to refine the accuracy of image retrieval results.
Hard Negative Mining - Increases model precision by sampling difficult negative examples during the training process.
Video Object Tracking - Identifies and follows multiple distinct objects across video frames using tracking algorithms.
GPU Provisioning Services - Automates the provisioning of GPU-enabled virtual machines pre-configured with necessary vision libraries.
Containerized Training Environments - Creates containerized images with standardized dependencies for CPU and GPU-based model training and testing.
Kubernetes Deployments - Provides configurations for orchestrating containerized vision models on Kubernetes for REST API inference.
Kubernetes Application Deployments - Packages trained vision models into containers for automated deployment to managed Kubernetes clusters.
Model Inference APIs - Hosts trained models on cloud containers to expose them as REST APIs for scalable inference.
Computer Vision Libraries - Best practices and code for computer vision.
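
The image similarity and retrieval features listed above rest on one idea: embed each image as a vector, then rank gallery images by how close their vectors are to the query's. The following is a minimal sketch of that idea using brute-force cosine similarity in plain Python. It is not the project's own implementation; the function names, file names, and 3-dimensional embeddings are hypothetical stand-ins for what a trained network would produce.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def retrieve_similar(query, gallery, k=3):
    """Return the k (name, score) pairs most similar to the query.

    This is an exhaustive nearest neighbor search: every gallery
    embedding is compared against the query, then results are sorted.
    """
    scored = [(name, cosine_similarity(query, emb)) for name, emb in gallery.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical embeddings; a real system would compute these with a trained model.
gallery = {
    "cat_1.jpg": [0.9, 0.1, 0.0],
    "cat_2.jpg": [0.8, 0.2, 0.1],
    "car_1.jpg": [0.0, 0.1, 0.9],
}
query = [1.0, 0.0, 0.0]

top = retrieve_similar(query, gallery, k=2)
for name, score in top:
    print(f"{name}: {score:.3f}")
```

Exhaustive search like this scales linearly with gallery size, so larger collections would typically swap in an approximate nearest neighbor index while keeping the same ranking interface.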

Open-source alternatives to Computervision Recipes

Similar open-source projects, ranked by how many features they share with Computervision Recipes.

dmlc/gluon-cv
GitHub stars: 5,922
Gluon-CV is an MXNet computer vision library that provides a comprehensive collection of pre-implemented vision architectures and training pipelines. It serves as a deep learning research toolkit and a model zoo containing state-of-the-art pre-trained weights for image and video analysis. The project includes a specialized human pose estimation library and a model compression toolkit. These tools allow for the pruning and quantization of deep learning models to increase inference speed and facilitate deployment on constrained edge hardware. The library covers a broad range of vision capabili
Python, action-recognition, computer-vision, deep-learning
TingsongYu/PyTorch_Tutorial
GitHub stars: 8,018
This project is a comprehensive collection of educational examples and reference implementations for building vision and language models using PyTorch. It serves as a deep learning tutorial covering the end-to-end process of developing neural networks, from initial architecture definition to final production deployment. The repository provides detailed guides on implementing a wide range of domain-specific models, including convolutional neural networks for object detection and segmentation, as well as transformer and recurrent architectures for natural language processing. It emphasizes gene
Python
PaddlePaddle/PaddleDetection
GitHub stars: 14,243
PaddleDetection is an object detection framework designed for the end-to-end development, training, and deployment of computer vision models. It provides a comprehensive library of modular neural network architectures and pipelines that support object detection, instance segmentation, and multi-object tracking tasks. The project distinguishes itself through a configuration-driven approach that decouples model components like backbones and heads, allowing for the flexible assembly of custom vision workflows. It incorporates advanced techniques such as anchor-free detection logic, joint detecti
Python, blazeface, deepsort, detr
ultralytics/ultralytics
GitHub stars: 58,468
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification. By utilizing a modular architecture, the platform allows users to swap model components to balance inference speed and accuracy requirements for diverse applications. The framework distinguishes itself through its support for real-time processing and flexible deployment. It in
Python, cli, computer-vision, deep-learning
