30 open-source projects similar to atduskgreg/opencv-processing, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Opencv Processing alternative.
GoCV is a computer vision library and Go language binding for OpenCV. It serves as an image processing toolkit and deep learning inference engine, providing programmatic access to a wide range of algorithms for image manipulation, object detection, and video analysis. The project differentiates itself through high-performance native bindings and hardware acceleration. It utilizes a foreign function interface to map Go calls to C++ functions and includes a hardware-agnostic backend dispatch to route neural network tasks to computation engines such as CUDA and OpenVINO. The library covers a br
This project is a scientific computing framework for the .NET ecosystem, providing a comprehensive suite of libraries for numerical analysis, statistics, and mathematical optimization. It serves as a foundational toolkit for developing applications in machine learning, digital signal processing, and computer vision. The framework provides specialized toolkits for training and deploying predictive models, including neural networks, support vector machines, and decision trees. It further distinguishes itself with deep integrations for real-time visual analysis, such as object tracking and facia
scikit-image is a Python image processing library and scientific image analysis toolkit. It provides a framework for digital image processing and computer vision, utilizing numerical arrays for pixel-level manipulations. The library enables the quantification of image properties and the detection of visual features, such as edges and blobs. It includes tools for image segmentation and the extraction of textures and patterns to characterize objects within visual data. Capabilities cover image manipulation through color space conversion, geometric transformations, and digital restoration. It a
node-opencv is a high-performance C++ native addon and bridge that connects Node.js applications to the OpenCV library. It serves as an image processing toolkit and computer vision library, allowing JavaScript code to execute vision algorithms and image manipulation operations through native bindings. The project provides specialized capabilities for face and shape detection, as well as face identity recognition using trained models. It includes tools for object motion tracking through optical flow and background subtraction, along with the ability to identify specific patterns and analyze sh
JavaCV provides a Java-based interface for native computer vision and video processing libraries. It functions as a wrapper for native vision libraries, allowing Java applications to perform image analysis, object detection, and video stream processing. The project integrates comprehensive computer vision capabilities, including facial recognition, image segmentation, and optical flow analysis for motion tracking. It also provides tools for hardware geometry calibration and projector-camera alignment to ensure accurate spatial representation. The system covers high-performance media renderin
Kornia is a differentiable computer vision library and cross-framework tensor vision toolset. It implements vision operations as differentiable tensors to enable integration into deep learning pipelines and supports the transpilation of operations across PyTorch, TensorFlow, JAX, and NumPy. The project provides specialized toolsets for geometric vision and stereo depth, including algorithms for 3D scene reconstruction, camera calibration, and pose estimation. It further distinguishes itself as a differentiable image augmentation framework, applying random geometric and color transformations w
OpenCVSharp is a .NET library that wraps native OpenCV functions, providing C# developers with access to OpenCV's computer vision capabilities through an API that mirrors the native C/C++ style. It serves as a managed wrapper for image processing, feature detection, object detection, and image manipulation tasks, while also handling automatic disposal of unmanaged OpenCV resources like Mat objects to prevent memory leaks in .NET applications. The library enables keypoint detection and descriptor extraction using algorithms such as AKAZE, BRISK, or FAST, with brute-force or FLANN-based matchin
opencv4nodejs is a set of JavaScript wrappers and a C++ native addon that provides Node.js bindings for the OpenCV library. It functions as a computer vision library and image processing framework, exposing high-performance C++ algorithms to a JavaScript environment. The project enables the execution of vision algorithms for detecting faces, tracking objects, and analyzing visual data using deep neural networks. It includes capabilities for data pattern classification, text pattern recognition, and the identification of facial landmarks and gestures. The framework covers a broad capability s
This project is a collection of optional, community-contributed algorithms and specialized vision tools that extend the core OpenCV framework. It serves as a comprehensive library of extra modules for computer vision research, providing advanced toolsets for image processing, visual data analysis, and object detection. The library includes specialized frameworks for augmented reality tracking, biometric face recognition, and three-dimensional pose estimation. It provides distinct capabilities for identifying AR markers, tracking 3D object silhouettes, and performing neural network vulnerabili
libfacedetection is a C++ face detection library and computer vision tool. It utilizes a neural network face detector to identify human faces in images and return bounding box coordinates. The library is designed for low latency and high throughput processing, enabling real-time face detection in image and video streams. It supports automated image analysis for identifying coordinates of human faces across large batches of photos and high-performance video processing.
Faceai is a computer vision toolkit designed for facial analysis, identity recognition, and image processing. It provides integrated engines for detecting human faces in static images and live video streams, matching facial encodings against identity databases, and mapping facial landmarks to understand geometric structure and alignment. The project enables real-time augmented reality applications, such as applying virtual makeup and digital accessories by scaling assets to detected facial coordinates. It also includes a suite for digital image restoration capable of removing noise, erasing w
This library provides a deep learning framework for identifying human faces and extracting facial landmarks within digital images. It utilizes a multi-task convolutional neural network architecture to simultaneously perform face classification, bounding box regression, and landmark localization. The system processes images through three sequential stages of neural networks, incorporating image pyramid resizing to detect faces of varying scales. To ensure accuracy, it employs bounding box regression to refine coordinate predictions and non-maximum suppression to filter out redundant overlappin
PaddleDetection is an object detection framework designed for the end-to-end development, training, and deployment of computer vision models. It provides a comprehensive library of modular neural network architectures and pipelines that support object detection, instance segmentation, and multi-object tracking tasks. The project distinguishes itself through a configuration-driven approach that decouples model components like backbones and heads, allowing for the flexible assembly of custom vision workflows. It incorporates advanced techniques such as anchor-free detection logic, joint detecti
Gyroflow is a gyroscope video stabilization software and IMU telemetry processor designed to remove camera shake from video files. It functions as a hardware-accelerated video renderer and lens calibration tool, utilizing embedded or external gyroscope and accelerometer data to perform pixel-level stabilization. The system is distinguished by its ability to integrate with professional non-linear video editing software via plugins, allowing stabilization to be applied directly to timelines without transcoding original footage. It supports diverse telemetry ingestion from camera brands, flight
SensorsCalibration is a toolkit for computing the intrinsic and extrinsic parameters of cameras and LiDAR sensors in automotive environments. It provides tools for calculating internal camera focal properties, aligning LiDAR point clouds to 2D camera images, and determining the spatial transformations between multiple sensors and the vehicle coordinate system. The project includes a targetless calibration framework that aligns sensors to a vehicle by analyzing natural features in road environments without the need for specialized physical markers. It also supports factory calibration processe
jeelizFaceFilter is a browser-based computer vision engine and WebGL face tracking library designed for AR filters and real-time facial movement tracking. It functions as a neural network face detector that identifies multiple faces and monitors mouth movements and rotation within a web browser. The system distinguishes itself through a model-swappable detection pipeline, allowing the exchange of neural network weights to balance accuracy and performance across different camera angles and devices. It features real-time lighting synchronization to match the illumination of 3D overlays with the
FreeMoCap is an open-source markerless motion capture system that reconstructs 3D human pose from video. It uses a multi-camera setup with ChArUco board calibration to accurately triangulate body landmarks, and it also supports single-camera recording for simpler captures. The system outputs skeleton joint data and generates interactive Jupyter notebooks for each recording, enabling users to explore and analyse motion data directly. Built around hardware-synchronised video capture and MediaPipe-based 2D pose detection, FreeMoCap supports both calibrated multi-camera recording and real-time 2D
This repository is a comprehensive collection of reference implementations and sample libraries for the Universal Windows Platform. It provides practical examples of how to use Windows Runtime APIs to build cross-device applications, including detailed guidance on XAML-based declarative user interfaces and DirectX-integrated rendering. The project distinguishes itself by providing a wide array of hardware integration suites, covering low-level communication with USB, Serial, I2C, SPI, and GPIO peripherals. It includes specialized implementations for mixed reality holographic rendering, advanc
This project is a computer vision system designed for the detection and identification of human faces within live video streams. It functions as a facial analysis pipeline that processes visual data to locate facial boundaries and match individuals against a stored database of known identities. The system utilizes a multi-stage neural network framework to isolate facial regions and extract unique identity characteristics. By converting facial image data into compact numerical vectors, it performs geometric similarity calculations to verify or identify subjects as they appear in motion. The s
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
The Intel RealSense SDK is a software development kit providing drivers and libraries for interfacing with depth cameras to capture color, depth, and infrared data streams. It includes a depth camera driver for device discovery and sensor configuration, a stereo vision library for computing depth maps and aligning frames, and a 3D point cloud generator to transform depth and infrared frames into spatial representations. The SDK distinguishes itself through on-chip depth calculation and stereo calibration, using internal vision processors to reduce host CPU load. It supports hardware-level str
Bild is an image processing library implemented in the Go programming language. It provides a collection of algorithmic engines for image manipulation, including a convolution kernel engine for filtering, an image blending tool for layer composition, and a procedural noise generator for creating synthetic textures. The project is distinguished by its procedural generation capabilities, implementing Perlin, Gaussian, binary, and uniform noise algorithms to produce random pixel distributions and organic patterns. It also features a command-line interface that allows users to apply visual effect
GPUImage2 is a Swift framework for applying real-time filters and effects to images and video using the GPU. It provides a real-time video filter library, an image geometry manipulation engine, and an OpenGL shading pipeline for processing visual data on graphics hardware. The framework enables the construction of visual effect pipelines by chaining image sources to consumers in sequential flows. It supports the development of custom fragment and vertex shaders for bespoke image processing and offers the ability to bundle these operations into reusable units via graph-based grouping. Capabil
Caire is a command-line image processing engine designed for content-aware resizing and batch manipulation. It utilizes seam carving algorithms to adjust image dimensions by identifying and removing low-energy pixels, allowing for the rescaling of images while preserving primary visual subjects and maintaining aspect ratios. The tool distinguishes itself through its ability to protect specific visual elements, such as human faces, from distortion during the resizing process. Users can apply custom binary masks to define regions for protection or forced removal, and the engine provides real-ti
OpenCV is an open-source computer vision library and visual analysis toolkit. It provides a framework for processing static images and dynamic video frames to analyze visual data and extract information using deep learning. The project functions as a real-time image processing framework, enabling the execution of vision algorithms on live video streams for immediate analysis and data processing. The toolkit covers a broad range of capabilities including image pattern recognition, real-time video analysis, and visual data extraction. It also supports automated visual inspection for detecting
This project is a collection of neural network models and geometric tools designed for image feature matching, spatial alignment, and visual localization. It provides a pre-trained neural network model for identifying high-accuracy correspondences between sparse image features without requiring local training. The system utilizes a graph neural network matcher that employs attention mechanisms and message passing to learn spatial relationships between image feature points. It integrates a RANSAC camera pose estimator to filter feature matches and calculate the relative spatial transformation
This project is a Python wrapper for the OpenCV computer vision library, providing a bridge that exposes high-performance C++ functions to the Python programming language. It serves as a collection of tools for real-time image processing, object detection, and machine learning on visual data. The project provides precompiled binary distributions, allowing for the integration of vision capabilities into Python applications without requiring a local C++ compiler. It offers multi-variant package distributions, including headless versions designed for server or cloud environments where a graphica
This project is a cross-platform mobile camera framework and real-time computer vision library. It provides a high-performance interface for mobile applications to handle hardware control, media capture, and live camera frame processing. The framework includes a dedicated system for running AI models and custom analysis on live camera streams using high-performance worklets. It also functions as a real-time detection and decoding system for QR codes and barcodes. Broad capabilities cover the capture of high-resolution photos and videos with controls for zoom, HDR, and frame rates. The projec
Super-Gradients is a PyTorch computer vision framework and training library designed for the full lifecycle of vision models. It functions as a deep learning model optimizer and a deployment toolkit for training and fine-tuning models across image classification, object detection, semantic segmentation, and pose estimation tasks. The project provides specific tools for model optimization, including teacher-student knowledge distillation and numerical precision compression to reduce memory and computational requirements. It also includes the implementation of the Yolo-NAS architecture for high
Darknet is a high-performance C-based inference engine and computer vision library designed for real-time object identification and localization. It serves as a neural network framework for training and deploying detection models using the YOLO architecture, providing a toolset for deep learning training and deployment. The project differentiates itself through a C and CUDA implementation that enables hardware acceleration for matrix multiplication and inference speed optimization. It provides a shared library interface for embedding detection capabilities into external applications and suppo