30 open-source projects similar to shawn-shan/fawkes, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Fawkes alternative.
facenet-pytorch is a facial recognition library for PyTorch that provides pretrained neural networks for detecting faces and extracting facial embeddings. It includes an MTCNN face detector for locating faces and landmarks, alongside an InceptionResnet face encoder to convert facial images into high-dimensional vectors for identity verification. The project provides tools for identity recognition by comparing facial embeddings using cosine similarity. It also supports facial video tracking to maintain identity consistency across consecutive frames and allows for the fine-tuning of pretrained
Deepface is a comprehensive deep learning library for facial recognition and demographic analysis. It provides a modular pipeline that handles the entire lifecycle of facial processing, including detection, geometric alignment, and the transformation of facial images into high-dimensional numerical vector embeddings for identity verification and similarity comparison. The library distinguishes itself through a model ensemble approach, which combines predictions from multiple pre-trained neural networks to improve classification accuracy and reduce bias. It also integrates advanced security fe
CompreFace is a facial recognition system designed for human face detection, identification, and biometric identity verification. It provides a registry of known people and the ability to match faces in images against this database to determine a specific identity. The system extracts facial landmarks to map geometry and analyzes physical attributes including age, gender, and head pose. It can also verify whether two different images belong to the same individual. The project is implemented as a microservice-based deployment utilizing a REST API gateway and a PostgreSQL metadata store. It in
Official implementation of Joint Monocular 3D Vehicle Detection and Tracking (ICCV 2019)
An unsupervised learning framework for depth and ego-motion estimation from monocular videos
Earth observation processing framework for machine learning in Python
TRI-ML Monocular Depth Estimation Repository
Techniques for deep learning with satellite & aerial imagery
A collection of computer vision pre-trained models.
Real-Time 3D Semantic Reconstruction from 2D data
Library for Fast and Flexible Human Pose Estimation
EasyOCR is a deep learning-based computer vision library designed to perform optical character recognition on images and video frames. It functions as a comprehensive pipeline that automates the transformation of visual text into machine-readable strings, enabling the digitization of physical documents, forms, and receipts into searchable data. The engine distinguishes itself through a multi-stage processing workflow that combines convolutional neural networks for spatial feature extraction with sequence-based decoding mechanisms. This architecture allows the system to identify and interpret
An open source tool to quantify the world
Python package for earth-observing satellite data processing
imgaug is a Python library for machine learning data augmentation and computer vision dataset expansion. It provides tools to increase the volume and variety of training sets by applying random geometric, color, and noise transformations to images. The library ensures spatial consistency by synchronizing transformations across images and their associated annotations, such as bounding boxes, keypoints, and segmentation maps. It uses a compositional pipeline pattern to chain multiple augmentations into sequences and employs deterministic seed management to reproduce specific data samples. The
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Yolact is a computer vision framework and real-time instance segmentation model. It utilizes a fully convolutional neural network to detect objects and generate pixel-level masks for images and video feeds. The system employs prototypical mask generation to create global mask prototypes that are linearly combined for instance-specific results. It incorporates deformable convolutional layers and deformable region-of-interest pooling to adapt spatial sampling to the irregular shapes of objects. The framework covers the full model development lifecycle, including training on custom datasets, ac
pysot is a computer vision framework designed for single object tracking. It provides a platform for implementing and evaluating algorithms that locate and follow specific target objects across sequences of video frames. The project includes implementations of the SiamRPN architecture for region proposal network based localization and the SiamMask model, which combines tracking with binary mask generation to provide pixel-level segmentation of objects. The framework also contains a visual tracking evaluation toolkit used to measure the accuracy and reliability of tracking algorithms against
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON, SVE for ARM, HVX for Hexagon
The implementation of an algorithm presented in the CVPR18 paper: "Detect-and-Track: Efficient Pose Estimation in Videos"
Detectron2 is a PyTorch computer vision framework and visual recognition platform designed for training and deploying models for object detection, image segmentation, and visual recognition. It provides a research-oriented environment for training complex vision models with multi-GPU acceleration. The project includes a specialized object detection library for identifying and locating multiple objects via bounding boxes, as well as an image segmentation toolkit for creating pixel-level masks through instance, semantic, and panoptic segmentation. Additionally, it features a human pose estimati
Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds
Meshroom is a node-based photogrammetry software designed to transform collections of two-dimensional images into three-dimensional models and scene geometry. It provides a visual interface for constructing and managing modular data pipelines, allowing users to automate complex computer vision tasks such as feature extraction, depth map estimation, and mesh generation. The software distinguishes itself through a distributed computational framework that dispatches resource-intensive tasks across local hardware or remote render farms. By utilizing a directed acyclic graph execution model, it en
TensorFlow Implementation for Computing a Semantically Segmented Bird's Eye View (BEV) Image Given the Images of Multiple Vehicle-Mounted Cameras.
A Simple and Versatile Framework for Object Detection and Instance Recognition
Libvips is a C-based image processing library designed to manipulate large visual assets through a low-memory, parallel processing pipeline. It functions as a streaming image processor that avoids loading entire files into system memory, enabling the handling of massive images in resource-constrained environments. The library distinguishes itself through a demand-driven architecture that constructs a deferred execution plan, computing only the necessary pixels for a final output. By utilizing a cache-friendly tiled processing model and memory-mapped file access, it minimizes latency and redun
YOLOv5 is a comprehensive computer vision framework designed for end-to-end deep learning, specializing in real-time object detection, image classification, and instance segmentation. It provides a unified toolkit that manages the entire lifecycle of a model, from initial dataset configuration and hyperparameter tuning to high-speed inference and deployment. The framework utilizes a modular neural architecture, allowing users to swap backbone and head components to tailor models for specific visual tasks. What distinguishes this project is its focus on production-ready deployment and model ef