10 open-source projects similar to yangchris11/samurai, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Samurai alternative.
YOLOv10 is a PyTorch computer vision library and real-time vision framework designed for locating and identifying multiple objects in images and video streams. It functions as an end-to-end object detector that optimizes for high-speed deployment and detection precision. The project is distinguished by an NMS-free detection architecture that predicts a single bounding box per object, eliminating the need for non-maximum suppression post-processing to reduce inference latency. It further optimizes for edge hardware through scalable weights and a quantization-friendly structure that facilitates
A collection of reference implementations and code samples for integrating Android camera hardware and software APIs. The project provides demonstrations for using both the Jetpack CameraX library and the low-level Camera2 API to implement photo and video capture features. The repository includes specialized implementations for high-performance recording, such as high-frame-rate slow motion and high-dynamic-range video. It also features examples of machine learning vision, demonstrating how to analyze live camera frames for object detection and QR code scanning. The project covers broad imag
This project is a cross-platform mobile camera framework and real-time computer vision library. It provides a high-performance interface for mobile applications to handle hardware control, media capture, and live camera frame processing. The framework includes a dedicated system for running AI models and custom analysis on live camera streams using high-performance worklets. It also functions as a real-time detection and decoding system for QR codes and barcodes. Broad capabilities cover the capture of high-resolution photos and videos with controls for zoom, HDR, and frame rates. The projec
This project is a deep learning face classification system that detects human faces and classifies gender and emotion. It utilizes convolutional neural networks and computer vision tools to analyze facial attributes in both static images and live video streams. The system includes specialized classifiers for emotions based on the FER2013 dataset and gender based on IMDB datasets. These models are integrated into a containerized web service, allowing the classification logic to be exposed as an API that processes image data via network requests. The technical surface covers the entire pipelin
NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥
RT-DETR is a real-time object detection model based on the detection transformer architecture. It is implemented as a computer vision model for both the PyTorch and PaddlePaddle deep learning platforms, designed to identify and locate multiple objects in images and video streams. The model eliminates the need for anchor generation and non-maximum suppression by utilizing a transformer-based approach. It focuses on high-performance detection, balancing precision and low latency for live environment deployment. The system employs a hybrid encoder and multi-scale feature fusion to extract globa
MiDaS is a PyTorch computer vision library and monocular depth estimation model designed to predict scene depth from single images. It functions as a scene depth predictor that computes distance maps to determine object proximity to the camera. The project enables zero-shot depth transfer, allowing the model to be applied to new datasets or environments without additional training data. It focuses on relative depth regression to predict scale-invariant depth maps. The library includes a real-time depth visualizer for capturing live camera feeds and displaying corresponding depth maps. It als
YOLO-World is a vision-language framework and open-vocabulary object detection model. It identifies objects in images and video based on free-form text prompts without requiring predefined category labels. The system enables the identification of arbitrary objects by fusing image features with text embeddings. It includes a specialized tool for automated image labeling, which generates bounding box annotations for custom datasets using text-based prompts. The project provides a deployment pipeline for converting models into quantized ONNX and TFLite formats, supporting real-time inference on
Anomalib is a PyTorch-based library for visual anomaly detection, offering a modular framework, a comprehensive model zoo, and a benchmarking suite designed for industrial defect detection. It provides a wide range of algorithms—including generative, discriminative, teacher-student, and vision-language approaches—that support unsupervised, few-shot, and zero-shot settings. The library enables deployment through model export to ONNX and OpenVINO for edge devices, and includes a no-code web application for training and inference. It also features a command-line interface for orchestrating multi