Segment Everything Everywhere All At Once

Segment Everything Everywhere All At Once | Awesome Repos

Features

Semantic Image Segmentations - Provides algorithms for dividing images into precise segments to isolate distinct objects and patterns.
Query-Based Mask Generators - Produces precise object masks by processing user-defined or learned queries through a predictive decoder.
Text-Prompted Masking - Converts natural language text descriptions into precise pixel-level semantic masks.
Multi-Modal Image Querying - Generates pixel-level masks using a combination of text prompts, audio signals, and referring images.
Image Labeling Engines - Utilizes an image labeling engine to predict category labels for generated masks based on predicted embeddings.
Automated Image Labeling - Automatically generates category labels for segmentation masks based on a predefined semantic vocabulary.
Multi-Modal Image Segmentation Frameworks - Implements a comprehensive framework for generating image masks using combined text, image, and audio signals.
Multi-Modal Prompt Integration - Combines textual prompts, images, and audio signals into a unified embedding space to guide the segmentation process.
SAM-Based Implementations - Leverages the Segment Anything Model architecture to perform high-quality image and video segmentation.
Video Object Tracking - Implements algorithms for maintaining object identity and tracking segments across sequential video frames.
Interactive Video Object Trackers - Provides an interactive tool for tracking visual entities across video frames using referring images or textual queries.
Semantic Masking Architectures - Assigns semantic category labels to every individual pixel of generated masks using deep learning architectures.
Agent Session Memory - Provides persistent storage of interaction history and query decisions to maintain segmentation continuity across user turns.
Interactive Segmenters - Ships a user interface for the iterative refinement of image segmentation masks through multiple interaction rounds.
Referring-Image Feature Matching - Extracts specific objects in target images by matching visual features and spatial relationships with a reference image region.
Cell Segmentation - Unified segmentation model for diverse inputs.

Open-source alternatives to Segment Everything Everywhere All At Once

Similar open-source projects, ranked by how many features they share with Segment Everything Everywhere All At Once.

syscv/sam-hq
SysCV/sam-hq
4,234View on GitHub
sam-hq is a collection of pre-trained vision foundation models and adapters designed for high-quality image segmentation, multimodal feature extraction, and depth estimation. It provides a zero-shot vision model capable of performing segmentation and classification across diverse domains without requiring task-specific training. The project features a high-quality image segmentation tool based on the Segment Anything Model that generates precise masks from spatial prompts. It includes a multimodal feature extractor to generate high-dimensional vector embeddings from both image and text inputs
Jupyter Notebookhigh-qualitysamsegment-anything
View on GitHub4,234
dmlc/gluon-cv
dmlc/gluon-cv
5,922View on GitHub
Gluon-CV is an MXNet computer vision library that provides a comprehensive collection of pre-implemented vision architectures and training pipelines. It serves as a deep learning research toolkit and a model zoo containing state-of-the-art pre-trained weights for image and video analysis. The project includes a specialized human pose estimation library and a model compression toolkit. These tools allow for the pruning and quantization of deep learning models to increase inference speed and facilitate deployment on constrained edge hardware. The library covers a broad range of vision capabili
Pythonaction-recognitioncomputer-visiondeep-learning
View on GitHub5,922
facebookresearch/sam3
facebookresearch/sam3
7,762View on GitHub
This project is a computer vision system for object segmentation and tracking across images and videos. It employs models capable of identifying and masking objects using text prompts, bounding boxes, click points, or image exemplars. The system differentiates itself through memory-based video tracking and shared-memory architectures that maintain consistent object identities over time. It supports multi-object processing in single computation passes to increase frame throughput and utilizes iterative refinement to correct segmentation boundaries through sequential prompts. The software also
Python
View on GitHub7,762
dusty-nv/jetson-inference
dusty-nv/jetson-inference
8,734View on GitHub
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
C++caffecomputer-visiondeep-learning
View on GitHub8,734

See all 30 alternatives to Segment Everything Everywhere All At Once

UX-DecoderSegment-Everything-Everywhere-All-At-Once

Segment Everything Everywhere All At Once

Features

Open-source alternatives to Segment Everything Everywhere All At Once

SysCV/sam-hq

dmlc/gluon-cv

facebookresearch/sam3

dusty-nv/jetson-inference

Star history

Open-source alternatives to Segment Everything Everywhere All At Once

SysCV/sam-hq

dmlc/gluon-cv

facebookresearch/sam3

dusty-nv/jetson-inference