RT DETR

RT DETR: detect objects in real time

Features

Real-Time Object Detection - Identifies and locates multiple objects within images or video streams with high precision and low latency.
NMS-Free Object Detectors - Implements an end-to-end detection transformer that removes the need for anchor generation and non-maximum suppression.
DETR Implementations - Implements the Detection Transformer (DETR) architecture, aiming for higher accuracy than conventional CNN-based detectors at comparable speed.
Computer Vision Inference - Provides a high-performance execution engine for real-time computer vision inference in robotics and surveillance applications.
Object Query Mechanisms - Utilizes a set of learnable object queries to predict bounding boxes and class labels in a single pass.
Vision Transformer Encoders - Employs a transformer encoder whose self-attention layers extract global context from the image features.
Real-Time Model Inference on Frames - Provides an optimized inference pipeline to maintain high frame rates during real-time object detection on video streams.
Computer Vision Models - Provides a high-performance object detection implementation optimized for the PaddlePaddle deep learning platform.
Multi-Scale Feature Pyramids - Merges high- and low-resolution image features using multi-scale pyramids to detect objects of various sizes.
Hungarian Matching Losses - Implements a bipartite matching loss strategy to assign unique predictions to ground truth objects without needing non-maximum suppression.
Hybrid Architectures - Combines multi-scale feature extraction with a lightweight head to optimize the balance between detection accuracy and inference speed.
PyTorch Computer Vision Pipelines - Implements a deep learning pipeline for image analysis and object localization using the PyTorch framework.
Real-Time Detection Optimizations - Optimizes detection tasks for live environments where both high speed and precision are critical.
Transformer - Listed in the “Transformer” section of the Ailia Models awesome list.
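
The bipartite matching idea named under "Hungarian Matching Losses" can be illustrated with a small sketch. This is not RT DETR's actual loss code: the cost function, the `box_weight` value, and the helper names `match_cost` and `bipartite_match` are assumptions made for illustration, and the search is brute force rather than the Hungarian algorithm, which real implementations typically use (for example via SciPy's `linear_sum_assignment`). Boxes are assumed to be normalized (cx, cy, w, h) tuples.

```python
from itertools import permutations


def l1_box_distance(a, b):
    """Sum of absolute coordinate differences between two boxes."""
    return sum(abs(x - y) for x, y in zip(a, b))


def match_cost(pred, target, box_weight=5.0):
    """Hypothetical matching cost: low when the prediction is confident in
    the target's class and its box is close to the target box."""
    class_cost = -pred["probs"][target["label"]]
    return class_cost + box_weight * l1_box_distance(pred["box"], target["box"])


def bipartite_match(preds, targets):
    """Assign each target to a distinct prediction with minimal total cost.

    Brute force over all assignments, so only suitable for tiny examples.
    Assumes len(preds) >= len(targets).
    """
    best_cost, best_assignment = float("inf"), ()
    for pred_indices in permutations(range(len(preds)), len(targets)):
        cost = sum(
            match_cost(preds[p], targets[t]) for t, p in enumerate(pred_indices)
        )
        if cost < best_cost:
            best_cost, best_assignment = cost, pred_indices
    # Return (prediction index, target index) pairs.
    return sorted((p, t) for t, p in enumerate(best_assignment))


# Three predictions (one per object query) and two ground-truth objects.
preds = [
    {"probs": [0.1, 0.9], "box": (0.50, 0.50, 0.20, 0.20)},
    {"probs": [0.8, 0.2], "box": (0.20, 0.20, 0.10, 0.10)},
    {"probs": [0.5, 0.5], "box": (0.90, 0.90, 0.05, 0.05)},
]
targets = [
    {"label": 0, "box": (0.21, 0.19, 0.10, 0.11)},
    {"label": 1, "box": (0.52, 0.50, 0.20, 0.18)},
]

matches = bipartite_match(preds, targets)
print(matches)
```

Each ground-truth object ends up with exactly one prediction, and the leftover prediction is treated as "no object" during training. That one-to-one assignment is why duplicate boxes do not need to be filtered afterwards.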

Open-source alternatives to RT DETR

Similar open-source projects, ranked by how many features they share with RT DETR.

THU-MIG/yolov10 (11,316 stars)
YOLOv10 is a PyTorch computer vision library and real-time vision framework designed for locating and identifying multiple objects in images and video streams. It functions as an end-to-end object detector that optimizes for high-speed deployment and detection precision. The project is distinguished by an NMS-free detection architecture that predicts a single bounding box per object, eliminating the need for non-maximum suppression post-processing to reduce inference latency. It further optimizes for edge hardware through scalable weights and a quantization-friendly structure that facilitates
Python
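
For context on what an NMS-free detector leaves out, here is a minimal sketch of the greedy non-maximum suppression step that conventional detectors run after inference. It is a generic illustration, not code from YOLOv10 or RT DETR; the function names, the (x1, y1, x2, y2) box format, and the 0.5 threshold are assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Keep the highest-scoring boxes, dropping any box that overlaps an
    already kept box by more than iou_threshold. Returns kept indices."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Two near-duplicate boxes on one object plus one separate object.
boxes = [(10, 10, 50, 50), (12, 12, 52, 52), (100, 100, 140, 140)]
scores = [0.9, 0.8, 0.7]

kept = greedy_nms(boxes, scores)
print(kept)
```

The second box is suppressed because it overlaps the first too heavily. An end-to-end detector that emits a single box per object skips this post-processing pass and the latency it adds.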
amdegroot/ssd.pytorch (5,224 stars)
This is a PyTorch object detection framework that implements the Single Shot MultiBox Detector for identifying and localizing multiple objects within images and video. The project provides a neural network architecture designed for single-shot object detection, which predicts bounding boxes and class labels in one pass. The implementation includes a real-time object detector capable of processing live video streams to track and label objects across sequential frames. It also features a complete computer vision training pipeline for preparing image datasets and training model weights. The fra
Python (topics: computer-vision, deep-learning, image-recognition)
zylo117/Yet-Another-EfficientDet-Pytorch (5,245 stars)
This project is a PyTorch implementation of the EfficientDet architecture designed for real-time object detection. It provides a neural network and inference engine capable of identifying and locating multiple objects within images or video streams. The implementation includes pretrained computer vision models with optimized weights, enabling immediate inference and fine-tuning without the need for training from scratch. The project covers the full pipeline for computer vision model optimization, including custom object detection training and model weight optimization. It incorporates struct
Jupyter Notebook (topics: bifpn, detection, efficientdet)
fundamentalvision/Deformable-DETR (3,895 stars)
Deformable-DETR is an object detection system for computer vision that uses a transformer-based encoder-decoder architecture. It identifies and locates objects within images by representing potential targets as a set of learnable queries. The project employs sampling-based attention to restrict attention to a small set of points around a reference, reducing computational complexity and speeding up convergence. It further utilizes multi-scale feature fusion to detect objects of varying sizes within a single frame. The system includes capabilities for training models across multiple GPU cluste
Python

See all 30 alternatives to RT DETR

lyuwenyu/RT-DETR
