SOTAVerified

Object Detection

Papers

Showing 201250 of 10957 papers

TitleStatusHype
Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization0
Visual Consensus Prompting for Co-Salient Object DetectionCode1
SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems0
Context-Awareness and Interpretability of Rare Occurrences for Discovery and Formalization of Critical Failure Modes0
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection0
Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models0
DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving SafetyCode0
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory PredictionCode1
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes0
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving0
Perception Encoder: The best visual embeddings are not at the output of the networkCode8
VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture0
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity0
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding0
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling0
Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection0
RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning0
A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions0
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild0
Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline0
Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task0
S^2Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection0
CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors0
Weather-Aware Object Detection Transformer for Domain Adaptation0
Flyweight FLIM Networks for Salient Object Detection in Biomedical Images0
DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmenCode1
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*0
CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection0
ATLASv2: LLM-Guided Adaptive Landmark Acquisition and Navigation on the Edge0
Detecting streaks in smart telescopes images with Deep Learning0
DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing0
Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection0
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and ResultsCode2
LEMUR Neural Network Dataset: Towards Seamless AutoMLCode1
Vision-Language Model for Object Detection and Segmentation: A Review and EvaluationCode2
Uncertainty Guided Refinement for Fine-Grained Salient Object DetectionCode1
RT-DATR:Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature LearningCode1
RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object DetectionCode0
self-prompting analogical reasoning for uav object detectionCode2
High Dynamic Range Modulo Imaging for Robust Object Detection in Autonomous Driving0
MultiCore+TPU Accelerated Multi-Modal TinyML for Livestock Behaviour Recognition0
Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object DetectionCode0
WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer0
Detect Anything 3D in the WildCode3
Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement0
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural NetworksCode1
P2Object: Single Point Supervised Object Detection and Instance SegmentationCode2
Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network0
RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions0
Show:102550
← PrevPage 5 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified