SOTAVerified

Object Detection

Papers

Showing 851875 of 10957 papers

TitleStatusHype
OK-VQA: A Visual Question Answering Benchmark Requiring External KnowledgeCode1
DiffuBox: Refining 3D Object Detection with Point DiffusionCode1
Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing ImagesCode1
Categorical Depth Distribution Network for Monocular 3D Object DetectionCode1
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature ConfusionCode1
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher LearningCode1
CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAsCode1
CodeReef: an open platform for portable MLOps, reusable automation actions and reproducible benchmarkingCode1
Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing ImagesCode1
Co-Fix3D: Enhancing 3D Object Detection with Collaborative RefinementCode1
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language ModelCode1
CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object DetectionCode1
Common Limitations of Image Processing Metrics: A Picture StoryCode1
CondenseNet V2: Sparse Feature Reactivation for Deep NetworksCode1
Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth PredictionCode1
Domain Adaptive Object Detection for Autonomous Driving under Foggy WeatherCode1
A Dual-Cycled Cross-View Transformer Network for Unified Road Layout Estimation and 3D Object Detection in the Bird's-Eye-ViewCode1
Comics Datasets Framework: Mix of Comics datasets for detection benchmarkingCode1
An Ultra-low Power TinyML System for Real-time Visual Processing at EdgeCode1
A Dual Weighting Label Assignment Scheme for Object DetectionCode1
Diagnosing Human-object Interaction DetectorsCode1
Comparison Of Deep Object Detectors On A New Vulnerable Pedestrian DatasetCode1
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-raysCode1
Compact Generalized Non-local NetworkCode1
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection TransformerCode1
Show:102550
← PrevPage 35 of 439Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified