SOTAVerified

Object Detection

Papers

Showing 851-900 of 10957 papers

Title | Status | Hype
CodeReef: an open platform for portable MLOps, reusable automation actions and reproducible benchmarking | Code | 1
CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Code | 1
CBNet: A Composite Backbone Network Architecture for Object Detection | Code | 1
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection | Code | 1
CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited Annotations | Code | 1
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection | Code | 1
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Code | 1
Collaborative Camouflaged Object Detection: A Large-Scale Dataset and Benchmark | Code | 1
Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images | Code | 1
Collaborative Transformers for Grounded Situation Recognition | Code | 1
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Code | 1
CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection | Code | 1
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition | Code | 1
Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection | Code | 1
ComPtr: Towards Diverse Bi-source Dense Prediction Tasks via A Simple yet General Complementary Transformer | Code | 1
Common Limitations of Image Processing Metrics: A Picture Story | Code | 1
A Dual-Cycled Cross-View Transformer Network for Unified Road Layout Estimation and 3D Object Detection in the Bird's-Eye-View | Code | 1
Compact Generalized Non-local Network | Code | 1
An Ultra-low Power TinyML System for Real-time Visual Processing at Edge | Code | 1
A Dual Weighting Label Assignment Scheme for Object Detection | Code | 1
3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection | Code | 1
Comparison Of Deep Object Detectors On A New Vulnerable Pedestrian Dataset | Code | 1
COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection | Code | 1
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Code | 1
Disentangled High Quality Salient Object Detection | Code | 1
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection | Code | 1
DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training | Code | 1
DropLoss for Long-Tail Instance Segmentation | Code | 1
An Explicit Local and Global Representation Disentanglement Framework with Applications in Deep Clustering and Unsupervised Object Detection | Code | 1
Concealed Object Detection | Code | 1
AO2-DETR: Arbitrary-Oriented Object Detection Transformer | Code | 1
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions | Code | 1
CondenseNet V2: Sparse Feature Reactivation for Deep Networks | Code | 1
Conditional DETR for Fast Training Convergence | Code | 1
Confidence-Aware Learning for Camouflaged Object Detection | Code | 1
DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection | Code | 1
3D-LaneNet: End-to-End 3D Multiple Lane Detection | Code | 1
AP-Loss for Accurate One-Stage Object Detection | Code | 1
Conformer: Local Features Coupling Global Representations for Visual Recognition | Code | 1
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection | Code | 1
Categorical Depth Distribution Network for Monocular 3D Object Detection | Code | 1
Texture-guided Saliency Distilling for Unsupervised Salient Object Detection | Code | 1
Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation | Code | 1
Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving | Code | 1
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning | Code | 1
3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection | Code | 1
Context-Transformer: Tackling Object Confusion for Few-Shot Detection | Code | 1
Context-aware Cross-level Fusion Network for Camouflaged Object Detection | Code | 1
DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Code | 1
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified