SOTAVerified

Object Detection

Papers

Showing 1601-1650 of 10957 papers

| Title | Status | Hype |
|---|---|---|
| Disentangled High Quality Salient Object Detection | Code | 1 |
| Hybrid-Attention Guided Network with Multiple Resolution Features for Person Re-Identification | Code | 1 |
| DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training | Code | 1 |
| BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap | Code | 1 |
| BTS-Net: Bi-directional Transfer-and-Selection Network For RGB-D Salient Object Detection | Code | 1 |
| Bucketed Ranking-based Losses for Efficient Training of Object Detectors | Code | 1 |
| DFNet: Discriminative feature extraction and integration network for salient object detection | Code | 1 |
| Diagnosing Human-object Interaction Detectors | Code | 1 |
| An Interactively Reinforced Paradigm for Joint Infrared-Visible Image Fusion and Saliency Object Detection | Code | 1 |
| CLIP-Guided Source-Free Object Detection in Aerial Images | Code | 1 |
| DeVIS: Making Deformable Transformers Work for Video Instance Segmentation | Code | 1 |
| A Dataset for Provident Vehicle Detection at Night | Code | 1 |
| IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving | Code | 1 |
| IDa-Det: An Information Discrepancy-aware Distillation for 1-bit Detectors | Code | 1 |
| An Investigation into Whitening Loss for Self-supervised Learning | Code | 1 |
| DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection | Code | 1 |
| Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks | Code | 1 |
| An Investigation of Preprocessing Filters and Deep Learning Methods for Vessel Type Classification With Underwater Acoustic Data | Code | 1 |
| Image Augmentation for Multitask Few-Shot Learning: Agricultural Domain Use-Case | Code | 1 |
| Image Captioning through Image Transformer | Code | 1 |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Code | 1 |
| ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness | Code | 1 |
| Cached Transformers: Improving Transformers with Differentiable Memory Cache | Code | 1 |
| CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing Industry | Code | 1 |
| DETRs with Hybrid Matching | Code | 1 |
| Implementation of a perception system for autonomous vehicles using a detection-segmentation network in SoC FPGA | Code | 1 |
| Activation to Saliency: Forming High-Quality Labels for Completely Unsupervised Salient Object Detection | Code | 1 |
| Improved Residual Networks for Image and Video Recognition | Code | 1 |
| Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation | Code | 1 |
| CAGNet: Content-Aware Guidance for Salient Object Detection | Code | 1 |
| CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds | Code | 1 |
| CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection | Code | 1 |
| Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Code | 1 |
| Calibrated RGB-D Salient Object Detection | Code | 1 |
| Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Driving | Code | 1 |
| Taming Self-Training for Open-Vocabulary Object Detection | Code | 1 |
| Calibration-free BEV Representation for Infrastructure Perception | Code | 1 |
| Improving the Detection of Small Oriented Objects in Aerial Images | Code | 1 |
| CaLiV: LiDAR-to-Vehicle Calibration of Arbitrary Sensor Setups via Object Reconstruction | Code | 1 |
| CamDiff: Camouflage Image Augmentation via Diffusion Model | Code | 1 |
| Inception Convolution with Efficient Dilation Search | Code | 1 |
| Camera clustering for scalable stream-based active distillation | Code | 1 |
| Monitoring and Adapting the Physical State of a Camera for Autonomous Vehicles | Code | 1 |
| Incremental Learning Techniques for Semantic Segmentation | Code | 1 |
| CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models | Code | 1 |
| CamoFormer: Masked Separable Attention for Camouflaged Object Detection | Code | 1 |
| IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting | Code | 1 |
| AutoAssign: Differentiable Label Assignment for Dense Object Detection | Code | 1 |
| Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding Box | Code | 1 |
| AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection | Code | 1 |

Benchmark Results

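| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |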