SOTAVerified

Object Detection

Papers

Showing 901925 of 10957 papers

TitleStatusHype
Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based SegmentationCode1
Do text-free diffusion models learn discriminative visual representations?Code1
Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone ImagesCode1
DPNet: Dual-Path Network for Real-time Object Detection with Lightweight AttentionCode1
Learning Dynamic Query Combinations for Transformer-based Object Detection and SegmentationCode1
ASOD60K: An Audio-Induced Salient Object Detection Dataset for Panoramic VideosCode1
Advancing Referring Expression Segmentation Beyond Single ImageCode1
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D SceneCode1
ApproxDet: Content and Contention-Aware Approximate Object Detection for MobilesCode1
DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object DetectionCode1
A Structure-Aware Relation Network for Thoracic Diseases Detection and SegmentationCode1
DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN TrainingCode1
Advancing Self-supervised Monocular Depth Learning with Sparse LiDARCode1
DSGN: Deep Stereo Geometry Network for 3D Object DetectionCode1
Advancing Vision Transformers with Group-Mix AttentionCode1
3D Small Object Detection with Dynamic Spatial PruningCode1
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object DetectionCode1
A Simple Pooling-Based Design for Real-Time Salient Object DetectionCode1
Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave RadarCode1
Dual-Level Collaborative Transformer for Image CaptioningCode1
AQD: Towards Accurate Fully-Quantized Object DetectionCode1
Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous DrivingCode1
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous ModalitiesCode1
AquaVision: Automating the detection of waste in water bodies using deep transfer learningCode1
Data Augmentation for Object Detection via Differentiable Neural RenderingCode1
Show:102550
← PrevPage 37 of 439Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified