SOTAVerified

Object Detection

Papers

Showing 2876-2900 of 10957 papers

Title | Status | Hype
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Code | 3
Salient Object Detection in RGB-D Videos | Code | 1
Mean Teacher DETR with Masked Feature Alignment: A Robust Domain Adaptive Detection Transformer Framework |  | 0
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection |  | 0
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting |  | 0
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection | Code | 1
Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLA | Code | 1
Pre-Training LiDAR-Based 3D Object Detectors Through Colorization | Code | 0
Rethinking Scale Imbalance in Semi-supervised Object Detection for Aerial Images |  | 0
MaRU: A Manga Retrieval and Understanding System Connecting Vision and Language |  | 0
The Importance of Anti-Aliasing in Tiny Object Detection | Code | 0
Skipped Feature Pyramid Network with Grid Anchor for Object Detection |  | 0
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding | Code | 1
Deep MDP: A Modular Framework for Multi-Object Tracking | Code | 0
Guidance system for Visually Impaired Persons using Deep Learning and Optical flow |  | 0
Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images | Code | 1
Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS |  | 0
A review of individual tree crown detection and delineation from optical remote sensing images |  | 0
EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View | Code | 1
ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map Construction | Code | 1
Zone Evaluation: Revealing Spatial Bias in Object Detection | Code | 1
Multi‑camera trajectory matching based on hierarchical clustering and constraints | Code | 1
RTNH+: Enhanced 4D Radar Object Detection Network using Combined CFAR-based Two-level Preprocessing and Vertical Encoding |  | 0
DT/MARS-CycleGAN: Improved Object Detection for MARS Phenotyping Robot |  | 0
Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 |  | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 |  | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 |  | Unverified
4 | MoCaE | box mAP | 65.1 |  | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 |  | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 |  | Unverified
7 | EVA | box mAP | 64.7 |  | Unverified
8 | Group DETR v2 | box mAP | 64.5 |  | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 |  | Unverified
10 | InternImage-XL | box mAP | 64.3 |  | Unverified