SOTAVerified

Object Detection

Papers

Showing 951975 of 10957 papers

TitleStatusHype
TransGOP: Transformer-Based Gaze Object PredictionCode1
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception TasksCode1
UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object TrackingCode1
Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI PoolingCode1
LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object DetectionCode1
ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual ConnectionsCode1
GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph CreationCode1
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity RecognitionCode1
Efficient One-stage Video Object Detection by Exploiting Temporal ConsistencyCode1
TDViT: Temporal Dilated Video Transformer for Dense Video TasksCode1
Switch EMA: A Free Lunch for Better Flatness and SharpnessCode1
AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision TransformerCode1
MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLOCode1
G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object DetectionCode1
Spatio-temporal Prompting Network for Robust Video Feature ExtractionCode1
Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous DrivingCode1
RIDERS: Radar-Infrared Depth Estimation for Robust SensingCode1
SU-SAM: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed ScenesCode1
SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object DetectionCode1
pLitterStreet: Street Level Plastic Litter Detection and MappingCode1
MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under UncertaintyCode1
Rethinking Centered Kernel Alignment in Knowledge DistillationCode1
Focaler-IoU: More Focused Intersection over Union LossCode1
BlenDA: Domain Adaptive Object Detection through diffusion-based blendingCode1
MAMBA: Multi-level Aggregation via Memory Bank for Video Object DetectionCode1
Show:102550
← PrevPage 39 of 439Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified