SOTAVerified

Object Detection

Papers

Showing 28262850 of 10957 papers

TitleStatusHype
Taking a PEEK into YOLOv5 for Satellite Component Recognition via Entropy-based Visual Explanations0
Quantitative Evaluation of a Multi-Modal Camera Setup for Fusing Event Data with RGB Images0
Effective Human-AI Teams via Learned Natural Language Rules and OnboardingCode1
AiluRus: A Scalable ViT Framework for Dense PredictionCode0
InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV ImagesCode1
CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation0
Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLOCode1
CenterRadarNet: Joint 3D Object Detection and Tracking Framework using 4D FMCW Radar0
M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object DetectionCode0
Efficient Vision Transformer for Accurate Traffic Sign Detection0
Recognize Any RegionsCode1
Scattering Vision Transformer: Spectral Mixing Matters0
Enhancing Traffic Object Detection in Variable Illumination with RGB-Event FusionCode0
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in RainCode1
Re-Scoring Using Image-Language Similarity for Few-Shot Object DetectionCode1
HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point CloudsCode1
Spuriosity Rankings for Free: A Simple Framework for Last Layer Retraining Based on Object Detection0
View Classification and Object Detection in Cardiac Ultrasound to Localize Valves via Deep Learning0
YOLOv8-Based Visual Detection of Road Hazards: Potholes, Sewer Covers, and Manholes0
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision TasksCode2
DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object DetectionCode1
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual RecognitionCode2
Radar-Lidar Fusion for Object Detection by Designing Effective Convolution Networks0
Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient ?Code0
Improving Online Source-free Domain Adaptation for Object Detection by Unsupervised Data Acquisition0
Show:102550
← PrevPage 114 of 439Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified