SOTAVerified

Object Detection

Papers

Showing 126–150 of 10957 papers

Title | Status | Hype
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Code | 3
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection | Code | 3
MagicDrive: Street View Generation with Diverse 3D Geometry Control | Code | 3
LION: Linear Group RNN for 3D Object Detection in Point Clouds | Code | 3
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Code | 3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Code | 3
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Code | 3
How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection | Code | 3
A Comparative Analysis of Object Detection Metrics with a Companion Open-Source Toolkit | Code | 3
Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Code | 3
Bag of Freebies for Training Object Detection Neural Networks | Code | 3
General Object Foundation Model for Images and Videos at Scale | Code | 3
Geometric-aware Pretraining for Vision-centric 3D Object Detection | Code | 3
A Survey of Camouflaged Object Detection and Beyond | Code | 3
BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection | Code | 3
Frequency Dynamic Convolution for Dense Image Prediction | Code | 3
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Code | 3
EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation | Code | 3
Falcon: A Remote Sensing Vision-Language Foundation Model | Code | 3
EfficientDet: Scalable and Efficient Object Detection | Code | 3
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One | Code | 3
A Survey on Performance Metrics for Object-Detection Algorithms | Code | 3
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection | Code | 3
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Code | 3
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network | Code | 3
Page 6 of 439

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | - | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | - | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | - | Unverified
4 | MoCaE | box mAP | 65.1 | - | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | - | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | - | Unverified
7 | EVA | box mAP | 64.7 | - | Unverified
8 | Group DETR v2 | box mAP | 64.5 | - | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | - | Unverified
10 | InternImage-XL | box mAP | 64.3 | - | Unverified