SOTAVerified

Object Detection

Papers

Showing 601-625 of 10957 papers

Title | Status | Hype
Feature Pyramid Networks for Object Detection | Code | 2
SSD: Single Shot MultiBox Detector | Code | 2
Fast R-CNN | Code | 2
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Code | 1
Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation | Code | 1
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Code | 1
Multiple Object Stitching for Unsupervised Representation Learning | Code | 1
Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector | Code | 1
GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Code | 1
OD3: Optimization-free Dataset Distillation for Object Detection | Code | 1
Adaptive Semantic Token Communication for Transformer-based Edge Inference | Code | 1
AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | Code | 1
Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Code | 1
AGI-Elo: How Far Are We From Mastering A Task? | Code | 1
M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection | Code | 1
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Code | 1
M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | Code | 1
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | Code | 1
A Simple Detector with Frame Dynamics is a Strong Tracker | Code | 1
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective | Code | 1
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Code | 1
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Code | 1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Code | 1
E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Code | 1
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Code | 1
Page 25 of 439

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified