SOTAVerified

Object Detection

Papers

Showing 151-200 of 10957 papers

| Title | Status | Hype |
| --- | --- | --- |
| MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection | Code | 2 |
| Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning | Code | 2 |
| Focusing on Tracks for Online Multi-Object Tracking | Code | 2 |
| Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding | Code | 2 |
| Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models | Code | 2 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Code | 2 |
| Rethinking Features-Fused-Pyramid-Neck for Object Detection | Code | 2 |
| DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception | Code | 2 |
| NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results | Code | 2 |
| Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation | Code | 2 |
| Self-Prompting Analogical Reasoning for UAV Object Detection | Code | 2 |
| P2Object: Single Point Supervised Object Detection and Instance Segmentation | Code | 2 |
| Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Code | 2 |
| Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Code | 2 |
| Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection | Code | 2 |
| LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection | Code | 2 |
| RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Code | 2 |
| Referring to Any Person | Code | 2 |
| MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Code | 2 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Code | 2 |
| SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation | Code | 2 |
| MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection | Code | 2 |
| iFormer: Integrating ConvNet and Transformer for Mobile Application | Code | 2 |
| YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Code | 2 |
| PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection | Code | 2 |
| LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks | Code | 2 |
| Practical Continual Forgetting for Pre-trained Vision Models | Code | 2 |
| A Simple Aerial Detection Baseline of Multimodal Language Models | Code | 2 |
| UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery | Code | 2 |
| Samba: A Unified Mamba-based Framework for General Salient Object Detection | Code | 2 |
| YOLO-UniOW: Efficient Universal Open-World Object Detection | Code | 2 |
| CGCOD: Class-Guided Camouflaged Object Detection | Code | 2 |
| MR-GDINO: Efficient Open-World Continual Object Detection | Code | 2 |
| A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space | Code | 2 |
| Joint Perception and Prediction for Autonomous Driving: A Survey | Code | 2 |
| SCoralDet: Efficient real-time underwater soft coral detection with YOLO | Code | 2 |
| HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection | Code | 2 |
| Mr. DETR: Instructive Multi-Route Training for Detection Transformers | Code | 2 |
| RemDet: Rethinking Efficient Model Design for UAV Object Detection | Code | 2 |
| Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Code | 2 |
| EMOv2: Pushing 5M Vision Model Frontier | Code | 2 |
| Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset | Code | 2 |
| DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Code | 2 |
| SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning | Code | 2 |
| OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Code | 2 |
| TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba | Code | 2 |
| Open Vocabulary Monocular 3D Object Detection | Code | 2 |
| Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Code | 2 |
| Interpreting Object-level Foundation Models via Visual Precision Search | Code | 2 |
| AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation | Code | 2 |

Benchmark Results

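| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |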