SOTAVerified

Object Detection

Papers

Showing 13511400 of 10957 papers

TitleStatusHype
RFAConv: Innovating Spatial Attention and Standard Convolutional OperationCode1
Multi-view Adversarial Discriminator: Mine the Non-causal Factors for Object Detection in Unseen DomainsCode1
MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object DetectionCode1
Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object DetectionCode1
VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue DistributionCode1
Form-NLU: Dataset for the Form Natural Language UnderstandingCode1
CRN: Camera Radar Net for Accurate, Robust, Efficient 3D PerceptionCode1
Open-Vocabulary Point-Cloud Object Detection without 3D AnnotationCode1
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object DetectionCode1
DeGPR: Deep Guided Posterior Regularization for Multi-Class Cell Detection and CountingCode1
Q-DETR: An Efficient Low-Bit Quantized Detection TransformerCode1
Rethinking Local Perception in Lightweight Vision TransformerCode1
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision TransformerCode1
Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous DrivingCode1
An intelligent modular real-time vision-based system for environment perceptionCode1
DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and TrackingCode1
T-FFTRadNet: Object Detection with Swin Vision Transformers from Raw ADC Radar SignalsCode1
SimDistill: Simulated Multi-modal Distillation for BEV 3D Object DetectionCode1
Explicit Attention-Enhanced Fusion for RGB-Thermal Perception TasksCode1
LinK: Linear Kernel for LiDAR-based 3D PerceptionCode1
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye ViewCode1
3D Video Object Detection with Learnable Object-Centric Global OptimizationCode1
Feature Shrinkage Pyramid for Camouflaged Object Detection with TransformersCode1
ZBS: Zero-shot Background Subtraction via Instance-level Background Modeling and Foreground SelectionCode1
Ensemble-based Blackbox Attacks on Dense PredictionCode1
Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone ImagesCode1
Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object DetectionCode1
Viewpoint Equivariance for Multi-View 3D Object DetectionCode1
Freestyle Layout-to-Image SynthesisCode1
Unknown Sniffer for Object Detection: Don't Turn a Blind Eye to Unknown ObjectsCode1
2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object DetectionCode1
Physically Adversarial Infrared Patches with Learnable Shapes and LocationsCode1
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-MatchingCode1
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-TrainingCode1
Box-Level Active DetectionCode1
The effectiveness of MAE pre-pretraining for billion-scale pretrainingCode1
Rigidity-Aware Detection for 6D Object Pose EstimationCode1
Detecting the open-world objects with the help of the BrainCode1
Understanding the Role of the Projector in Knowledge DistillationCode1
VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object DetectionCode1
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D UnderstandingCode1
Constructing Metric-Semantic Maps using Floor Plan Priors for Long-Term Indoor LocalizationCode1
Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow PredictionCode1
CCTV-Gun: Benchmarking Handgun Detection in CCTV ImagesCode1
Identification of Novel Classes for Improving Few-Shot Object DetectionCode1
Dual Memory Aggregation Network for Event-Based Object Detection with Learnable RepresentationCode1
CAPE: Camera View Position Embedding for Multi-View 3D Object DetectionCode1
Scribble-Supervised RGB-T Salient Object DetectionCode1
Rethinking Model Ensemble in Transfer-based Adversarial AttacksCode1
Among Us: Adversarially Robust Collaborative Perception by ConsensusCode1
Show:102550
← PrevPage 28 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified