SOTAVerified

Object Detection

Papers

Showing 1376–1400 of 10957 papers

Title | Status | Hype
Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Code | 1
GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Code | 2
SC3D: Label-Efficient Outdoor 3D Object Detection via Single Click Annotation |  | 0
CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection |  | 0
Learned Multimodal Compression for Autonomous Driving |  | 0
Sign language recognition based on deep learning and low-cost handcrafted descriptors | Code | 0
Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Code | 3
Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection |  | 0
See It All: Contextualized Late Aggregation for 3D Dense Captioning |  | 0
Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces |  | 0
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection | Code | 1
Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries |  | 0
Unified-IoU: For High-Quality Object Detection | Code | 1
Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions |  | 0
MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers |  | 0
Latent Disentanglement for Low Light Image Enhancement |  | 0
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts |  | 0
MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception | Code | 0
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection | Code | 0
Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes |  | 0
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection |  | 0
Optimizing Vision Transformers with Data-Free Knowledge Transfer |  | 0
PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection | Code | 1
FADE: A Dataset for Detecting Falling Objects around Buildings in Video | Code | 1
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training | Code | 0
Page 56 of 439

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 |  | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 |  | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 |  | Unverified
4 | MoCaE | box mAP | 65.1 |  | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 |  | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 |  | Unverified
7 | EVA | box mAP | 64.7 |  | Unverified
8 | Group DETR v2 | box mAP | 64.5 |  | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 |  | Unverified
10 | InternImage-XL | box mAP | 64.3 |  | Unverified