SOTAVerified

Object Detection

Papers

Showing 1001-1050 of 10957 papers

Title | Status | Hype
Cached Transformers: Improving Transformers with Differentiable Memory Cache | Code | 1
Object-Aware Domain Generalization for Object Detection | Code | 1
The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark | Code | 1
CLIM: Contrastive Language-Image Mosaic for Region Representation | Code | 1
Transformers in Unsupervised Structure-from-Motion | Code | 1
PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection | Code | 1
Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection | Code | 1
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning | Code | 1
Simple Image-level Classification Improves Open-vocabulary Object Detection | Code | 1
FoMo-Bench: a multi-modal, multi-scale and multi-task Forest Monitoring Benchmark for remote sensing foundation models | Code | 1
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Code | 1
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Code | 1
PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection | Code | 1
DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection | Code | 1
Relax Image-Specific Prompt Requirement in SAM: A Single Generic Prompt for Segmenting Camouflaged Objects | Code | 1
Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Code | 1
MedYOLO: A Medical Image Object Detection Framework | Code | 1
Mixed Pseudo Labels for Semi-Supervised Object Detection | Code | 1
What, How, and When Should Object Detectors Update in Continually Changing Test Domains? | Code | 1
Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | Code | 1
CholecTrack20: A Dataset for Multi-Class Multiple Tool Tracking in Laparoscopic Surgery | Code | 1
ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection | Code | 1
MaxQ: Multi-Axis Query for N:M Sparsity Network | Code | 1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection | Code | 1
SiCP: Simultaneous Individual and Cooperative Perception for 3D Object Detection in Connected and Automated Vehicles | Code | 1
Image and AIS Data Fusion Technique for Maritime Computer Vision Applications | Code | 1
Bootstrapping Autonomous Driving Radars with Self-Supervised Learning | Code | 1
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation | Code | 1
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Code | 1
Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning | Code | 1
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection | Code | 1
Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited Labels | Code | 1
Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation | Code | 1
Spectrum-driven Mixed-frequency Network for Hyperspectral Salient Object Detection | Code | 1
Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Code | 1
Is Underwater Image Enhancement All Object Detectors Need? | Code | 1
Do text-free diffusion models learn discriminative visual representations? | Code | 1
LEOD: Label-Efficient Object Detection for Event Cameras | Code | 1
The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding | Code | 1
RQFormer: Rotated Query Transformer for End-to-End Oriented Object Detection | Code | 1
Unified-modal Salient Object Detection via Adaptive Prompt Learning | Code | 1
Advancing Vision Transformers with Group-Mix Attention | Code | 1
VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning | Code | 1
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks | Code | 1
Periodically Exchange Teacher-Student for Source-Free Object Detection | Code | 1
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision | Code | 1
PointOBB: Learning Oriented Object Detection via Single Point Supervision | Code | 1
MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection | Code | 1
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning | Code | 1
LDConv: Linear deformable convolution for improving convolutional neural networks | Code | 1
Page 21 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified