SOTAVerified

Object Detection

Papers

Showing 301350 of 10957 papers

TitleStatusHype
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language ModelsCode1
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata ExtractionCode0
Single Shot AI-assisted quantification of KI-67 proliferation index in breast cancer0
Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception0
MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection0
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite ImageryCode1
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach0
Frequency Dynamic Convolution for Dense Image PredictionCode3
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object DetectionCode0
Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection0
LGI-DETR: Local-Global Interaction for UAV Object Detection0
MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability0
R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception0
Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection0
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model0
Superpowering Open-Vocabulary Object Detectors for X-ray VisionCode1
Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos0
Should we pre-train a decoder in contrastive learning for dense prediction tasks?0
Event-Based Crossing Dataset (EBCD)Code0
An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection0
You Only Look Once at Anytime (AnytimeYOLO): Analysis and Optimization of Early-Exits for Object-Detection0
Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving0
Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes0
Region Masking to Accelerate Video Processing on Neuromorphic Hardware0
Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis0
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection0
MapGlue: Multimodal Remote Sensing Image MatchingCode0
RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles0
A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions0
Test-Time Backdoor Detection for Object Detection Models0
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark0
DCA: Dividing and Conquering Amnesia in Incremental Object DetectionCode0
UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection FrameworkCode1
Robust Object Detection of Underwater Robot based on Domain GeneralizationCode1
Shift, Scale and Rotation Invariant Multiple Object Detection using Balanced Joint Transform Correlator0
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data GenerationCode0
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection0
Is Discretization Fusion All You Need for Collaborative Perception?Code1
HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object DetectionCode0
LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object DetectionCode2
FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene0
State Space Model Meets Transformer: A New Paradigm for 3D Object DetectionCode1
PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point CloudsCode0
A Revisit to the Decoder for Camouflaged Object Detection0
SparseAlign: A Fully Sparse Framework for Cooperative Object Detection0
Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection0
MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models0
8-Calves Image datasetCode0
Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization0
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing0
Show:102550
← PrevPage 7 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified