SOTAVerified

Object Detection

Papers

Showing 951-1000 of 10957 papers

Title | Status | Hype
TransGOP: Transformer-Based Gaze Object Prediction | Code | 1
UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Code | 1
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Code | 1
Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Code | 1
LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection | Code | 1
GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | Code | 1
ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Code | 1
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Code | 1
Switch EMA: A Free Lunch for Better Flatness and Sharpness | Code | 1
TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Code | 1
Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Code | 1
MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO | Code | 1
AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer | Code | 1
G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection | Code | 1
Spatio-temporal Prompting Network for Robust Video Feature Extraction | Code | 1
RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Code | 1
Multimodal-Enhanced Objectness Learner for Corner Case Detection in Autonomous Driving | Code | 1
SU-SAM: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes | Code | 1
SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection | Code | 1
pLitterStreet: Street Level Plastic Litter Detection and Mapping | Code | 1
MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty | Code | 1
Rethinking Centered Kernel Alignment in Knowledge Distillation | Code | 1
Focaler-IoU: More Focused Intersection over Union Loss | Code | 1
BlenDA: Domain Adaptive Object Detection through diffusion-based blending | Code | 1
MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection | Code | 1
Trapped in texture bias? A large scale comparison of deep instance segmentation | Code | 1
SAMF: Small-Area-Aware Multi-focus Image Fusion for Object Detection | Code | 1
DCDet: Dynamic Cross-based 3D Object Detector | Code | 1
Improving the Detection of Small Oriented Objects in Aerial Images | Code | 1
CLIP-Guided Source-Free Object Detection in Aerial Images | Code | 1
Generic Knowledge Boosted Pre-training For Remote Sensing Images | Code | 1
A Flying Bird Object Detection Method for Surveillance Video | Code | 1
What How and When Should Object Detectors Update in Continually Changing Test Domains? | Code | 1
Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning | Code | 1
Towards Robust 3D Object Detection with LiDAR and 4D Radar Fusion in Various Weather Conditions | Code | 1
Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training | Code | 1
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Code | 1
PairDETR : Joint Detection and Association of Human Bodies and Faces | Code | 1
Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes | Code | 1
CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection | Code | 1
Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective | Code | 1
HINTED: Hard Instance Enhanced Detector with Mixed-Density Feature Fusion for Sparsely-Supervised 3D Object Detection | Code | 1
Referring Expression Counting | Code | 1
Shape-IoU: More Accurate Metric considering Bounding Box Shape and Scale | Code | 1
DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection | Code | 1
MonoLSS: Learnable Sample Selection For Monocular 3D Detection | Code | 1
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection | Code | 1
DECO: Query-Based End-to-End Object Detection with ConvNets | Code | 1
Universal Noise Annotation: Unveiling the Impact of Noisy annotation on Object Detection | Code | 1
SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | - | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | - | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | - | Unverified
4 | MoCaE | box mAP | 65.1 | - | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | - | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | - | Unverified
7 | EVA | box mAP | 64.7 | - | Unverified
8 | Group DETR v2 | box mAP | 64.5 | - | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | - | Unverified
10 | InternImage-XL | box mAP | 64.3 | - | Unverified