SOTAVerified

Object Detection

Papers

Showing 451500 of 10957 papers

TitleStatusHype
Deep learning approaches to surgical video segmentation and object detection: A Scoping Review0
MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering0
FeatSharp: Your Vision Model Features, Sharper0
Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection0
Generative AI Framework for 3D Object Generation in Augmented Reality0
Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection0
KnowZRel: Common Sense Knowledge-based Zero-Shot Relationship Retrieval for Generalised Scene Graph GenerationCode0
Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving ScenariosCode0
YOLOv12: A Breakdown of the Key Architectural Features0
ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v110
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera0
MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection0
Image compositing is all you need for data augmentation0
GroundCap: A Visually Grounded Image Captioning Dataset0
An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice0
DAMamba: Vision State Space Model with Dynamic Adaptive ScanCode2
Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation0
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection0
Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection0
CoDiff: Conditional Diffusion Model for Collaborative 3D Object DetectionCode1
Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection0
DA-Mamba: Domain Adaptive Hybrid Mamba-Transformer Based One-Stage Object DetectionCode1
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs0
Text-guided Sparse Voxel Pruning for Efficient 3D Visual GroundingCode3
Object Detection and TrackingCode0
Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection0
Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object DetectionCode1
Instance Segmentation of Scene Sketches Using Natural Image Priors0
Deep Reinforcement Learning-Based User Scheduling for Collaborative Perception0
Knowledge Swapping via Learning and UnlearningCode0
Plantation Monitoring Using Drone Images: A Dataset and Performance Review0
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image InterpretationCode2
Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation0
Uncertainty Aware Human-machine Collaboration in Camouflaged Object DetectionCode0
A Survey on Mamba Architecture for Vision Applications0
SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer0
Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving0
Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m0
Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement SystemsCode0
Dense Object Detection Based on De-homogenized Queries0
Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object DetectionCode0
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning0
Secure Visual Data Processing via Federated Learning0
Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector0
AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers0
Counting Fish with Temporal Representations of Sonar Video0
MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detectionCode2
DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection0
LP-DETR: Layer-wise Progressive Relations for Object Detection0
A Performance Analysis of You Only Look Once Models for Deployment on Constrained Computational Edge Devices in Drone Applications0
Show:102550
← PrevPage 10 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified