SOTAVerified

Object Detection

Papers

Showing 37513800 of 10957 papers

TitleStatusHype
WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images0
SL-YOLO: A Stronger and Lighter Drone Target Detection Model0
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection0
Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning0
Structure Tensor Representation for Robust Oriented Object Detection0
Diachronic Document Dataset for Semantic Layout Analysis0
Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras0
Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions0
LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection0
RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering0
DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines0
Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction0
Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks0
Cross-Modal Consistency in Multimodal Large Language Models0
UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation0
Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance0
Multimodal Object Detection using Depth and Image Data for Manufacturing Parts0
Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning0
Depthwise Separable Convolutions with Deep Residual Convolutions0
Multi-scale Frequency Enhancement Network for Blind Image Deblurring0
Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs0
United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing ImagesCode0
LFSamba: Marry SAM with Mamba for Light Field Salient Object DetectionCode0
FuzzRisk: Online Collision Risk Estimation for Autonomous Vehicles based on Depth-Aware Object Detection via Fuzzy Inference0
AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems0
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing0
Open-set object detection: towards unified problem formulation and benchmarking0
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving0
SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection0
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent0
l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion0
Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory0
On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution DataCode0
Exploring the Feasibility of Affordable Sonar Technology: Object Detection in Underwater Environments Using the Ping 360Code0
UEVAVD: A Dataset for Developing UAV's Eye View Active Object DetectionCode0
Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage0
ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing0
Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data0
Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery0
An Application-Agnostic Automatic Target Recognition System Using Vision Language Models0
From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing0
Correlation of Object Detection Performance with Visual Saliency and Depth EstimationCode0
LiDAttack: Robust Black-box Attack on LiDAR-based Object DetectionCode0
SIRA: Scalable Inter-frame Relation and Association for Radar Perception0
V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams0
Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems0
OSAD: Open-Set Aircraft Detection in SAR Images0
Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision0
One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection0
A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning0
Show:102550
← PrevPage 76 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified