SOTAVerified

Object Detection

Papers

Showing 901950 of 10957 papers

TitleStatusHype
Video-RAG: Visually-aligned Retrieval-Augmented Long Video ComprehensionCode3
YCB-LUMA: YCB Object Dataset with Luminance Keying for Object LocalizationCode0
VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation0
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous DrivingCode2
Physics-Guided Detector for SAR AirplanesCode1
Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster0
SL-YOLO: A Stronger and Lighter Drone Target Detection Model0
Exploring Emerging Trends and Research Opportunities in Visual Place Recognition0
WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images0
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection0
Vision Eagle Attention: a new lens for advancing image classificationCode1
Structure Tensor Representation for Robust Oriented Object Detection0
Diachronic Document Dataset for Semantic Layout Analysis0
Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions0
Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning0
RETR: Multi-View Radar Detection Transformer for Indoor PerceptionCode1
Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras0
RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering0
Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks0
Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction0
Cross-Modal Consistency in Multimodal Large Language Models0
DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines0
LEAP:D - A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection0
Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature IntegrationCode1
V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising DiffusionCode2
Multimodal Object Detection using Depth and Image Data for Manufacturing Parts0
UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation0
Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance0
Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning0
Large-scale Remote Sensing Image Target Recognition and Automatic AnnotationCode1
Depthwise Separable Convolutions with Deep Residual Convolutions0
Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs0
United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing ImagesCode0
Multi-scale Frequency Enhancement Network for Blind Image Deblurring0
LFSamba: Marry SAM with Mamba for Light Field Salient Object DetectionCode0
Fast and Efficient Transformer-based Method for Bird's Eye View Instance PredictionCode1
FuzzRisk: Online Collision Risk Estimation for Autonomous Vehicles based on Depth-Aware Object Detection via Fuzzy Inference0
AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems0
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance RepresentationCode1
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing0
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal ModelsCode1
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving0
Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent0
Open-set object detection: towards unified problem formulation and benchmarking0
SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection0
Exploring the Feasibility of Affordable Sonar Technology: Object Detection in Underwater Environments Using the Ping 360Code0
l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion0
On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution DataCode0
Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory0
UEVAVD: A Dataset for Developing UAV's Eye View Active Object DetectionCode0
Show:102550
← PrevPage 19 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified