SOTAVerified

Object Detection

Papers

Showing 150 of 10957 papers

TitleStatusHype
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains0
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images0
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis0
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection0
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios0
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping1
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge0
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR RepresentationsCode1
MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object DetectionCode2
Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target DetectionCode0
Detection of Rail Line Track and Human Beings Near the Track to Avoid Accidents0
Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed AugmentationCode1
Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement LearningCode2
Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object DetectionCode0
DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic0
A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario0
LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object DetectionCode0
ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation0
Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos0
TDiR: Transformer based Diffusion for Image Restoration Tasks0
Feature Hallucination for Self-supervised Action Recognition0
From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents0
A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects0
Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages0
YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual PerceptionCode5
Class Agnostic Instance-level Descriptor for Visual Instance Search0
Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology AugmentationCode0
Retrospective Memory for Camouflaged Object Detection0
VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and ReasoningCode0
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection FrameworkCode4
Comparison of Two Methods for Stationary Incident Detection Based on Background Image0
How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection0
Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection0
UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data0
FindMeIfYouCan: Bringing Open Set metrics to near , far and farther Out-of-Distribution Object Detection0
Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational VideosCode0
Focusing on Tracks for Online Multi-Object TrackingCode2
MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data ExtractionCode0
Vision-based Lifting of 2D Object Detections for Automated Driving0
Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds0
FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image FusionCode0
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration0
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object DetectionCode1
Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement0
DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos0
CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects0
WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras0
Data Augmentation For Small Object using Fast AutoAugment0
Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection0
ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations0
Show:102550
← PrevPage 1 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified