SOTAVerified

Object Detection

Papers

Showing 851900 of 10957 papers

TitleStatusHype
On Moving Object Segmentation from Monocular Video with Transformers0
HDI-Former: Hybrid Dynamic Interaction ANN-SNN Transformer for Object Detection Using Frames and Events0
ROICtrl: Boosting Instance Control for Visual Generation0
Deep Fourier-embedded Network for Bi-modal Salient Object DetectionCode1
RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos0
Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks0
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel ObjectsCode1
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models0
TinyViM: Frequency Decoupling for Tiny Hybrid Vision MambaCode2
Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and ImplementationCode1
Interpretable Dynamic Graph Neural Networks for Small Occluded Object Detection and Tracking0
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learningCode0
Open Vocabulary Monocular 3D Object DetectionCode2
Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory0
Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment0
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation TrainingCode2
CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation0
Interpreting Object-level Foundation Models via Visual Precision SearchCode2
Leverage Task Context for Object Affordance Ranking0
Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and TasksCode1
CIA: Controllable Image Augmentation Framework Based on Stable DiffusionCode0
Imperceptible Adversarial Examples in the Physical World0
Diagnosis of diabetic retinopathy using machine learning & deep learning technique0
Learn from Foundation Model: Fruit Detection Model without Manual AnnotationCode1
AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks0
LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic AnnotationCode1
Towards RAW Object Detection in Diverse ConditionsCode1
Highly Efficient and Unsupervised Framework for Moving Object Detection in Satellite VideosCode1
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data GenerationCode2
Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation0
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data0
Twin Trigger Generative Networks for Backdoor Attacks against Object Detection0
Enhancing Object Detection Accuracy in Autonomous Vehicles Using Synthetic Data0
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUsCode1
A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles0
MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving0
VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving0
Beneath the Surface: The Role of Underwater Image Enhancement in Object DetectionCode0
Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation0
DINO-X: A Unified Vision Model for Open-World Object Detection and UnderstandingCode5
AnywhereDoor: Multi-Target Backdoor Attacks on Object DetectionCode0
Transforming Static Images Using Generative Models for Video Salient Object Detection0
WARLearn: Weather-Adaptive Representation LearningCode0
Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection0
MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection0
Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors0
RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image GenerationCode2
VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation0
Video-RAG: Visually-aligned Retrieval-Augmented Long Video ComprehensionCode3
Show:102550
← PrevPage 18 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified