SOTAVerified

Object Detection

Papers

Showing 901950 of 10957 papers

TitleStatusHype
Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based SegmentationCode1
DPNet: Dual-Path Network for Real-time Object Detection with Lightweight AttentionCode1
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and GroundingCode1
DQS3D: Densely-matched Quantization-aware Semi-supervised 3D DetectionCode1
DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object DetectionCode1
DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmenCode1
Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone ImagesCode1
Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D SceneCode1
ApproxDet: Content and Contention-Aware Approximate Object Detection for MobilesCode1
DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN TrainingCode1
A Survey of Self-Supervised and Few-Shot Object DetectionCode1
3D Small Object Detection with Dynamic Spatial PruningCode1
Data Augmentation for Object Detection via Differentiable Neural RenderingCode1
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object DetectionCode1
Advancing Vision Transformers with Group-Mix AttentionCode1
DualConv: Dual Convolutional Kernels for Lightweight Deep Neural NetworksCode1
Dual-Level Collaborative Transformer for Image CaptioningCode1
Dual Memory Aggregation Network for Event-Based Object Detection with Learnable RepresentationCode1
Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave RadarCode1
DUNIT: Detection-Based Unsupervised Image-to-Image TranslationCode1
AQD: Towards Accurate Fully-Quantized Object DetectionCode1
Dynamic Context-Sensitive Filtering Network for Video Salient Object DetectionCode1
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous ModalitiesCode1
AquaVision: Automating the detection of waste in water bodies using deep transfer learningCode1
DARDet: A Dense Anchor-free Rotated Object Detector in Aerial ImagesCode1
A Random CNN Sees Objects: One Inductive Bias of CNN and Its ApplicationsCode1
A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object DetectionCode1
Adversarial Attack and Defense of YOLO Detectors in Autonomous Driving ScenariosCode1
DAROD: A Deep Automotive Radar Object Detector on Range-Doppler mapsCode1
3DMOTFormer: Graph Transformer for Online 3D Multi-Object TrackingCode1
ASSD: Attentive Single Shot Multibox DetectorCode1
Dynamic Retraining-Updating Mean Teacher for Source-Free Object DetectionCode1
3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D DetectionCode1
E^2TAD: An Energy-Efficient Tracking-based Action DetectorCode1
3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance SegmentationCode1
EARL: An Elliptical Distribution aided Adaptive Rotation Label Assignment for Oriented Object Detection in Remote Sensing ImagesCode1
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse MotionCode1
EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural NetworksCode1
A Real-time Low-cost Artificial Intelligence System for Autonomous Spraying in Palm PlantationsCode1
DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model TrainingCode1
Dataset Enhancement with Instance-Level AugmentationsCode1
EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware AccelerationCode1
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing AttentionCode1
Effect of Annotation Errors on Drone Detection with YOLOv3Code1
RQFormer: Rotated Query Transformer for End-to-End Oriented Object DetectionCode1
A recurrent CNN for online object detection on raw radar framesCode1
Nonlinear optical encoding enabled by recurrent linear scatteringCode1
Efficient Few-Shot Object Detection via Knowledge InheritanceCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
Domain-Adaptive Self-Supervised Pre-Training for Face & Body Detection in DrawingsCode1
Show:102550
← PrevPage 19 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified