SOTAVerified

Object Detection

Papers

Showing 751800 of 10957 papers

TitleStatusHype
Hespi: A pipeline for automatically detecting information from hebarium specimen sheetsCode1
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space ModelCode1
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary DetectionCode1
Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and FutureCode1
A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particlesCode1
OSSA: Unsupervised One-Shot Style AdaptationCode1
OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing ImagesCode1
A Confidence-Aware Matching Strategy For Generalized Multi-Object TrackingCode1
BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained DevicesCode1
Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image TranslationCode1
Neuromorphic Drone Detection: an Event-RGB Multimodal ApproachCode1
PDT: Uav Target Detection Dataset for Pests and Diseases TreeCode1
MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary NoduleCode1
STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object TrackingCode1
Towards Physically Realizable Adversarial Attacks in Embodied Vision NavigationCode1
GLCONet: Learning Multi-source Perception Representation for Camouflaged Object DetectionCode1
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer NetworksCode1
When to Extract ReID Features: A Selective Approach for Improved Multiple Object TrackingCode1
LEROjD: Lidar Extended Radar-Only Object DetectionCode1
Visual Grounding with Multi-modal Conditional AdaptationCode1
Can OOD Object Detectors Learn from Foundation Models?Code1
Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target DetectionCode1
SSFam: Scribble Supervised Salient Object Detection FamilyCode1
LowFormer: Hardware Efficient Design for Convolutional Transformer BackbonesCode1
Latent Distillation for Continual Object Detection at the EdgeCode1
Frequency-Spatial Entanglement Learning for Camouflaged Object DetectionCode1
GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object DetectionCode1
Fisher Information guided Purification against Backdoor AttacksCode1
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba TrainingCode1
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-ViewCode1
Enhancing Sound Source Localization via False Negative EliminationCode1
NAS-BNN: Neural Architecture Search for Binary Neural NetworksCode1
A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future DirectionsCode1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion GuidanceCode1
Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object DetectionCode1
A Lightweight Insulator Defect Detection Model Based on Drone ImagesCode1
UMAD: University of Macau Anomaly Detection Benchmark DatasetCode1
OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and FusionCode1
SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action RecognitionCode1
PADetBench: Towards Benchmarking Physical Attacks against Object DetectionCode1
Multi-Granularity Part Sampling Attention for Fine-Grained Visual ClassificationCode1
Co-Fix3D: Enhancing 3D Object Detection with Collaborative RefinementCode1
Unified-IoU: For High-Quality Object DetectionCode1
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object DetectionCode1
PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object DetectionCode1
FADE: A Dataset for Detecting Falling Objects around Buildings in VideoCode1
UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster ScenariosCode1
SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic ScenesCode1
GUI Element Detection Using SOTA YOLO Deep Learning ModelsCode1
Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D GaussianCode1
Show:102550
← PrevPage 16 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified