SOTAVerified

Object Detection

Papers

Showing 13511400 of 10957 papers

TitleStatusHype
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes0
A Lightweight Modular Framework for Low-Cost Open-Vocabulary Object Detection TrainingCode0
Vision Calorimeter: Migrating Visual Object Detector to High-energy Particle ImagesCode0
Detection of Intracranial Hemorrhage for Trauma PatientsCode0
Just a Hint: Point-Supervised Camouflaged Object Detection0
A Closer Look at Data Augmentation Strategies for Finetuning-Based Low/Few-Shot Object Detection0
SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection0
IDD-YOLOv5: A Lightweight Insulator Defect Real-time Detection AlgorithmCode0
Leveraging Superfluous Information in Contrastive Representation Learning0
Latent Diffusion for Guided Document Table Generation0
SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action RecognitionCode1
Segment-Anything Models Achieve Zero-shot Robustness in Autonomous DrivingCode0
Boundary-Recovering Network for Temporal Action Detection0
Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection0
YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems0
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing CommunityCode3
GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System0
PADetBench: Towards Benchmarking Physical Attacks against Object DetectionCode1
MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation0
Depth-guided Texture Diffusion for Image Semantic Segmentation0
Multi-Granularity Part Sampling Attention for Fine-Grained Visual ClassificationCode1
Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques0
Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs0
Multimodal Relational Triple Extraction with Query-based Entity Object Transformer0
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition TasksCode3
Co-Fix3D: Enhancing 3D Object Detection with Collaborative RefinementCode1
GOReloc: Graph-based Object-Level Relocalization for Visual SLAMCode2
SC3D: Label-Efficient Outdoor 3D Object Detection via Single Click Annotation0
CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection0
Learned Multimodal Compression for Autonomous Driving0
Sign language recognition based on deep learning and low-cost handcrafted descriptorsCode0
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection0
See It All: Contextualized Late Aggregation for 3D Dense Captioning0
Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces0
Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object DetectionCode1
Divide and Conquer: Improving Multi-Camera 3D Perception with 2D Semantic-Depth Priors and Input-Dependent Queries0
Unified-IoU: For High-Quality Object DetectionCode1
Exploring Domain Shift on Radar-Based 3D Object Detection Amidst Diverse Environmental Conditions0
MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers0
Latent Disentanglement for Low Light Image Enhancement0
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts0
MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective PerceptionCode0
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object DetectionCode0
Multi-scale Contrastive Adaptor Learning for Segmenting Anything in Underperformed Scenes0
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection0
Optimizing Vision Transformers with Data-Free Knowledge Transfer0
PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object DetectionCode1
FADE: A Dataset for Detecting Falling Objects around Buildings in VideoCode1
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising TrainingCode0
Show:102550
← PrevPage 28 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified