SOTAVerified

Object Detection

Papers

Showing 21512200 of 10957 papers

TitleStatusHype
YOLOv9 for Fracture Detection in Pediatric Wrist Trauma X-ray ImagesCode1
GRA: Detecting Oriented Objects through Group-wise Rotating and Attention0
V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions0
FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation0
HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object DetectionCode2
Detection of Fast-Moving Objects with Neuromorphic Hardware0
Cannabis Seed Variant Detection using Faster R-CNN0
SimPB: A Single Model for 2D and 3D Object Detection from Multiple CamerasCode1
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception0
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
CSDNet: Detect Salient Object in Depth-Thermal via A Lightweight Cross Shallow and Deep Perception Network0
A Hybrid SNN-ANN Network for Event-based Object Detection with Spatial and Temporal Attention0
Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object DetectorsCode0
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring0
SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph0
D-YOLO a robust framework for object detection in adverse weather conditions0
Improving Distant 3D Object Detection Using 2D Box Supervision0
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest0
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap for Domain-Adaptive Object DetectionCode1
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization0
CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow0
FogGuard: guarding YOLO against fog using perceptual lossCode0
Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks0
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation LearningCode2
FieldNet: Efficient Real-Time Shadow Removal for Enhanced Vision in Field Robotics0
A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product0
Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks0
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense PredictionsCode3
Aedes aegypti Egg Counting with Neural Networks for Object Detection0
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection0
PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution0
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection0
Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object DetectionCode0
Adaptive Bounding Box Uncertainties via Two-Step Conformal PredictionCode1
JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection0
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions0
Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference0
Inception-YOLO: Computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP, SPPF, and inception modules0
LISO: Lidar-only Self-Supervised 3D Object DetectionCode2
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation StrategiesCode0
Genetic Learning for Designing Sim-to-Real Data AugmentationsCode0
Evaluating the Energy Efficiency of Few-Shot Learning for Object Detection in Industrial Settings0
LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations0
Cross-domain and Cross-dimension Learning for Image-to-Graph TransformersCode0
Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object DetectionCode1
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion HeadCode5
SeSame: Simple, Easy 3D Object Detection with Point-Wise SemanticsCode1
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object DetectionCode4
Show:102550
← PrevPage 44 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified