SOTAVerified

Object Detection

Papers

Showing 901-950 of 10957 papers

| Title | Status | Hype |
|---|---|---|
| Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based Segmentation | Code | 1 |
| Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection | Code | 1 |
| CenterMask : Real-Time Anchor-Free Instance Segmentation | Code | 1 |
| Contrastive Masked Autoencoders are Stronger Vision Learners | Code | 1 |
| Control Distance IoU and Control Distance IoU Loss Function for Better Bounding Box Regression | Code | 1 |
| A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Code | 1 |
| CenterNet3D: An Anchor Free Object Detector for Point Cloud | Code | 1 |
| DODA: Diffusion for Object-detection Domain Adaptation in Agriculture | Code | 1 |
| ApproxDet: Content and Contention-Aware Approximate Object Detection for Mobiles | Code | 1 |
| ConvMLP: Hierarchical Convolutional MLPs for Vision | Code | 1 |
| DoReMi: First glance at a universal OMR dataset | Code | 1 |
| An Explicit Local and Global Representation Disentanglement Framework with Applications in Deep Clustering and Unsupervised Object Detection | Code | 1 |
| CE-FPN: Enhancing Channel Information for Object Detection | Code | 1 |
| CD-FSOD: A Benchmark for Cross-domain Few-shot Object Detection | Code | 1 |
| Advancing Vision Transformers with Group-Mix Attention | Code | 1 |
| Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection | Code | 1 |
| 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection | Code | 1 |
| CDNet is all you need: Cascade DCN based underwater object detection RCNN | Code | 1 |
| DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing | Code | 1 |
| CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Code | 1 |
| AQD: Towards Accurate Fully-Quantized Object Detection | Code | 1 |
| CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset | Code | 1 |
| Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities | Code | 1 |
| AquaVision: Automating the detection of waste in water bodies using deep transfer learning | Code | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Code | 1 |
| A Random CNN Sees Objects: One Inductive Bias of CNN and Its Applications | Code | 1 |
| Diverse Branch Block: Building a Convolution as an Inception-like Unit | Code | 1 |
| Adversarial Attack and Defense of YOLO Detectors in Autonomous Driving Scenarios | Code | 1 |
| CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Code | 1 |
| 3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking | Code | 1 |
| Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Code | 1 |
| CoVA: Context-aware Visual Attention for Webpage Information Extraction | Code | 1 |
| DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes | Code | 1 |
| DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture | Code | 1 |
| 3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation | Code | 1 |
| CR3DT: Camera-RADAR Fusion for 3D Detection and Tracking | Code | 1 |
| Cross-domain Detection via Graph-induced Prototype Alignment | Code | 1 |
| Eliminating Position Bias of Language Models: A Mechanistic Approach | Code | 1 |
| CBNet: A Composite Backbone Network Architecture for Object Detection | Code | 1 |
| Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding | Code | 1 |
| DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection | Code | 1 |
| CrossDet: Crossline Representation for Object Detection | Code | 1 |
| CCSPNet-Joint: Efficient Joint Training Method for Traffic Sign Detection Under Extreme Conditions | Code | 1 |
| Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images | Code | 1 |
| CG-SSD: Corner Guided Single Stage 3D Object Detection from LiDAR Point Cloud | Code | 1 |
| A recurrent CNN for online object detection on raw radar frames | Code | 1 |
| Cross-Domain Adaptive Teacher for Object Detection | Code | 1 |
| MTTrans: Cross-Domain Object Detection with Mean-Teacher Transformer | Code | 1 |
| Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation | Code | 1 |
| CenterFusion: Center-based Radar and Camera Fusion for 3D Object Detection | Code | 1 |

Benchmark Results

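| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |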