SOTAVerified

Object Detection

Papers

Showing 10511100 of 10957 papers

TitleStatusHype
Enhancing Novel Object Detection via Cooperative Foundational ModelsCode1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and RetentionCode1
Point Cloud Self-supervised Learning via 3D to Multi-view Masked AutoencoderCode1
Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task ModelCode1
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer VisionCode1
Florence-2: Advancing a Unified Representation for a Variety of Vision TasksCode1
Linear Gaussian Bounding Box Representation and Ring-Shaped Rotated Convolution for Oriented Object DetectionCode1
DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasetsCode1
Meta-Adapter: An Online Few-shot Learner for Vision-Language ModelCode1
Instruct Me More! Random Prompting for Visual In-Context LearningCode1
Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding BoxCode1
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph EnrichmentCode1
Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM MicrographsCode1
Proposal-Level Unsupervised Domain Adaptation for Open World Unbiased DetectorCode1
Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image FusionCode1
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode1
Patch-based Selection and Refinement for Early Object DetectionCode1
Effective Human-AI Teams via Learned Natural Language Rules and OnboardingCode1
InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV ImagesCode1
Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLOCode1
Recognize Any RegionsCode1
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in RainCode1
Re-Scoring Using Image-Language Similarity for Few-Shot Object DetectionCode1
HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point CloudsCode1
DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object DetectionCode1
RGB-X Object Detection via Scene-Specific Fusion ModulesCode1
A High-Resolution Dataset for Instance Detection with Multi-View Instance CaptureCode1
HSIC-based Moving WeightAveraging for Few-Shot Open-Set Object DetectionCode1
IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like SettingCode1
Navigating Data Heterogeneity in Federated Learning A Semi-Supervised Federated Object DetectionCode1
LP-OVOD: Open-Vocabulary Object Detection by Linear ProbingCode1
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object DetectionCode1
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object DetectionCode1
Salient Object Detection in RGB-D VideosCode1
Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLACode1
OV-VG: A Benchmark for Open-Vocabulary Visual GroundingCode1
Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing ImagesCode1
ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map ConstructionCode1
EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye ViewCode1
Zone Evaluation: Revealing Spatial Bias in Object DetectionCode1
Multi‑camera trajectory matching based on hierarchical clustering and constraintsCode1
MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation CoefficientCode1
Towards Generalizable Multi-Camera 3D Object Detection via Perspective DebiasingCode1
RoboLLM: Robotic Vision Tasks Grounded on Multimodal Large Language ModelsCode1
Open-CRB: Towards Open World Active Learning for 3D Object DetectionCode1
RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNetsCode1
Rank-DETR for High Quality Object DetectionCode1
Relational Prior Knowledge Graphs for Detection and Instance SegmentationCode1
Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous DrivingCode1
EViT: An Eagle Vision Transformer with Bi-Fovea Self-AttentionCode1
Show:102550
← PrevPage 22 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified