SOTAVerified

Object Detection

Papers

Showing 1051-1075 of 10957 papers

Title | Status | Hype
Enhancing Novel Object Detection via Cooperative Foundational Models | Code | 1
Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Code | 1
Point Cloud Self-supervised Learning via 3D to Multi-view Masked Autoencoder | Code | 1
Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task Model | Code | 1
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Code | 1
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks | Code | 1
Linear Gaussian Bounding Box Representation and Ring-Shaped Rotated Convolution for Oriented Object Detection | Code | 1
DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets | Code | 1
Instruct Me More! Random Prompting for Visual In-Context Learning | Code | 1
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Code | 1
Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding Box | Code | 1
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment | Code | 1
Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs | Code | 1
Proposal-Level Unsupervised Domain Adaptation for Open World Unbiased Detector | Code | 1
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Code | 1
Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion | Code | 1
Patch-based Selection and Refinement for Early Object Detection | Code | 1
InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images | Code | 1
Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO | Code | 1
Recognize Any Regions | Code | 1
Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Code | 1
Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Code | 1
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain | Code | 1
HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Code | 1
DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | - | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | - | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | - | Unverified
4 | MoCaE | box mAP | 65.1 | - | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | - | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | - | Unverified
7 | EVA | box mAP | 64.7 | - | Unverified
8 | Group DETR v2 | box mAP | 64.5 | - | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | - | Unverified
10 | InternImage-XL | box mAP | 64.3 | - | Unverified