SOTAVerified

Object Detection

Papers

Showing 11011125 of 10957 papers

TitleStatusHype
QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation0
Mero Nagarikta: Advanced Nepali Citizenship Data Extractor with Deep Learning-Powered Text Detection and OCR0
PixLens: A Novel Framework for Disentangled Evaluation in Diffusion-Based Image Editing with Object Detection + SAMCode0
Believing is Seeing: Unobserved Object Detection using Generative ModelsCode0
Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga0
Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions0
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary DetectionCode1
Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and FutureCode1
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts0
Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach0
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection0
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading0
Rethinking Weak-to-Strong Augmentation in Source-Free Domain Adaptive Object Detection0
Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava0
Improving Object Detection via Local-global Contrastive Learning0
Improved detection of discarded fish species through BoxAL active learningCode0
Learning De-Biased Representations for Remote-Sensing ImageryCode0
Cross Resolution Encoding-Decoding For Detection TransformersCode0
Fast Object Detection with a Machine Learning Edge Device0
Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object DetectionCode0
STONE: A Submodular Optimization Framework for Active 3D Object DetectionCode0
Learning 3D Perception from Others' Predictions0
SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual RepresentationsCode0
BiSSL: Enhancing the Alignment Between Self-Supervised Pretraining and Downstream Fine-Tuning via Bilevel Optimization0
Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker0
Show:102550
← PrevPage 45 of 439Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified