SOTAVerified

Object Detection

Papers

Showing 18011850 of 10957 papers

TitleStatusHype
A Study on Unsupervised Anomaly Detection and Defect Localization using Generative Model in Ultrasonic Non-Destructive Testing0
Rethinking Early-Fusion Strategies for Improved Multispectral Object DetectionCode1
REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph GenerationCode2
GreenCOD: A Green Camouflaged Object Detection Method0
MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface DefectsCode1
DiffuBox: Refining 3D Object Detection with Point DiffusionCode1
SpotNet: An Image Centric, Lidar Anchored Approach To Long Range Perception0
UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-ClassesCode1
Boost UAV-based Ojbect Detection via Scale-Invariant Feature Disentanglement and Adversarial Learning0
Multimodal Object Detection via Probabilistic a priori Information IntegrationCode0
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models0
MonoDETRNext: Next-Generation Accurate and Efficient Monocular 3D Object Detector0
Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection0
Towards Global Optimal Visual In-Context Learning Prompt Selection0
Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasetsCode0
Balanced ID-OOD tradeoff transfer makes query based detectors good few shot learners0
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondCode2
Designing A Sustainable Marine Debris Clean-up Framework without Human LabelsCode0
MOD-UV: Learning Mobile Object Detectors from Unlabeled VideosCode1
Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual RepresentationsCode0
YOLOv10: Real-Time End-to-End Object DetectionCode11
Improving Single Domain-Generalized Object Detection: A Focus on Diversification and AlignmentCode1
TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System0
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens0
Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing0
Vision Transformer with Sparse Scan PriorCode0
Class-Conditional self-reward mechanism for improved Text-to-Image modelsCode0
Collaboration of Teachers for Semi-supervised Object Detection0
Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation0
Transfer Learning Approach for Railway Technical Map (RTM) Component Identification0
Empowering Urban Traffic Management: Elevated 3D LiDAR for Data Collection and Advanced Object Detection Analysis0
FFAM: Feature Factorization Activation Map for Explanation of 3D DetectorsCode0
Mutual Information Analysis in Multimodal Learning Systems0
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once0
Active Object Detection with Knowledge Aggregation and Distillation from Large ModelsCode0
Multi-View Attentive Contextualization for Multi-View 3D Object Detection0
Versatile Teacher: A Class-aware Teacher-student Framework for Cross-domain AdaptationCode1
Bangladeshi Native Vehicle Detection in WildCode0
DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical AlignmentCode2
FADet: A Multi-sensor 3D Object Detection Network based on Local Featured AttentionCode1
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch NormalizationCode2
InfRS: Incremental Few-Shot Object Detection in Remote Sensing ImagesCode1
Visible and Clear: Finding Tiny Objects in Difference MapCode1
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection0
A Large-scale Multi Domain Leukemia Dataset for the White Blood Cells Detection with Morphological Attributes for ExplainabilityCode1
A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model0
Drone-type-Set: Drone types detection benchmark for drone detection and tracking0
Grounded 3D-LLM with Referent TokensCode2
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object DetectionCode7
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Show:102550
← PrevPage 37 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified