SOTAVerified

Object Detection

Papers

Showing 401-450 of 10957 papers

Title | Status | Hype
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation | | 0
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters | | 0
Get In Video: Add Anything You Want to the Video | | 0
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images | | 0
2D Object Detection: A Survey | | 0
Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection | | 0
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | | 0
ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem | Code | 0
Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks | | 0
Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation | | 0
Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing | | 0
BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation | | 0
L2RDaS: Synthesizing 4D Radar Tensors for Model Generalization via Dataset Expansion | | 0
AI-Driven Multi-Stage Computer Vision System for Defect Detection in Laser-Engraved Industrial Nameplates | | 0
MIAdapt: Source-free Few-shot Domain Adaptive Object Detection for Microscopic Images | | 0
Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case | Code | 0
ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge | | 0
SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images | | 0
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset | Code | 1
Robust detection of overlapping bioacoustic sound events | | 0
Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds? | | 0
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection | Code | 1
Illuminant and light direction estimation using Wasserstein distance method | | 0
ClipGrader: Leveraging Vision-Language Models for Robust Label Quality Assessment in Object Detection | | 0
Uncertainty Representation in a SOTIF-Related Use Case with Dempster-Shafer Theory for LiDAR Sensor-Based Object Detection | Code | 0
Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR | | 0
Visual-RFT: Visual Reinforcement Fine-Tuning | Code | 7
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism | Code | 2
A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data | | 0
Unifying Light Field Perception with Field of Parallax | Code | 0
UniFa: A unified feature hallucination framework for any-shot object detection | | 0
RFWNet: A Lightweight Remote Sensing Object Detector Integrating Multi-Scale Receptive Fields and Foreground Focus Mechanism | | 0
Technical Report for ReID-SAM on SkiTB Visual Tracking Challenge 2025 | | 0
FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection | Code | 1
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels | Code | 4
Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds | | 0
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance | | 0
Learning Mask Invariant Mutual Information for Masked Image Modeling | | 0
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation | Code | 0
Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies | | 0
Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv10 | | 0
Advanced YOLO-based Real-time Power Line Detection for Vegetation Management | | 0
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras | Code | 1
Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads | | 0
Progressive Local Alignment for Medical Multimodal Pre-training | | 0
Multi-Perspective Data Augmentation for Few-shot Object Detection | Code | 1
LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR | | 0
Experimental validation of UAV search and detection system in real wilderness environment | | 0
Geometry-Aware 3D Salient Object Detection Network | | 0
Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified