SOTAVerified

Object Detection

Papers

Showing 33513400 of 10957 papers

TitleStatusHype
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation0
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images0
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters0
Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction0
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning0
2D Object Detection: A Survey0
ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport TheoremCode0
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach0
Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection0
Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation0
Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks0
Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use CaseCode0
BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation0
L2RDaS: Synthesizing 4D Radar Tensors for Model Generalization via Dataset Expansion0
MIAdapt: Source-free Few-shot Domain Adaptive Object Detection for Microscopic Images0
AI-Driven Multi-Stage Computer Vision System for Defect Detection in Laser-Engraved Industrial Nameplates0
Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing0
Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds?0
ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge0
Robust detection of overlapping bioacoustic sound events0
SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images0
Illuminant and light direction estimation using Wasserstein distance method0
ClipGrader: Leveraging Vision-Language Models for Robust Label Quality Assessment in Object Detection0
Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR0
Uncertainty Representation in a SOTIF-Related Use Case with Dempster-Shafer Theory for LiDAR Sensor-Based Object DetectionCode0
A Comparison of Object Detection and Phrase Grounding Models in Chest X-ray Abnormality Localization using Eye-tracking Data0
Unifying Light Field Perception with Field of ParallaxCode0
RFWNet: A Lightweight Remote Sensing Object Detector Integrating Multi-Scale Receptive Fields and Foreground Focus Mechanism0
UniFa: A unified feature hallucination framework for any-shot object detection0
Technical Report for ReID-SAM on SkiTB Visual Tracking Challenge 20250
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model EvaluationCode0
Learning Mask Invariant Mutual Information for Masked Image Modeling0
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance0
Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds0
Advanced YOLO-based Real-time Power Line Detection for Vegetation Management0
Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv100
Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies0
Progressive Local Alignment for Medical Multimodal Pre-training0
Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads0
Experimental validation of UAV search and detection system in real wilderness environment0
LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR0
Geometry-Aware 3D Salient Object Detection Network0
MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering0
Deep learning approaches to surgical video segmentation and object detection: A Scoping Review0
FeatSharp: Your Vision Model Features, Sharper0
KnowZRel: Common Sense Knowledge-based Zero-Shot Relationship Retrieval for Generalised Scene Graph GenerationCode0
Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection0
Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection0
Generative AI Framework for 3D Object Generation in Augmented Reality0
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera0
Show:102550
← PrevPage 68 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified