SOTAVerified

Object Detection

Papers

Showing 34013450 of 10957 papers

TitleStatusHype
M-FLAG: Medical Vision-Language Pre-training with Frozen Language Models and Latent Space Geometry OptimizationCode1
S2R-ViT for Multi-Agent Cooperative Perception: Bridging the Gap from Simulation to Reality0
Semi-DETR: Semi-Supervised Object Detection with Detection TransformersCode1
Analysing Gender Bias in Text-to-Image Models using Object DetectionCode0
Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-LabelingCode1
KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection0
Deteksi Sampah di Permukaan dan Dalam Perairan pada Objek Video dengan Metode Robust and Efficient Post-Processing dan Tubelet-Level Bounding Box Linking0
Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave RadarCode1
MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression0
Aeolus Ocean -- A simulation environment for the autonomous COLREG-compliant navigation of Unmanned Surface Vehicles using Deep Reinforcement Learning and Maritime Object DetectionCode0
Robotic surface exploration with vision and tactile sensing for cracks detection and characterisation0
WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water SurfacesCode1
Multimodal Object Detection in Remote Sensing0
YOLIC: An Efficient Method for Object Localization and Classification on Edge DevicesCode0
YOGA: Deep Object Detection in the Wild with Lightweight Feature Learning and Multiscale Attention0
GVCCI: Lifelong Learning of Visual Grounding for Language-Guided Robotic ManipulationCode0
A New Dataset and Comparative Study for Aphid Cluster Detection0
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and ResolutionCode6
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems0
DNN-Based Map Deviation Detection in LiDAR Point CloudsCode1
Bio-Inspired Night Image Enhancement Based on Contrast Enhancement and Denoising0
Joint Salient Object Detection and Camouflaged Object Detection via Uncertainty-aware Learning0
Preventing Errors in Person Detection: A Part-Based Self-Monitoring FrameworkCode0
Q-YOLOP: Quantization-aware You Only Look Once for Panoptic Driving Perception0
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge UnderstandingCode0
Visible and infrared self-supervised fusion trained on a single example0
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye ViewCode1
Edge-Aware Mirror Network for Camouflaged Object DetectionCode1
Camouflaged Object Detection with Feature Grafting and Distractor AwareCode0
Domain Generalized Object Detection for Remote Sensing ImagesCode0
Artificial Eye for the Blind0
Open-Vocabulary Object Detection via Scene Graph Discovery0
Joint Perceptual Learning for Enhancement and Object Detection in Underwater Scenarios0
Solvent: A Framework for Protein FoldingCode1
PseudoCell: Hard Negative Mining as Pseudo Labeling for Deep Learning-Based Centroblast Cell DetectionCode0
Semi-supervised Learning from Street-View Images and OpenStreetMap for Automatic Building Height EstimationCode1
Line Graphics Digitization: A Step Towards Full AutomationCode0
SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection0
Base Layer Efficiency in Scalable Human-Machine Coding0
RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation0
Unbalanced Optimal Transport: A Unified Framework for Object DetectionCode1
Object Recognition System on a Tactile Device for Visually Impaired0
Focusing on what to decode and what to train: SOV Decoding with Specific Target Guided DeNoising and Vision Language AdvisorCode0
Practical Collaborative Perception: A Framework for Asynchronous and Multi-Agent 3D Object DetectionCode1
MaskBEV: Joint Object Detection and Footprint Completion for Bird's-eye View 3D Point CloudsCode1
SUIT: Learning Significance-guided Information for 3D Temporal Detection0
Exploiting Richness of Learned Compressed Representation of Images for Semantic Segmentation0
SRCD: Semantic Reasoning with Compound Domains for Single-Domain Generalized Object Detection0
IAdet: Simplest human-in-the-loop object detectionCode0
Hierarchical Open-vocabulary Universal Image SegmentationCode2
Show:102550
← PrevPage 69 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified