SOTAVerified

Object Detection

Papers

Showing 26512700 of 10957 papers

TitleStatusHype
Gen2Det: Generate to Detect0
Texture-Semantic Collaboration Network for ORSI Salient Object DetectionCode0
Automated Multimodal Data Annotation via Calibration With Indoor Positioning System0
Rethinking Object Saliency Ranking: A Novel Whole-flow Processing ParadigmCode0
Uni3DL: Unified Model for 3D and Language Understanding0
ScAR: Scaling Adversarial Robustness for LiDAR Object DetectionCode0
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object DetectionCode1
Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and BaselineCode2
Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery0
RotaTR: Detection Transformer for Dense and Rotated Object0
Survey on deep learning in multimodal medical imaging for cancer detection0
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks0
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection0
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object DetectionCode1
Hulk: A Universal Knowledge Translator for Human-Centric TasksCode2
Strong but simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer LearningCode1
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors0
A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing0
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-trainingCode0
Exploring Adversarial Robustness of LiDAR-Camera Fusion Model in Autonomous Driving0
Spectrum-driven Mixed-frequency Network for Hyperspectral Salient Object DetectionCode1
Boosting Object Detection with Zero-Shot Day-Night Domain AdaptationCode1
Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction with Extremely Limited LabelsCode1
Segment and Caption AnythingCode2
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingCode4
Efficient Multimodal Semantic Segmentation via Dual-Prompt LearningCode1
RadioGalaxyNET: Dataset and Novel Computer Vision Algorithms for the Detection of Extended Radio Galaxies and Infrared HostsCode0
SCHEME: Scalable Channel Mixer for Vision Transformers0
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion ModelsCode2
Towards Efficient 3D Object Detection in Bird's-Eye-View Space for Autonomous Driving: A Convolutional-Only Approach0
Fool the Hydra: Adversarial Attacks against Multi-view Object Detection Systems0
SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation0
Cascaded Interaction with Eroded Deep Supervision for Salient Object Detection0
Union-over-Intersections: Object Detection beyond Winner-Takes-AllCode0
Hy-Tracker: A Novel Framework for Enhancing Efficiency and Accuracy of Object Tracking in Hyperspectral Videos0
Is Underwater Image Enhancement All Object Detectors Need?Code1
TIDE: Test Time Few Shot Object DetectionCode0
AutArch: An AI-assisted workflow for object detection and automated recording in archaeological cataloguesCode0
RQFormer: Rotated Query Transformer for End-to-End Oriented Object DetectionCode1
Do text-free diffusion models learn discriminative visual representations?Code1
Weakly-semi-supervised object detection in remotely sensed imagery0
PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection0
An Efficient Illumination Invariant Tiger Detection Framework for Wildlife Surveillance0
LEOD: Label-Efficient Object Detection for Event CamerasCode1
The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understandingCode1
Joint network for specular highlight detection and adversarial generation of specular-free images trained with polarimetric dataCode0
Integration of Robotics, Computer Vision, and Algorithm Design: A Chinese Poker Self-Playing Robot0
TransNeXt: Robust Foveal Visual Perception for Vision TransformersCode2
Feedback RoI Features Improve Aerial Object Detection0
Show:102550
← PrevPage 54 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified