SOTAVerified

Object Detection

Papers

Showing 3301–3350 of 10957 papers

| Title | Status | Hype |
|---|---|---|
| LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation | Code | 0 |
| Shift, Scale and Rotation Invariant Multiple Object Detection using Balanced Joint Transform Correlator | | 0 |
| PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds | Code | 0 |
| FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene | | 0 |
| HSOD-BIT-V2: A New Challenging Benchmark for Hyperspectral Salient Object Detection | Code | 0 |
| SparseAlign: A Fully Sparse Framework for Cooperative Object Detection | | 0 |
| Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection | | 0 |
| Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization | | 0 |
| MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models | | 0 |
| 8-Calves Image dataset | Code | 0 |
| GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing | | 0 |
| Point Cloud Based Scene Segmentation: A Survey | | 0 |
| UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection | | 0 |
| Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime | | 0 |
| FLASHμ: Fast Localizing And Sizing of Holographic Microparticles | | 0 |
| FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection | | 0 |
| Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection | Code | 0 |
| The Power of One: A Single Example is All it Takes for Segmentation in VLMs | | 0 |
| HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer | | 0 |
| Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection | | 0 |
| TARS: Traffic-Aware Radar Scene Flow Estimation | | 0 |
| Object detection characteristics in a learning factory environment using YOLOv8 | | 0 |
| Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection | | 0 |
| Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection | | 0 |
| CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | | 0 |
| Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | | 0 |
| DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection | | 0 |
| How good are deep learning methods for automated road safety analysis using video data? An experimental study | | 0 |
| Fully-Synthetic Training for Visual Quality Inspection in Automotive Production | | 0 |
| Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | | 0 |
| Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving | | 0 |
| Simulating Automotive Radar with Lidar and Camera Inputs | | 0 |
| Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | | 0 |
| Boundary Regression for Leitmotif Detection in Music Audio | | 0 |
| Physics-based AI methodology for Material Parameter Extraction from Optical Data | | 0 |
| SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection | | 0 |
| Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection | | 0 |
| VocalEyes: Enhancing Environmental Perception for the Visually Impaired through Vision-Language Models and Distance-Aware Object Detection | | 0 |
| RS2AD: End-to-End Autonomous Driving Data Generation from Roadside Sensor Observations | | 0 |
| HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection | | 0 |
| Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | | 0 |
| Semantic Communications with Computer Vision Sensing for Edge Video Transmission | | 0 |
| Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | | 0 |
| A Light Perspective for 3D Object Detection | | 0 |
| Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | | 0 |
| Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | | 0 |
| AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection | Code | 0 |
| OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | | 0 |
| From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning | | 0 |
| Accurate and Efficient Two-Stage Gun Detection in Video | | 0 |
Page 67 of 220

Benchmark Results

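| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |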