SOTAVerified

Object Detection

Papers

Showing 351-400 of 10957 papers

Title | Status | Hype
Point Cloud Based Scene Segmentation: A Survey | - | 0
UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection | - | 0
Falcon: A Remote Sensing Vision-Language Foundation Model | Code | 3
FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection | - | 0
Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime | - | 0
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection | Code | 0
FLASHμ: Fast Localizing And Sizing of Holographic Microparticles | - | 0
The Power of One: A Single Example is All it Takes for Segmentation in VLMs | - | 0
HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer | - | 0
TARS: Traffic-Aware Radar Scene Flow Estimation | - | 0
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Code | 2
Object detection characteristics in a learning factory environment using YOLOv8 | - | 0
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection | Code | 1
Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection | - | 0
Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection | - | 0
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Code | 1
How good are deep learning methods for automated road safety analysis using video data? An experimental study | - | 0
Evaluating the Impact of Synthetic Data on Object Detection Tasks in Autonomous Driving | - | 0
Fully-Synthetic Training for Visual Quality Inspection in Automotive Production | - | 0
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection | - | 0
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection | - | 0
Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection | - | 0
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | - | 0
Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X | - | 0
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method | - | 0
Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels | Code | 1
Simulating Automotive Radar with Lidar and Camera Inputs | - | 0
Referring to Any Person | Code | 2
SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection | - | 0
Boundary Regression for Leitmotif Detection in Music Audio | - | 0
Physics-based AI methodology for Material Parameter Extraction from Optical Data | - | 0
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Code | 1
VocalEyes: Enhancing Environmental Perception for the Visually Impaired through Vision-Language Models and Distance-Aware Object Detection | - | 0
Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | - | 0
A Light Perspective for 3D Object Detection | - | 0
Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection | - | 0
HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection | - | 0
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera | - | 0
SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements | Code | 1
Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection | - | 0
Semantic Communications with Computer Vision Sensing for Edge Video Transmission | - | 0
RS2AD: End-to-End Autonomous Driving Data Generation from Roadside Sensor Observations | - | 0
Enhancing Layer Attention Efficiency through Pruning Redundant Retrievals | - | 0
SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts | Code | 1
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection | - | 0
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Code | 4
AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection | Code | 0
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning | - | 0
Accurate and Efficient Two-Stage Gun Detection in Video | - | 0
Improving SAM for Camouflaged Object Detection via Dual Stream Adapters | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | - | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | - | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | - | Unverified
4 | MoCaE | box mAP | 65.1 | - | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | - | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | - | Unverified
7 | EVA | box mAP | 64.7 | - | Unverified
8 | Group DETR v2 | box mAP | 64.5 | - | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | - | Unverified
10 | InternImage-XL | box mAP | 64.3 | - | Unverified