SOTAVerified

Object Detection

Papers

Showing 7601–7650 of 10957 papers

Title | Status | Hype
A Training-free, One-shot Detection Framework For Geospatial Objects In Remote Sensing Images | | 0
PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS with Relationship Recovery | | 0
A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection) | | 0
AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving | | 0
CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection | | 0
A convnet for non-maximum suppression | | 0
Group Equivariant BEV for 3D Object Detection | | 0
ATLASv2: LLM-Guided Adaptive Landmark Acquisition and Navigation on the Edge | | 0
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining | | 0
Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount | | 0
Pushing the Limits of Asynchronous Graph-based Object Detection with Event Cameras | | 0
Pushing the Limits of Radiology with Joint Modeling of Visual and Textual Information | | 0
Putting 3D Spatially Sparse Networks on a Diet | | 0
PVAFN: Point-Voxel Attention Fusion Network with Multi-Pooling Enhancing for 3D Object Detection | | 0
AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems | | 0
Group channel pruning and spatial attention distilling for object detection | | 0
PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features | | 0
Ground Plane Matters: Picking Up Ground Plane Prior in Monocular 3D Object Detection | | 0
DeepMix: Mobility-aware, Lightweight, and Hybrid 3D Object Detection for Headsets | | 0
PV-RCNN++: Semantical Point-Voxel Feature Interaction for 3D Object Detection | | 0
PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges | | 0
PV-SSD: A Multi-Modal Point Cloud Feature Fusion Method for Projection Features and Variable Receptive Field Voxel Features | | 0
Real-time 3D object proposal generation and classification under limited processing resources | | 0
Real-Time and Robust 3D Object Detection Within Road-Side LiDARs Using Domain Adaptation | | 0
Real-time HOG+SVM based object detection using SoC FPGA for a UHD video stream | | 0
Ground material classification for UAV-based photogrammetric 3D data A 2D-3D Hybrid Approach | | 0
Cross-Domain Spatial Matching for Camera and Radar Sensor Data Fusion in Autonomous Vehicle Perception System | | 0
AI-based thermal bridge detection of building rooftops on district scale using aerial images | | 0
Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher | | 0
Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection | | 0
GroundCap: A Visually Grounded Image Captioning Dataset | | 0
A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection | | 0
A Continuous Occlusion Model for Road Scene Understanding | | 0
Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation | | 0
GRIP: Generative Robust Inference and Perception for Semantic Robot Manipulation in Adversarial Environments | | 0
A Taught-Obesrve-Ask (TOA) Method for Object Detection with Critical Supervision | | 0
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring | | 0
Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models | | 0
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction | | 0
Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training | | 0
A.I. and Data-Driven Mobility at Volkswagen Financial Services AG | | 0
QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation | | 0
A Context Aware and Video-Based Risk Descriptor for Cyclists | | 0
QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving | | 0
ReAFFPN: Rotation-equivariant Attention Feature Fusion Pyramid Networks for Aerial Object Detection | | 0
Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning | | 0
A System-Level Solution for Low-Power Object Detection | | 0
GridCLIP: One-Stage Object Detection by Grid-Level CLIP Representation Learning | | 0
G-Rep: Gaussian Representation for Arbitrary-Oriented Object Detection | | 0
Cross-domain Federated Object Detection | | 0
Page 153 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified