SOTAVerified

Object Detection

Papers

Showing 14511500 of 10957 papers

TitleStatusHype
Lightweight Neural Architecture Search for Temporal Convolutional Networks at the EdgeCode1
Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing MechanismCode1
Exploring Active 3D Object Detection from a Generalization PerspectiveCode1
Long-tail Detection with Effective Class-MarginsCode1
OvarNet: Towards Open-vocabulary Object Attribute RecognitionCode1
Unleash the Potential of Image Branch for Cross-modal 3D Object DetectionCode1
FemtoDet: An Object Detection Baseline for Energy Versus Performance TradeoffsCode1
SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded NetworkCode1
Towards Spatial Equilibrium Object DetectionCode1
EARL: An Elliptical Distribution aided Adaptive Rotation Label Assignment for Oriented Object Detection in Remote Sensing ImagesCode1
CLIP the Gap: A Single Domain Generalization Approach for Object DetectionCode1
Dynamic Grained Encoder for Vision TransformersCode1
FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D DetectionCode1
Rethinking Voxelization and Classification for 3D Object DetectionCode1
HRTransNet: HRFormer-Driven Two-Modality Salient Object DetectionCode1
Lightweight Salient Object Detection in Optical Remote-Sensing Images via Semantic Matching and Edge AlignmentCode1
Object as Query: Lifting any 2D Object Detector to 3D DetectionCode1
The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed ManipulationCode1
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance SegmentationCode1
A Fast Unified System for 3D Object Detection and TrackingCode1
Vision HGNN: An Image is More than a Graph of NodesCode1
Annealing-Based Label-Transfer Learning for Open World Object DetectionCode1
Learning To Generate Language-Supervised and Open-Vocabulary Scene Graph Using Pre-Trained Visual-Semantic SpaceCode1
Deep Dive Into Gradients: Better Optimization for 3D Object Detection With Gradient-Corrected IoU SupervisionCode1
Object Detection With Self-Supervised Scene AdaptationCode1
MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object DetectionCode1
Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object DetectionCode1
Masked Retraining Teacher-Student Framework for Domain Adaptive Object DetectionCode1
Benchmarking Robustness of 3D Object Detection to Common CorruptionsCode1
LSTFE-Net:Long Short-Term Feature Enhancement Network for Video Small Object DetectionCode1
Azimuth Super-Resolution for FMCW Radar in Autonomous DrivingCode1
Harmonious Teacher for Cross-Domain Object DetectionCode1
Novel Scenes & Classes: Towards Adaptive Open-set Object DetectionCode1
3DPPE: 3D Point Positional Encoding for Transformer-based Multi-Camera 3D Object DetectionCode1
PODA: Prompt-driven Zero-shot Domain AdaptationCode1
AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object DetectionCode1
Feature Aggregated Queries for Transformer-Based Video Object DetectorsCode1
CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited AnnotationsCode1
LaPE: Layer-adaptive Position Embedding for Vision Transformers with Independent Layer NormalizationCode1
A General Regret Bound of Preconditioned Gradient Method for DNN TrainingCode1
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's-Eye ViewCode1
Learning from Noisy Data for Semi-Supervised 3D Object DetectionCode1
Guided Hybrid Quantization for Object detection in Multimodal Remote Sensing Imagery via One-to-one Self-teachingCode1
Disjoint Masking with Joint Distillation for Efficient Masked Image ModelingCode1
TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry LearningCode1
Fewer is More: Efficient Object Detection in Large Aerial ImagesCode1
A Close Look at Spatial Modeling: From Attention to ConvolutionCode1
Mask Focal Loss: A unifying framework for dense crowd counting with canonical object detection networksCode1
GOOD: Exploring Geometric Cues for Detecting Objects in an Open WorldCode1
A recurrent CNN for online object detection on raw radar framesCode1
Show:102550
← PrevPage 30 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified