SOTAVerified

Object Detection

Papers

Showing 501550 of 10957 papers

TitleStatusHype
Beyond Self-attention: External Attention using Two Linear Layers for Visual TasksCode2
UniFormer: Unifying Convolution and Self-attention for Visual RecognitionCode2
Efficient Multi-Scale Attention Module with Cross-Spatial LearningCode2
Universal Guidance for Diffusion ModelsCode2
BiFormer: Vision Transformer with Bi-Level Routing AttentionCode2
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object DetectionCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising DiffusionCode2
VEViD: Vision Enhancement via Virtual diffraction and coherent DetectionCode2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene ImageryCode2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural NetworksCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object DetectionCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Distance-IoU Loss: Faster and Better Learning for Bounding Box RegressionCode2
DQ-DETR: DETR with Dynamic Query for Tiny Object DetectionCode2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsCode2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the DetailsCode2
DiffusionTrack: Diffusion Model For Multi-Object TrackingCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
Dilated Neighborhood Attention TransformerCode2
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object DetectionCode2
DetGPT: Detect What You Need via ReasoningCode2
DEYO: DETR with YOLO for End-to-End Object DetectionCode2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelCode2
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
Detection in Crowded Scenes: One Proposal, Multiple PredictionsCode2
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure AnalysisCode2
Detect Everything with Few ExamplesCode2
Detecting Everything in the Open World: Towards Universal Object DetectionCode2
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous ConvolutionCode2
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTsCode2
Dense Distinct Query for End-to-End Object DetectionCode2
DeepInteraction: 3D Object Detection via Modality InteractionCode2
DETR Does Not Need Multi-Scale or Locality DesignCode2
Decoupled Knowledge DistillationCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object DetectionCode2
DFormer: Rethinking RGBD Representation Learning for Semantic SegmentationCode2
Agent Attention: On the Integration of Softmax and Linear AttentionCode2
ALBench: A Framework for Evaluating Active Learning in Object DetectionCode2
DaViT: Dual Attention Vision TransformersCode2
DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionCode2
Deep Incubation: Training Large Models by Divide-and-ConqueringCode2
Bottleneck Transformers for Visual RecognitionCode2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Dataset QuantizationCode2
Show:102550
← PrevPage 11 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified