SOTAVerified

Object Detection

Papers

Showing 451500 of 10957 papers

TitleStatusHype
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with TransformersCode2
DetGPT: Detect What You Need via ReasoningCode2
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual LossCode2
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersCode2
Reversible Column NetworksCode2
DETR Does Not Need Multi-Scale or Locality DesignCode2
Revisiting Unreasonable Effectiveness of Data in Deep Learning EraCode2
Roboflow 100: A Rich, Multi-Domain Object Detection BenchmarkCode2
Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language ModelsCode2
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object DetectionCode2
COALA: A Practical and Vision-Centric Federated Learning PlatformCode2
Dilated Neighborhood Attention TransformerCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
Exploring Plain Vision Transformer Backbones for Object DetectionCode2
A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint SpaceCode2
Detecting Everything in the Open World: Towards Universal Object DetectionCode2
Samba: A Unified Mamba-based Framework for General Salient Object DetectionCode2
Detect Everything with Few ExamplesCode2
SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image InterpretationCode2
Scale Normalized Image Pyramids with AutoFocus for Object DetectionCode2
Scaling Efficient Masked Image Modeling on Large Remote Sensing DatasetCode2
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation TrainingCode2
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNsCode2
Detection in Crowded Scenes: One Proposal, Multiple PredictionsCode2
PubTables-1M: Towards comprehensive table extraction from unstructured documentsCode2
SCSA: Exploring the Synergistic Effects Between Spatial and Channel AttentionCode2
SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual RecognitionCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
Dense Distinct Query for End-to-End Object DetectionCode2
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object DetectionCode2
Self-Supervised Transformers for Unsupervised Object Discovery using Normalized CutCode2
SFSORT: Scene Features-based Simple Online Real-Time TrackerCode2
SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing IndustryCode2
ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose OptimizationCode2
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionCode2
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTsCode2
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch NormalizationCode2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object DetectionCode2
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure AnalysisCode2
SNIPER: Efficient Multi-Scale TrainingCode2
Scalable SoftGroup for 3D Instance Segmentation on Point CloudsCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object DetectionCode2
Deep Neural Networks to Detect Weeds from Crops in Agricultural Environments in Real-Time: A ReviewCode2
BatchFormerV2: Exploring Sample Relationships for Dense Representation LearningCode2
Deep Incubation: Training Large Models by Divide-and-ConqueringCode2
Sparse Instance Activation for Real-Time Instance SegmentationCode2
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data GenerationCode2
Deep PCB To COCO ConvertorCode2
Show:102550
← PrevPage 10 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified