SOTAVerified

Object Detection

Papers

Showing 501550 of 10957 papers

TitleStatusHype
BatchFormerV2: Exploring Sample Relationships for Dense Representation LearningCode2
Multi-Class Road User Detection With 3+1D Radar in the View-of-Delft DatasetCode2
Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object DetectionCode2
AdaMixer: A Fast-Converging Query-Based Object DetectorCode2
Exploring Plain Vision Transformer Backbones for Object DetectionCode2
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving DataCode2
LiDAR Snowfall Simulation for Robust 3D Object DetectionCode2
MonoDETR: Depth-guided Transformer for Monocular 3D Object DetectionCode2
Sparse Instance Activation for Real-Time Instance SegmentationCode2
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-trainingCode2
Real-time Object Detection for Streaming PerceptionCode2
Focal Modulation NetworksCode2
Open-Vocabulary DETR with Conditional MatchingCode2
TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with TransformersCode2
Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point CloudsCode2
V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision TransformerCode2
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point CloudsCode2
Sparse Fuse Dense: Towards High Quality 3D Detection with Depth CompletionCode2
HybridNets: End-to-End Perception NetworkCode2
Decoupled Knowledge DistillationCode2
Accelerating DETR Convergence via Semantic-Aligned MatchingCode2
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training QuantizationCode2
A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object DetectionCode2
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with TransformersCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
F2DNet: Fast Focal Detection Network for Pedestrian DetectionCode2
StrongSORT: Make DeepSORT Great AgainCode2
FreeSOLO: Learning to Segment Objects without AnnotationsCode2
Self-Supervised Transformers for Unsupervised Object Discovery using Normalized CutCode2
GroupViT: Semantic Segmentation Emerges from Text SupervisionCode2
Tiny Object Tracking: A Large-scale Dataset and A BaselineCode2
Context Autoencoder for Self-Supervised Representation LearningCode2
VOS: Learning What You Don't Know by Virtual Outlier SynthesisCode2
The KFIoU Loss for Rotated Object DetectionCode2
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETRCode2
RelTR: Relation Transformer for Scene Graph GenerationCode2
When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention MechanismCode2
UniFormer: Unifying Convolution and Self-attention for Visual RecognitionCode2
TransVOD: End-to-End Video Object Detection with Spatial-Temporal TransformersCode2
Pedestrian Detection: Domain Generalization, CNNs, Transformers and BeyondCode2
QuadTree Attention for Vision TransformersCode2
Equalized Focal Loss for Dense Long-Tailed Object DetectionCode2
Vision Transformer with Deformable AttentionCode2
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-ViewCode2
Grounded Language-Image Pre-trainingCode2
MetaFormer Is Actually What You Need for VisionCode2
Attention Mechanisms in Computer Vision: A SurveyCode2
Deep Neural Networks to Detect Weeds from Crops in Agricultural Environments in Real-Time: A ReviewCode2
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision TransformerCode2
PubTables-1M: Towards comprehensive table extraction from unstructured documentsCode2
Show:102550
← PrevPage 11 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified