SOTAVerified

Object Detection

Papers

Showing 351400 of 10957 papers

TitleStatusHype
RevColV2: Exploring Disentangled Representations in Masked Image ModelingCode2
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance SegmentationCode2
Dataset QuantizationCode2
Turning a CLIP Model into a Scene Text SpotterCode2
DiffusionTrack: Diffusion Model For Multi-Object TrackingCode2
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera VideosCode2
ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object DetectionCode2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View RepresentationCode2
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object DetectionCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object DetectionCode2
COCO-O: A Benchmark for Object Detectors under Natural Distribution ShiftsCode2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and FutureCode2
Scale-Aware Modulation Meet TransformerCode2
Hierarchical Open-vocabulary Universal Image SegmentationCode2
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph MatchingCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
FasterViT: Fast Vision Transformers with Hierarchical AttentionCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
SAM3D: Zero-Shot 3D Object Detection via Segment Anything ModelCode2
Multi-modal Queried Object Detection in the WildCode2
UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous DrivingCode2
Contextual Object Detection with Multimodal Large Language ModelsCode2
Efficient Multi-Scale Attention Module with Cross-Spatial LearningCode2
DetGPT: Detect What You Need via ReasoningCode2
Going Denser with Open-Vocabulary Part SegmentationCode2
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point CloudsCode2
OctFormer: Octree-based Transformers for 3D Point CloudsCode2
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything ModelCode2
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object DetectionCode2
A Strong and Reproducible Object Detector with Only Public DatasetsCode2
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive ReviewCode2
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual RecognitionCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Vision Transformer with Quadrangle AttentionCode2
Spherical Transformer for LiDAR-based 3D RecognitionCode2
Dense Distinct Query for End-to-End Object DetectionCode2
Detecting Everything in the Open World: Towards Universal Object DetectionCode2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionCode2
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and TrackingCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
Large Selective Kernel Network for Remote Sensing Object DetectionCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
BiFormer: Vision Transformer with Bi-Level Routing AttentionCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative PerceptionCode2
Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETRCode2
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale AttentionCode2
Virtual Sparse Convolution for Multimodal 3D Object DetectionCode2
Pillar R-CNN for Point Cloud 3D Object DetectionCode2
Show:102550
← PrevPage 8 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified