SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 301325 of 1576 papers

TitleStatusHype
Aligning Bird-Eye View Representation of Point Cloud Sequences using Scene FlowCode1
Fusion is Not Enough: Single Modal Attacks on Fusion Models for 3D Object DetectionCode1
DQS3D: Densely-matched Quantization-aware Semi-supervised 3D DetectionCode1
Fully Sparse Fusion for 3D Object DetectionCode1
MetaBEV: Solving Sensor Failures for BEV Detection and Map SegmentationCode1
Density-Insensitive Unsupervised Domain Adaption on 3D Object DetectionCode1
Parcel3D: Shape Reconstruction from Single RGB Images for Applications in Transportation LogisticsCode1
RadarGNN: Transformation Invariant Graph Neural Network for Radar-based PerceptionCode1
Curricular Object Manipulation in LiDAR-based Object DetectionCode1
MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object DetectionCode1
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object PredictionCode1
VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object DetectionCode1
Open-Vocabulary Point-Cloud Object Detection without 3D AnnotationCode1
CRN: Camera Radar Net for Accurate, Robust, Efficient 3D PerceptionCode1
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision TransformerCode1
Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous DrivingCode1
SimDistill: Simulated Multi-modal Distillation for BEV 3D Object DetectionCode1
DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and TrackingCode1
LinK: Linear Kernel for LiDAR-based 3D PerceptionCode1
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye ViewCode1
Viewpoint Equivariance for Multi-View 3D Object DetectionCode1
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-TrainingCode1
Constructing Metric-Semantic Maps using Floor Plan Priors for Long-Term Indoor LocalizationCode1
VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object DetectionCode1
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D UnderstandingCode1
Show:102550
← PrevPage 13 of 64Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MegFusionNDS0.77Unverified
3MMFusion-eNDS0.77Unverified
4BEVFusion-eNDS0.76Unverified
5RacoonPowerNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8FusionVPENDS0.75Unverified
9FocalFormer3D-FNDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified