SOTAVerified

3D Object Detection

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Papers

Showing 76100 of 1576 papers

TitleStatusHype
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training ParadigmCode2
UniPAD: A Universal Pre-training Paradigm for Autonomous DrivingCode2
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object DetectionCode2
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera VideosCode2
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View RepresentationCode2
FocalFormer3D : Focusing on Hard Instance for 3D Object DetectionCode2
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object DetectionCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
SAM3D: Zero-Shot 3D Object Detection via Segment Anything ModelCode2
UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous DrivingCode2
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point CloudsCode2
OctFormer: Octree-based Transformers for 3D Point CloudsCode2
TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene UnderstandingCode2
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object DetectionCode2
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene UnderstandingCode2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth EstimationCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Spherical Transformer for LiDAR-based 3D RecognitionCode2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionCode2
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and TrackingCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
BEVHeight: A Robust Framework for Vision-based Roadside 3D Object DetectionCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative PerceptionCode2
Show:102550
← PrevPage 4 of 64Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EA-LSSNDS0.78Unverified
2MMFusion-eNDS0.77Unverified
3MegFusionNDS0.77Unverified
4RacoonPowerNDS0.76Unverified
5BEVFusion-eNDS0.76Unverified
6DeepInteraction-largeNDS0.76Unverified
7DeepInteraction-eNDS0.76Unverified
8DAANDS0.75Unverified
9FusionVPENDS0.75Unverified
10CenterPoint-FusionNDS0.75Unverified