BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

2022-06-21Code Available2· sign in to hype

Yinhao Li, Zheng Ge, Guanyi Yu, Jinrong Yang, Zengran Wang, Yukang Shi, Jianjian Sun, Zeming Li

Code Available — Be the first to reproduce this paper.

Code

github.com/megvii-basedetection/bevdepth
OfficialIn paperpytorch★ 862
github.com/ZRandomize/MatrixVT
pytorch★ 47

Abstract

In this research, we propose a new 3D object detector with a trustworthy depth estimation, dubbed BEVDepth, for camera-based Bird's-Eye-View (BEV) 3D object detection. Our work is based on a key observation -- depth estimation in recent approaches is surprisingly inadequate given the fact that depth is essential to camera 3D detection. Our BEVDepth resolves this by leveraging explicit depth supervision. A camera-awareness depth estimation module is also introduced to facilitate the depth predicting capability. Besides, we design a novel Depth Refinement Module to counter the side effects carried by imprecise feature unprojection. Aided by customized Efficient Voxel Pooling and multi-frame mechanism, BEVDepth achieves the new state-of-the-art 60.9% NDS on the challenging nuScenes test set while maintaining high efficiency. For the first time, the NDS score of a camera model reaches 60%.

Tasks

3D Object Detection Depth Estimation Object Detection Robust Camera Only 3D Object Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
DAIR-V2X-I	BEVDepth	AP\|R40(moderate)	63.6	—	Unverified
nuScenes Camera Only	BEVDepth-pure	NDS	60.9	—	Unverified
Rope3D	BEVDepth	AP@0.7	42.56	—	Unverified

BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

Code

Abstract

Tasks

Benchmark Results

Reproductions