Joint 3D Proposal Generation and Object Detection from View Aggregation

2017-12-06Code Available0· sign in to hype

Jason Ku, Melissa Mozifian, Jungwook Lee, Ali Harakeh, Steven Waslander

Code Available — Be the first to reproduce this paper.

Code

github.com/kujason/avod
OfficialIn papertf★ 0
github.com/asharakeh/kitti_native_evaluation
pytorch★ 0
github.com/kujason/ip_basic
none★ 0
github.com/Fredrik00/avod
tf★ 0

Abstract

We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high resolution feature maps to generate reliable 3D object proposals for multiple object classes in road scenes. Using these proposals, the second stage detection network performs accurate oriented 3D bounding box regression and category classification to predict the extents, orientation, and classification of objects in 3D space. Our proposed architecture is shown to produce state of the art results on the KITTI 3D object detection benchmark while running in real time with a low memory footprint, making it a suitable candidate for deployment on autonomous vehicles. Code is at: https://github.com/kujason/avod

Tasks

3D Object Detection Autonomous Driving Autonomous Vehicles General Classification Object object-detection Object Detection Region Proposal

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
KITTI Cars Easy	AVOD + Feature Pyramid	AP	81.94	—	Unverified
KITTI Cars Hard	AVOD + Feature Pyramid	AP	66.38	—	Unverified
KITTI Cyclists Easy	AVOD + Feature Pyramid	AP	64	—	Unverified
KITTI Cyclists Hard	AVOD + Feature Pyramid	AP	46.61	—	Unverified
KITTI Cyclists Moderate	AVOD + Feature Pyramid	AP	52.18	—	Unverified
KITTI Pedestrians Easy	AVOD + Feature Pyramid	AP	50.8	—	Unverified
KITTI Pedestrians Hard	AVOD + Feature Pyramid	AP	40.88	—	Unverified
KITTI Pedestrians Moderate	AVOD + Feature Pyramid	AP	42.81	—	Unverified

Joint 3D Proposal Generation and Object Detection from View Aggregation

Code

Abstract

Tasks

Benchmark Results

Reproductions