MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

2021-08-22Code Available1· sign in to hype

Faranak Shamsafar, Samuel Woerz, Rafia Rahim, Andreas Zell

Code Available — Be the first to reproduce this paper.

Code

github.com/cogsys-tuebingen/mobilestereonet
OfficialIn paperpytorch★ 300
github.com/ibaiGorordo/ONNX-MobileStereoNet
pytorch★ 47
github.com/UCI-ISA-Lab/MultiHeadDepth-HomoDepth
pytorch★ 20
github.com/ibaiGorordo/TFLite-MobileStereoNet
tf★ 8

Abstract

Recent methods in stereo matching have continuously improved the accuracy using deep models. This gain, however, is attained with a high increase in computation cost, such that the network may not fit even on a moderate GPU. This issue raises problems when the model needs to be deployed on resource-limited devices. For this, we propose two light models for stereo vision with reduced complexity and without sacrificing accuracy. Depending on the dimension of cost volume, we design a 2D and a 3D model with encoder-decoders built from 2D and 3D convolutions, respectively. To this end, we leverage 2D MobileNet blocks and extend them to 3D for stereo vision application. Besides, a new cost volume is proposed to boost the accuracy of the 2D model, making it performing close to 3D networks. Experiments show that the proposed 2D/3D networks effectively reduce the computational expense (27%/95% and 72%/38% fewer parameters/operations in 2D and 3D models, respectively) while upholding the accuracy. Our code is available at https://github.com/cogsys-tuebingen/mobilestereonet.

Tasks

Depth Estimation Disparity Estimation GPU Stereo Depth Estimation Stereo Matching

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
KITTI2015	3D-MobileStereoNet	three pixel error	1.69	—	Unverified
KITTI2015	2D-MobileStereoNet	three pixel error	2.67	—	Unverified
sceneflow	3D-MobileStereoNet	Average End-Point Error	0.8	—	Unverified
sceneflow	2D-MobileStereoNet	Average End-Point Error	1.14	—	Unverified

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

Code

Abstract

Tasks

Benchmark Results

Reproductions