RedNet: Residual Encoder-Decoder Network for indoor RGB-D Semantic Segmentation

2018-06-04

Jindong Jiang, Lunan Zheng, Fei Luo, Zhijun Zhang


Code

github.com/JindongJiang/RedNet (official, in paper; pytorch)
github.com/lyqcom/rednet30 (mindspore)
github.com/MindSpore-paper-code-2/code2/tree/main/REDNet30 (mindspore)
github.com/2023-MindSpore-4/Code6/tree/main/REDNet30 (no framework listed)
github.com/dodoseung/rednet-residual-encoder-decoder-network-pytorch (pytorch)
github.com/2023-MindSpore-1/ms-code-216/tree/main/REDNet30 (mindspore)
github.com/code-implementation1/Code7/tree/main/REDNet30 (mindspore)
github.com/MindSpore-paper-code-3/code5/tree/main/REDNet30 (mindspore)

Abstract

Indoor semantic segmentation has always been a difficult task in computer vision. In this paper, we propose an RGB-D residual encoder-decoder architecture, named RedNet, for indoor RGB-D semantic segmentation. In RedNet, the residual module serves as the basic building block of both the encoder and the decoder, and skip connections pass spatial features from the encoder to the decoder. To incorporate the depth information of the scene, we construct a fusion structure that processes the RGB image and the depth image in separate branches and fuses their features at several layers. To optimize the network's parameters efficiently, we propose a "pyramid supervision" training scheme, which applies supervised learning at different layers of the decoder to cope with the vanishing-gradient problem. Experimental results show that the proposed RedNet (ResNet-50) achieves a state-of-the-art mIoU of 47.8% on the SUN RGB-D benchmark dataset.
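
The pyramid supervision idea in the abstract can be illustrated with a minimal sketch in plain Python. The abstract does not state how many decoder layers are supervised, at which resolutions, or how the per-layer losses are weighted, so everything below is an assumption: the function names are hypothetical, the ground-truth map is downsampled by nearest-neighbour striding, and the per-scale cross-entropy losses are summed with equal weight.

```python
import math


def downsample_labels(labels, factor):
    """Nearest-neighbour downsampling of a 2-D label map by an integer factor."""
    return [row[::factor] for row in labels[::factor]]


def cross_entropy(logits, labels):
    """Mean pixel-wise softmax cross-entropy.

    logits: H x W x C nested lists of class scores.
    labels: H x W nested lists of integer class indices.
    """
    total, count = 0.0, 0
    for logit_row, label_row in zip(logits, labels):
        for pixel_logits, label in zip(logit_row, label_row):
            peak = max(pixel_logits)  # subtract the max for numerical stability
            log_norm = peak + math.log(sum(math.exp(v - peak) for v in pixel_logits))
            total += log_norm - pixel_logits[label]
            count += 1
    return total / count


def pyramid_supervision_loss(outputs, labels):
    """Sum the cross-entropy of decoder outputs taken at several resolutions.

    outputs: list of (factor, logits) pairs, where `factor` is the assumed
    downsampling factor of that decoder output relative to the label map.
    Equal weighting of the scales is an assumption of this sketch.
    """
    return sum(
        cross_entropy(logits, downsample_labels(labels, factor))
        for factor, logits in outputs
    )


# Toy example: a 4x4 label map with 2 classes, supervised at full and half resolution.
labels = [[0, 0, 1, 1]] * 4
full_res = [[[0.0, 0.0] for _ in range(4)] for _ in range(4)]
half_res = [[[0.0, 0.0] for _ in range(2)] for _ in range(2)]
loss = pyramid_supervision_loss([(1, full_res), (2, half_res)], labels)
```

Because each intermediate decoder output contributes its own loss term, layers far from the final prediction receive a direct training signal, which is the motivation the abstract gives for the scheme.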

Tasks

Decoder, Segmentation, Semantic Segmentation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
NYU-Depth V2	RedNet	Mean IoU	47.2	—	Unverified
SUN-RGBD	TokenFusion (Ti)	Mean IoU	49.4	—	Unverified
SUN-RGBD	RedNet (ResNet-50)	Mean IoU	47.8	—	Unverified
SUN-RGBD	TokenFusion (Ti)	Mean IoU	51.4	—	Unverified
THUD Robotic Dataset	RedNet	mIoU	76.92	—	Unverified
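
For readers unfamiliar with the metric in the table, here is a minimal sketch of mean IoU over flattened label lists. The function name `mean_iou` is hypothetical, and skipping classes that appear in neither prediction nor ground truth is an assumption of this sketch; benchmark implementations typically accumulate a confusion matrix over the whole dataset and may handle ignored labels differently.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes.

    pred, target: flat sequences of integer class indices of equal length.
    Classes absent from both pred and target are skipped (an assumption).
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Class 0: intersection 1, union 2. Class 1: intersection 2, union 3.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

The table reports this quantity as a percentage, so a returned value of 0.478 would correspond to 47.8.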
