Expectation-Maximization Attention Networks for Semantic Segmentation

2019-07-31ICCV 2019Code Available0· sign in to hype

Xia Li, Zhisheng Zhong, Jianlong Wu, Yibo Yang, Zhouchen Lin, Hong Liu

Code Available — Be the first to reproduce this paper.

Code

github.com/hendraet/synthesis-in-style
pytorch★ 8
github.com/mfp0610/semantic-segmentaion
pytorch★ 2
github.com/XiaLiPKU/EMANet
pytorch★ 0

Abstract

Self-attention mechanism has been widely used for various tasks. It is designed to compute the representation of each position by a weighted sum of the features at all positions. Thus, it can capture long-range relations for computer vision tasks. However, it is computationally consuming. Since the attention maps are computed w.r.t all other positions. In this paper, we formulate the attention mechanism into an expectation-maximization manner and iteratively estimate a much more compact set of bases upon which the attention maps are computed. By a weighted summation upon these bases, the resulting representation is low-rank and deprecates noisy information from the input. The proposed Expectation-Maximization Attention (EMA) module is robust to the variance of input and is also friendly in memory and computation. Moreover, we set up the bases maintenance and normalization methods to stabilize its training procedure. We conduct extensive experiments on popular semantic segmentation benchmarks including PASCAL VOC, PASCAL Context and COCO Stuff, on which we set new records.

Tasks

Semantic Segmentation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
BDD100K val	EMANet	mIoU	61.4	—	Unverified
COCO-Stuff test	EMANet	mIoU	39.9	—	Unverified
PASCAL Context	EMANet	mIoU	53.1	—	Unverified

Expectation-Maximization Attention Networks for Semantic Segmentation

Code

Abstract

Tasks

Benchmark Results

Reproductions