Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation

2016-05-08

Golnaz Ghiasi, Charless C. Fowlkes


Code

github.com/golnazghiasi/LRR
Official, in paper

Abstract

CNN architectures have terrific recognition performance but rely on spatial pooling, which makes it difficult to adapt them to tasks that require dense, pixel-accurate labeling. This paper makes two contributions: (1) We demonstrate that while the apparent spatial resolution of convolutional feature maps is low, the high-dimensional feature representation contains significant sub-pixel localization information. (2) We describe a multi-resolution reconstruction architecture based on a Laplacian pyramid that uses skip connections from higher-resolution feature maps and multiplicative gating to successively refine segment boundaries reconstructed from lower-resolution maps. This approach yields state-of-the-art semantic segmentation results on the PASCAL VOC and Cityscapes segmentation benchmarks without resorting to more complex random-field inference or instance-detection-driven architectures.
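As a rough illustration of the Laplacian pyramid idea the abstract builds on, the sketch below decomposes a 2D score map into a coarse map plus per-level residuals and then reconstructs it from coarse to fine. This is the classical pyramid only, not the paper's network: the function names, the 2x2 average pooling, and the nearest-neighbour upsampling are assumptions chosen for brevity, and the skip connections and multiplicative gating described above are not modeled.

```python
import numpy as np

def downsample(x):
    """Halve resolution by averaging non-overlapping 2x2 blocks (assumed filter)."""
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def upsample(x):
    """Double resolution by nearest-neighbour repetition (assumed filter)."""
    return x.repeat(2, axis=0).repeat(2, axis=1)

def build_pyramid(x, levels):
    """Return the coarsest map and the residuals, ordered fine to coarse."""
    residuals = []
    for _ in range(levels):
        low = downsample(x)
        # The residual holds the detail lost by going down one level.
        residuals.append(x - upsample(low))
        x = low
    return x, residuals

def reconstruct(coarse, residuals):
    """Rebuild the full-resolution map by adding residuals coarse to fine."""
    x = coarse
    for r in reversed(residuals):
        x = upsample(x) + r
    return x

rng = np.random.default_rng(0)
score_map = rng.random((16, 16))  # stand-in for a single-class score map
coarse, residuals = build_pyramid(score_map, levels=3)
restored = reconstruct(coarse, residuals)
```

Here the residuals are computed from the full-resolution input, so reconstruction is exact up to floating-point error. In the architecture the abstract describes, the corresponding high-frequency terms would instead be predicted from higher-resolution feature maps.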

Tasks

Segmentation, Semantic Segmentation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Cityscapes test	LRR-4x	Mean IoU (class)	71.8	—	Unverified
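For readers unfamiliar with the metric in the table, the following is a minimal sketch of class-averaged intersection over union on a single label map. The helper `mean_iou` and the toy arrays are hypothetical. Benchmark evaluations typically accumulate counts over the whole test set and skip ignored labels, which this simplified version does not do.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Average per-class IoU, skipping classes absent from both maps."""
    ious = []
    for c in range(num_classes):
        pred_c = pred == c
        target_c = target == c
        union = np.logical_or(pred_c, target_c).sum()
        if union == 0:
            continue  # class does not occur; leave it out of the average
        inter = np.logical_and(pred_c, target_c).sum()
        ious.append(inter / union)
    return float(np.mean(ious))

# Toy 2x4 label maps with two classes; one pixel is mislabeled.
target = np.array([[0, 0, 1, 1],
                   [0, 0, 1, 1]])
pred = np.array([[0, 0, 0, 1],
                 [0, 0, 1, 1]])
score = mean_iou(pred, target, num_classes=2)
```

In this toy case class 0 has IoU 4/5 and class 1 has IoU 3/4, so the mean is 0.775.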
