The Fishyscapes Benchmark: Measuring Blind Spots in Semantic Segmentation

2019-04-05Code Available0· sign in to hype

Hermann Blum, Paul-Edouard Sarlin, Juan Nieto, Roland Siegwart, Cesar Cadena

Code Available — Be the first to reproduce this paper.

Code

github.com/hermannsblum/fishyscapes
Officialtf★ 0

Abstract

Deep learning has enabled impressive progress in the accuracy of semantic segmentation. Yet, the ability to estimate uncertainty and detect failure is key for safety-critical applications like autonomous driving. Existing uncertainty estimates have mostly been evaluated on simple tasks, and it is unclear whether these methods generalize to more complex scenarios. We present Fishyscapes, the first public benchmark for uncertainty estimation in a real-world task of semantic segmentation for urban driving. It evaluates pixel-wise uncertainty estimates towards the detection of anomalous objects in front of the vehicle. We~adapt state-of-the-art methods to recent semantic segmentation models and compare approaches based on softmax confidence, Bayesian learning, and embedding density. Our results show that anomaly detection is far from solved even for ordinary situations, while our benchmark allows measuring advancements beyond the state-of-the-art.

Tasks

Anomaly Detection Autonomous Driving Segmentation Semantic Segmentation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Fishyscapes L&F	Dirichlet DeepLab	AP	34.28	—	Unverified
Fishyscapes L&F	Void Classifier	AP	10.29	—	Unverified
Fishyscapes L&F	Bayesian DeepLab	AP	9.8	—	Unverified
Fishyscapes L&F	Learned Embedding Density	AP	4.7	—	Unverified
Fishyscapes L&F	Softmax Entropy	AP	2.9	—	Unverified

The Fishyscapes Benchmark: Measuring Blind Spots in Semantic Segmentation

Code

Abstract

Tasks

Benchmark Results

Reproductions