A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

2016-10-07Code Available1· sign in to hype

Dan Hendrycks, Kevin Gimpel

Code Available — Be the first to reproduce this paper.

Code

github.com/hendrycks/error-detection
OfficialIn papertf★ 0
github.com/zjysteven/mixoe
pytorch★ 26
github.com/kobybibas/pnml_ood_detection
pytorch★ 25
github.com/guyAmit/GLOD
pytorch★ 16
github.com/dabsdamoon/MNIST-Auxiliary-Decoder
none★ 0
github.com/drumpt/RotNet-OOD
pytorch★ 0
github.com/JakobCode/UncertaintyInNeuralNetworks_Resources
pytorch★ 0
github.com/sooonwoo/RotNet-OOD
pytorch★ 0
github.com/oliverzhang42/ood_medical_images
pytorch★ 0
github.com/2sang/OOD-baseline
tf★ 0

Abstract

We consider the two related problems of detecting if an example is misclassified or out-of-distribution. We present a simple baseline that utilizes probabilities from softmax distributions. Correctly classified examples tend to have greater maximum softmax probabilities than erroneously classified and out-of-distribution examples, allowing for their detection. We assess performance by defining several tasks in computer vision, natural language processing, and automatic speech recognition, showing the effectiveness of this baseline across all. We then show the baseline can sometimes be surpassed, demonstrating the room for future research on these underexplored detection tasks.

Tasks

Anomaly Detection Automatic Speech Recognition Automatic Speech Recognition (ASR)Out-of-Distribution Detection Speech Recognition

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CIFAR-10 vs CIFAR-100	WRN 40-2 (MSP Baseline)	AUROC	87.9	—	Unverified

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

Code

Abstract

Tasks

Benchmark Results

Reproductions