SOTAVerified

Single Layer Predictive Normalized Maximum Likelihood for Out-of-Distribution Detection

2021-10-18NeurIPS 2021Code Available1· sign in to hype

Koby Bibas, Meir Feder, Tal Hassner

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Detecting out-of-distribution (OOD) samples is vital for developing machine learning based models for critical safety systems. Common approaches for OOD detection assume access to some OOD samples during training which may not be available in a real-life scenario. Instead, we utilize the predictive normalized maximum likelihood (pNML) learner, in which no assumptions are made on the tested input. We derive an explicit expression of the pNML and its generalization error, denoted as the regret, for a single layer neural network (NN). We show that this learner generalizes well when (i) the test vector resides in a subspace spanned by the eigenvectors associated with the large eigenvalues of the empirical correlation matrix of the training data, or (ii) the test sample is far from the decision boundary. Furthermore, we describe how to efficiently apply the derived pNML regret to any pretrained deep NN, by employing the explicit pNML for the last layer, followed by the softmax function. Applying the derived regret to deep NN requires neither additional tunable parameters nor extra data. We extensively evaluate our approach on 74 OOD detection benchmarks using DenseNet-100, ResNet-34, and WideResNet-40 models trained with CIFAR-100, CIFAR-10, SVHN, and ImageNet-30 showing a significant improvement of up to 15.6\% over recent leading methods.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CIFAR-100 vs GaussianDenseNet-BC-100AUROC100Unverified
CIFAR-100 vs GaussianResNet-34AUROC100Unverified
CIFAR-100 vs ImageNet (C)ResNet-34AUROC98.4Unverified
CIFAR-100 vs ImageNet (C)DenseNet-BC-100AUROC99Unverified
CIFAR-100 vs ImageNet (R)ResNet-34AUROC99.2Unverified
CIFAR-100 vs ImageNet (R)DenseNet-BC-100AUROC99.5Unverified
CIFAR-100 vs iSUNResNet-34AUROC99.3Unverified
CIFAR-100 vs iSUNDenseNet-BC-100AUROC99.5Unverified
CIFAR-100 vs LSUN (C)DenseNet-BC-100AUROC96.1Unverified
CIFAR-100 vs LSUN (C)ResNet-34AUROC97.8Unverified
CIFAR-100 vs LSUN (R)ResNet-34AUROC99.6Unverified
CIFAR-100 vs LSUN (R)DenseNet-BC-100AUROC99.7Unverified
CIFAR-100 vs SVHNDenseNet-BC-100AUROC98.4Unverified
CIFAR-100 vs SVHNResNet-34AUROC97.9Unverified
CIFAR-100 vs UniformDenseNet-BC-100AUROC100Unverified
CIFAR-100 vs UniformResNet-34AUROC100Unverified
CIFAR-10 vs GaussianDenseNet-BC-100AUROC100Unverified
CIFAR-10 vs GaussianResNet-34AUROC100Unverified
CIFAR-10 vs ImageNet (C)ResNet-34AUROC99.8Unverified
CIFAR-10 vs ImageNet (C)DenseNet-BC-100AUROC99.9Unverified
CIFAR-10 vs ImageNet (R)ResNet-34AUROC99.9Unverified
CIFAR-10 vs ImageNet (R)DenseNet-BC-100AUROC99.9Unverified
CIFAR-10 vs iSUNDenseNet-BC-100AUROC100Unverified
CIFAR-10 vs iSUNResNet-34AUROC100Unverified
CIFAR-10 vs LSUN (C)ResNet-34AUROC99.5Unverified
CIFAR-10 vs LSUN (C)DenseNet-BC-100AUROC99.9Unverified
CIFAR-10 vs LSUN (R)ResNet-34AUROC100Unverified
CIFAR-10 vs LSUN (R)DenseNet-BC-100AUROC100Unverified
CIFAR-10 vs SVHNDenseNet-BC-100AUROC100Unverified
CIFAR-10 vs SVHNResNet-34AUROC99.8Unverified
CIFAR-10 vs UniformResNet-34AUROC100Unverified
CIFAR-10 vs UniformDenseNet-BC-100AUROC100Unverified
SVHN vs CIFAR-10ResNet-34AUROC99.8Unverified
SVHN vs CIFAR-10DenseNet-BC-100AUROC100Unverified
SVHN vs CIFAR-100ResNet-34AUROC99.8Unverified
SVHN vs CIFAR-100DenseNet-BC-100AUROC100Unverified
SVHN vs GaussianResNet-34AUROC100Unverified
SVHN vs GaussianDenseNet-BC-100AUROC100Unverified
SVHN vs ImageNet (C)DenseNet-BC-100AUROC100Unverified
SVHN vs ImageNet (C)ResNet-34AUROC100Unverified
SVHN vs ImageNet (R)DenseNet-BC-100AUROC100Unverified
SVHN vs ImageNet (R)ResNet-34AUROC100Unverified
SVHN vs iSUNResNet-34AUROC100Unverified
SVHN vs iSUNDenseNet-BC-100AUROC100Unverified
SVHN vs LSUN (C)DenseNet-BC-100AUROC100Unverified
SVHN vs LSUN (C)ResNet-34AUROC99.9Unverified
SVHN vs LSUN (R)ResNet-34AUROC100Unverified
SVHN vs LSUN (R)DenseNet-BC-100AUROC100Unverified
SVHN vs UniformResNet-34AUROC100Unverified
SVHN vs UniformDenseNet-BC-100AUROC100Unverified

Reproductions