Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement
Kai Xu, Rongyu Chen, Gianni Franchi, Angela Yao
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/kai422/scaleOfficialIn paperpytorch★ 15
Abstract
The capacity of a modern deep learning system to determine if a sample falls within its realm of knowledge is fundamental and important. In this paper, we offer insights and analyses of recent state-of-the-art out-of-distribution (OOD) detection methods - extremely simple activation shaping (ASH). We demonstrate that activation pruning has a detrimental effect on OOD detection, while activation scaling enhances it. Moreover, we propose SCALE, a simple yet effective post-hoc network enhancement method for OOD detection, which attains state-of-the-art OOD detection performance without compromising in-distribution (ID) accuracy. By integrating scaling concepts into the training process to capture a sample's ID characteristics, we propose Intermediate Tensor SHaping (ISH), a lightweight method for training time OOD detection enhancement. We achieve AUROC scores of +1.85\% for near-OOD and +0.74\% for far-OOD datasets on the OpenOOD v1.5 ImageNet-1K benchmark. Our code and models are available at https://github.com/kai422/SCALE.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Far-OOD | ISH (ResNet50) | AUROC | 96.79 | — | Unverified |
| Far-OOD | SCALE (ResNet50) | AUROC | 96.53 | — | Unverified |
| ImageNet-1k vs Curated OODs (avg.) | SCALE (ResNet50) | FPR95 | 20.05 | — | Unverified |
| ImageNet-1k vs iNaturalist | SCALE (ResNet50) | AUROC | 98.17 | — | Unverified |
| ImageNet-1k vs Places | SCALE (ResNet50) | FPR95 | 34.51 | — | Unverified |
| ImageNet-1k vs SUN | SCALE (ResNet50) | FPR95 | 23.27 | — | Unverified |
| ImageNet-1k vs Textures | SCALE (ResNet50) | AUROC | 97.37 | — | Unverified |
| Near-OOD | ISH (ResNet50) | AUROC | 84.01 | — | Unverified |
| Near-OOD | SCALE (ResNet50) | AUROC | 81.36 | — | Unverified |