Characterizing Missing Information in Deep Networks Using Backpropagated Gradients

2020-01-01ICLR 2020Unverified0· sign in to hype

Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib

Unverified — Be the first to reproduce this paper.

Abstract

Deep networks face challenges of ensuring their robustness against inputs that cannot be effectively represented by information learned from training data. We attribute this vulnerability to the limitations inherent to activation-based representation. To complement the learned information from activation-based representation, we propose utilizing a gradient-based representation that explicitly focuses on missing information. In addition, we propose a directional constraint on the gradients as an objective during training to improve the characterization of missing information. To validate the effectiveness of the proposed approach, we compare the anomaly detection performance of gradient-based and activation-based representations. We show that the gradient-based representation outperforms the activation-based representation by 0.093 in CIFAR-10 and 0.361 in CURE-TSR datasets in terms of AUROC averaged over all classes. Also, we propose an anomaly detection algorithm that uses the gradient-based representation, denoted as GradCon, and validate its performance on three benchmarking datasets. The proposed method outperforms the majority of the state-of-the-art algorithms in CIFAR-10, MNIST, and fMNIST datasets with an average AUROC of 0.664, 0.973, and 0.934, respectively.

Tasks

Anomaly Detection Attribute Benchmarking

Characterizing Missing Information in Deep Networks Using Backpropagated Gradients

Abstract

Tasks

Reproductions