Locally optimal detection of stochastic targeted universal adversarial perturbations

2020-12-08

Amish Goel, Pierre Moulin

Abstract

Deep learning image classifiers are known to be vulnerable to small adversarial perturbations of input images. In this paper, we derive the locally optimal generalized likelihood ratio test (LO-GLRT) based detector for detecting stochastic targeted universal adversarial perturbations (UAPs) of the classifier inputs. We also describe a supervised training method to learn the detector's parameters, and demonstrate better performance of the detector compared to other detection methods on several popular image classification datasets.
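
For readers unfamiliar with the term, the following is a minimal sketch of the textbook locally optimal test for a weak additive perturbation, combined with a GLRT-style maximization over an unknown target class. It is not the paper's derivation. The notation (clean-input density p_0, perturbation strength epsilon, per-target perturbation u_t, threshold tau) is assumed here for illustration, and the perturbation is treated as deterministic given the target, which does not capture the stochastic setting the abstract describes.

```latex
% Assumed notation, not taken from the paper:
%   H_0: x \sim p_0                       (clean input)
%   H_1: x = x_0 + \epsilon u_t,\ x_0 \sim p_0   (perturbed toward target t)
\begin{align}
  p_\epsilon(x; t) &= p_0(x - \epsilon u_t) \\
  % Locally optimal statistic: derivative of the log-likelihood at epsilon = 0
  T_t(x) &= \left. \frac{\partial}{\partial \epsilon}
            \log p_\epsilon(x; t) \right|_{\epsilon = 0}
          = -\, u_t^{\top} \nabla_x \log p_0(x) \\
  % GLRT-style step: maximize over the unknown target, then threshold
  T(x) &= \max_{t} T_t(x)
          \underset{H_0}{\overset{H_1}{\gtrless}} \tau
\end{align}
```

In this sketch, the statistic depends on the clean-data distribution only through its score function, which is why a locally optimal test is suited to perturbations of small magnitude.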

Tasks

Image Classification
