Patch-wise Attack for Fooling Deep Neural Network
Lianli Gao, Qilong Zhang, Jingkuan Song, Xianglong Liu, Heng Tao Shen
Code:
- github.com/qilong-zhang/Patch-wise-iterative-attack (official, TensorFlow)
- github.com/ylhz/tf_to_pytorch_model (PyTorch)
- github.com/qilong-zhang/Targeted_Patch-wise-plusplus_iterative_attack (TensorFlow)
Abstract
By adding human-imperceptible noise to clean images, the resulting adversarial examples can fool other, unknown models. The features a deep neural network (DNN) extracts for a pixel are influenced by its surrounding region, and different DNNs generally focus on different discriminative regions during recognition. Motivated by this, we propose a patch-wise iterative algorithm -- a black-box attack against both mainstream normally trained and defense models -- which differs from existing attack methods that manipulate pixel-wise noise. In this way, our adversarial examples achieve strong transferability without sacrificing white-box attack performance. Specifically, we introduce an amplification factor to the step size in each iteration, and the portion of a pixel's overall gradient that overflows the ε-constraint is assigned to its surrounding region by a project kernel. Our method can be integrated into any gradient-based attack method. Compared with the current state-of-the-art attacks, we significantly improve the success rate by 9.2% on defense models and 3.7% on normally trained models on average. Our code is available at https://github.com/qilong-zhang/Patch-wise-iterative-attack
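To make the update rule in the abstract concrete, here is a minimal NumPy sketch of one patch-wise iteration on a single-channel image: an amplified sign step is taken, the noise that overflows the ε-ball ("cut noise") is measured, and that overflow is redistributed to neighboring pixels through a uniform project kernel. The function names (`pi_fgsm_step`, `project_kernel`), the kernel shape, and the hyperparameters are illustrative assumptions, not the paper's exact implementation.

```python
import numpy as np

def project_kernel(size=3):
    # Uniform kernel with a zeroed center: spreads overflow to neighbors only.
    # (Assumed shape; the paper's actual kernel may differ.)
    k = np.ones((size, size), dtype=np.float32)
    k[size // 2, size // 2] = 0.0
    return k / k.sum()

def conv2d_same(x, k):
    # Naive 'same' convolution with zero padding; enough for a sketch.
    h, w = x.shape
    kh, kw = k.shape
    ph, pw = kh // 2, kw // 2
    xp = np.pad(x, ((ph, ph), (pw, pw)))
    out = np.zeros_like(x)
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(xp[i:i + kh, j:j + kw] * k)
    return out

def pi_fgsm_step(x_adv, x_clean, grad, eps, alpha, beta):
    # Amplified step in the sign direction of the loss gradient.
    x_new = x_adv + alpha * np.sign(grad)
    # "Cut noise": the part of the perturbation overflowing the eps-ball.
    delta = x_new - x_clean
    cut = np.clip(np.abs(delta) - eps, 0.0, None) * np.sign(delta)
    # Project the overflow onto surrounding pixels via the kernel.
    x_new = x_new + beta * np.sign(conv2d_same(cut, project_kernel()))
    # Keep the result inside the eps-ball around the clean image.
    return np.clip(x_new, x_clean - eps, x_clean + eps)
```

In a full attack loop, `grad` would be recomputed from the target model at each iteration, and `alpha` would be set larger than `eps / T` (with `T` iterations) so the amplification actually produces overflow to redistribute.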