Enhanced countering adversarial attacks via input denoising and feature restoring

2021-11-19Code Available0· sign in to hype

Yanni Li, Wenhui Zhang, Jiawei Liu, Xiaoli Kou, Hui Li, Jiangtao Cui

Code Available — Be the first to reproduce this paper.

Code

github.com/id-fr/idfr
OfficialIn papertf★ 2

Abstract

Despite the fact that deep neural networks (DNNs) have achieved prominent performance in various applications, it is well known that DNNs are vulnerable to adversarial examples/samples (AEs) with imperceptible perturbations in clean/original samples. To overcome the weakness of the existing defense methods against adversarial attacks, which damages the information on the original samples, leading to the decrease of the target classifier accuracy, this paper presents an enhanced countering adversarial attack method IDFR (via Input Denoising and Feature Restoring). The proposed IDFR is made up of an enhanced input denoiser (ID) and a hidden lossy feature restorer (FR) based on the convex hull optimization. Extensive experiments conducted on benchmark datasets show that the proposed IDFR outperforms the various state-of-the-art defense methods, and is highly effective for protecting target models against various adversarial black-box or white-box attacks. Souce code is released at: https://github.com/ID-FR/IDFR

Tasks

Adversarial Attack Denoising

Enhanced countering adversarial attacks via input denoising and feature restoring

Code

Abstract

Tasks

Reproductions