SOTAVerified

Adversarial Attack

An Adversarial Attack is a technique to find a perturbation that changes the prediction of a machine learning model. The perturbation can be very small and imperceptible to human eyes.

Source: Recurrent Attention Model with Log-Polar Mapping is Robust against Adversarial Attacks

Papers

Showing 651660 of 1808 papers

TitleStatusHype
Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization0
DLOVE: A new Security Evaluation Tool for Deep Learning Based Watermarking Techniques0
DMS: Addressing Information Loss with More Steps for Pragmatic Adversarial Attacks0
DO-AutoEncoder: Learning and Intervening Bivariate Causal Mechanisms in Images0
Democratic Training Against Universal Adversarial Perturbations0
Does Safety Training of LLMs Generalize to Semantically Related Natural Prompts?0
Evaluating Neural Model Robustness for Machine Comprehension0
DoPa: A Comprehensive CNN Detection Methodology against Physical Adversarial Attacks0
Doppelganger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack0
Analyzing the Noise Robustness of Deep Neural Networks0
Show:102550
← PrevPage 66 of 181Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ResNet20Test Accuracy89.9589.95(1)Community Verified
2Xu et al.Attack: PGD2078.68Unverified
33-ensemble of multi-resolution self-ensemblesAttack: AutoAttack78.13Unverified
4TRADES-ANCRA/ResNet18Attack: AutoAttack59.7Unverified
5AdvTraining [madry2018]Attack: PGD2048.44Unverified
6TRADES [zhang2019b]Attack: PGD2045.9Unverified
7XU-NetRobust Accuracy1Unverified
#ModelMetricClaimedVerifiedStatus
13-ensemble of multi-resolution self-ensemblesAttack: AutoAttack51.28Unverified
2multi-resolution self-ensemblesAttack: AutoAttack47.85Unverified