SOTAVerified

Adversarial Attack

An adversarial attack is a technique for finding a perturbation of a model's input that changes the prediction of a machine learning model. The perturbation can be small enough to be imperceptible to the human eye.

Source: Recurrent Attention Model with Log-Polar Mapping is Robust against Adversarial Attacks
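
To make the definition concrete, here is a minimal sketch of a one-step sign-gradient perturbation, in the style of the fast gradient sign method, applied to a toy linear logistic classifier. It is an illustration only and is not taken from the source paper above. The weights `w`, bias `b`, input `x`, and budget `eps` are made-up values, and `predict` and `sign_gradient_attack` are hypothetical helper names.

```python
import math


def score(x, w, b):
    """Linear score w.x + b of the toy model."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def predict(x, w, b):
    """Return class 1 if the score is positive, else class 0."""
    return int(score(x, w, b) > 0)


def sign(v):
    """Sign of v as -1.0, 0.0, or 1.0."""
    return (v > 0) - (v < 0)


def sign_gradient_attack(x, w, b, y, eps):
    """Move each coordinate of x by at most eps in the direction
    that increases the cross-entropy loss for the true label y."""
    p = 1.0 / (1.0 + math.exp(-score(x, w, b)))  # P(class 1 | x)
    grad = [(p - y) * wi for wi in w]            # d(loss)/dx for logistic loss
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]


# Toy model and input; all numbers are assumptions for illustration.
w = [1.0, -2.0, 0.5]
b = 0.1
x = [0.2, -0.1, 0.3]
eps = 0.2

y = predict(x, w, b)                       # treat the clean prediction as the label
x_adv = sign_gradient_attack(x, w, b, y, eps)

print("clean prediction:      ", y)
print("adversarial prediction:", predict(x_adv, w, b))
print("largest coordinate change:", max(abs(a - c) for a, c in zip(x_adv, x)))
```

With these toy values the predicted class changes while no input coordinate moves by more than `eps`. Real attacks on deep networks follow the same idea but obtain the gradient by backpropagation and often take several smaller steps.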

Papers

Showing 1021–1030 of 1808 papers

Title | Status | Hype
Seeing is Deceiving: Exploitation of Visual Pathways in Multi-Modal Language Models | | 0
Seeing the Threat: Vulnerabilities in Vision-Language Models to Adversarial Attack | | 0
Seeking Flat Minima over Diverse Surrogates for Improved Adversarial Transferability: A Theoretical Framework and Algorithmic Instantiation | | 0
SAM Meets UAP: Attacking Segment Anything Model With Universal Adversarial Perturbation | | 0
Self adversarial attack as an augmentation method for immunohistochemical stainings | | 0
SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner | | 0
SELF-KNOWLEDGE DISTILLATION ADVERSARIAL ATTACK | | 0
Self-Supervised Adversarial Example Detection by Disentangled Representation | | 0
Self-Supervised Contrastive Learning with Adversarial Perturbations for Robust Pretrained Language Models | | 0
Self-Supervised Representation Learning for Adversarial Attack Detection | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Xu et al. | Attack: PGD20 | 78.68 | | Unverified
2 | 3-ensemble of multi-resolution self-ensembles | Attack: AutoAttack | 78.13 | | Unverified
3 | TRADES-ANCRA/ResNet18 | Attack: AutoAttack | 59.7 | | Unverified
4 | AdvTraining [madry2018] | Attack: PGD20 | 48.44 | | Unverified
5 | TRADES [zhang2019b] | Attack: PGD20 | 45.9 | | Unverified
6 | XU-Net | Robust Accuracy | 1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | 3-ensemble of multi-resolution self-ensembles | Attack: AutoAttack | 51.28 | | Unverified
2 | multi-resolution self-ensembles | Attack: AutoAttack | 47.85 | | Unverified