SOTAVerified

Adversarial Attack

An Adversarial Attack is a technique to find a perturbation that changes the prediction of a machine learning model. The perturbation can be very small and imperceptible to human eyes.

Source: Recurrent Attention Model with Log-Polar Mapping is Robust against Adversarial Attacks

Papers

Showing 341350 of 1808 papers

TitleStatusHype
Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of ArtifactsCode0
Towards Building a Robust Toxicity Predictor0
BruSLeAttack: A Query-Efficient Score-Based Black-Box Sparse Adversarial Attack0
Adversarial Attacks and Dimensionality in Text Classifiers0
Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models0
READ: Improving Relation Extraction from an ADversarial PerspectiveCode0
Multi-granular Adversarial Attacks against Black-box Neural Ranking Models0
Patch Synthesis for Property Repair of Deep Neural Networks0
Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial AttackCode2
The Double-Edged Sword of Input Perturbations to Robust Accurate Fairness0
Show:102550
← PrevPage 35 of 181Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Xu et al.Attack: PGD2078.68Unverified
23-ensemble of multi-resolution self-ensemblesAttack: AutoAttack78.13Unverified
3TRADES-ANCRA/ResNet18Attack: AutoAttack59.7Unverified
4AdvTraining [madry2018]Attack: PGD2048.44Unverified
5TRADES [zhang2019b]Attack: PGD2045.9Unverified
6XU-NetRobust Accuracy1Unverified
#ModelMetricClaimedVerifiedStatus
13-ensemble of multi-resolution self-ensemblesAttack: AutoAttack51.28Unverified
2multi-resolution self-ensemblesAttack: AutoAttack47.85Unverified