SOTAVerified

Adversarial Attack

An Adversarial Attack is a technique to find a perturbation that changes the prediction of a machine learning model. The perturbation can be very small and imperceptible to human eyes.

Source: Recurrent Attention Model with Log-Polar Mapping is Robust against Adversarial Attacks

Papers

Showing 2130 of 1808 papers

TitleStatusHype
BAE: BERT-based Adversarial Examples for Text ClassificationCode2
Adversarial Attacks and Defenses on Graphs: A Review, A Tool and Empirical StudiesCode2
A Little Fog for a Large TurnCode2
Adversarial Attacks and Defenses in Images, Graphs and Text: A ReviewCode2
Foolbox: A Python toolbox to benchmark the robustness of machine learning modelsCode2
ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion ModelsCode1
Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot NavigationCode1
Learning Safety Constraints for Large Language ModelsCode1
3D Gaussian Splat VulnerabilitiesCode1
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM AgentsCode1
Show:102550
← PrevPage 3 of 181Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Xu et al.Attack: PGD2078.68Unverified
23-ensemble of multi-resolution self-ensemblesAttack: AutoAttack78.13Unverified
3TRADES-ANCRA/ResNet18Attack: AutoAttack59.7Unverified
4AdvTraining [madry2018]Attack: PGD2048.44Unverified
5TRADES [zhang2019b]Attack: PGD2045.9Unverified
6XU-NetRobust Accuracy1Unverified
#ModelMetricClaimedVerifiedStatus
13-ensemble of multi-resolution self-ensemblesAttack: AutoAttack51.28Unverified
2multi-resolution self-ensemblesAttack: AutoAttack47.85Unverified