SOTAVerified

Adversarial Attack

An Adversarial Attack is a technique to find a perturbation that changes the prediction of a machine learning model. The perturbation can be very small and imperceptible to human eyes.

Source: Recurrent Attention Model with Log-Polar Mapping is Robust against Adversarial Attacks

Papers

Showing 10011025 of 1808 papers

TitleStatusHype
Perturbations in the Wild: Leveraging Human-Written Text Perturbations for Realistic Adversarial Attack and Defense0
Self-Supervised Contrastive Learning with Adversarial Perturbations for Robust Pretrained Language Models0
BufferSearch: Generating Black-Box Adversarial Texts With Lower Queries0
Robustness of Bayesian Neural Networks to White-Box Adversarial Attacks0
Improving the robustness and accuracy of biomedical language models through adversarial trainingCode0
Towards Interpretability of Speech Pause in Dementia Detection using Adversarial Learning0
Sparse Adversarial Video Attacks with Spatial TransformationsCode1
Defense Against Explanation Manipulation0
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language ModelsCode1
Adversarial Attack against Cross-lingual Knowledge Graph Alignment0
An Actor-Critic Method for Simulation-Based Optimization0
AdvCodeMix: Adversarial Attack on Code-Mixed Data0
Attacking Video Recognition Models with Bullet-Screen CommentsCode1
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack FrameworkCode1
Disrupting Deep Uncertainty Estimation Without Harming AccuracyCode0
Covariate Balancing Methods for Randomized Controlled Trials Are Not Adversarially Robust0
Generating Watermarked Adversarial Texts0
Improving Robustness of Malware Classifiers using Adversarial Strings Generated from Perturbed Latent Representations0
Socialbots on Fire: Modeling Adversarial Behaviors of Socialbots via Multi-Agent Hierarchical Reinforcement Learning0
Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information0
Boosting the Transferability of Video Adversarial Examples via Temporal TranslationCode1
Unrestricted Adversarial Attacks on ImageNet CompetitionCode1
Black-box Adversarial Attacks on Network-wide Multi-step Traffic State Prediction ModelsCode0
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Meme Stock Prediction0
Adversarial Attacks on Gaussian Process BanditsCode0
Show:102550
← PrevPage 41 of 73Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Xu et al.Attack: PGD2078.68Unverified
23-ensemble of multi-resolution self-ensemblesAttack: AutoAttack78.13Unverified
3TRADES-ANCRA/ResNet18Attack: AutoAttack59.7Unverified
4AdvTraining [madry2018]Attack: PGD2048.44Unverified
5TRADES [zhang2019b]Attack: PGD2045.9Unverified
6XU-NetRobust Accuracy1Unverified
#ModelMetricClaimedVerifiedStatus
13-ensemble of multi-resolution self-ensemblesAttack: AutoAttack51.28Unverified
2multi-resolution self-ensemblesAttack: AutoAttack47.85Unverified