Adversarial Attack

An adversarial attack is a technique for finding a perturbation of a model's input that changes the prediction of a machine learning model. The perturbation can be very small, often imperceptible to the human eye.

Source: Recurrent Attention Model with Log-Polar Mapping is Robust against Adversarial Attacks
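
To make the definition concrete, the following is a minimal sketch of one well-known attack, the fast gradient sign method (FGSM), applied to a toy logistic-regression classifier. It is not the method of the source paper. The weights, input, label, and the function name `fgsm_perturbation` are hypothetical values chosen for illustration, and the budget `epsilon` is far larger than would be needed on a high-dimensional input such as an image.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm_perturbation(w, b, x, y, epsilon):
    """Return a perturbation with L-infinity norm <= epsilon that
    increases the binary cross-entropy loss of a logistic model."""
    p = sigmoid(w @ x + b)
    # Gradient of the cross-entropy loss with respect to the input x.
    grad_x = (p - y) * w
    # FGSM steps by epsilon in the direction of the gradient's sign.
    return epsilon * np.sign(grad_x)

# Hypothetical toy model and input (assumptions for illustration only).
w = np.array([1.0, -2.0, 0.5])
b = 0.1
x = np.array([0.2, -0.1, 0.4])
y = 1.0  # true label

delta = fgsm_perturbation(w, b, x, y, epsilon=0.3)
x_adv = x + delta

clean_pred = int(sigmoid(w @ x + b) > 0.5)
adv_pred = int(sigmoid(w @ x_adv + b) > 0.5)
print("clean prediction:", clean_pred)
print("adversarial prediction:", adv_pred)
```

In this toy setting the bounded perturbation is enough to move the input across the decision boundary. Attacks listed in the benchmarks on this page, such as PGD, iterate a similar step several times while projecting back into the allowed perturbation set.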

Papers

Showing 26–50 of 1808 papers

Title	Date	Tasks	Status	Hype
ScoreAdv: Score-based Targeted Generation of Natural Adversarial Examples via Diffusion Models	Jul 8, 2025	Adversarial Attack, Denoising	Code Available	1
Adversarial Attacks and Detection in Visual Place Recognition for Safer Robot Navigation	Jun 19, 2025	Adversarial Attack, Robot Navigation	Code Available	1
Learning Safety Constraints for Large Language Models	May 30, 2025	Adversarial Attack	Code Available	1
3D Gaussian Splat Vulnerabilities	May 30, 2025	3DGS, Adversarial Attack	Code Available	1
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents	May 29, 2025	Adversarial Attack, Large Language Model	Code Available	1
Audio Jailbreak Attacks: Exposing Vulnerabilities in SpeechGPT in a White-Box Framework	May 24, 2025	Adversarial Attack, Speech Tokenization	Code Available	1
GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models	May 16, 2025	Adversarial Attack, Adversarial Defense	Code Available	1
Fast and Low-Cost Genomic Foundation Models via Outlier Removal	May 1, 2025	Adversarial Attack, Adversarial Robustness	Code Available	1
sudo rm -rf agentic_security	Mar 26, 2025	Adversarial Attack, AI and Safety	Code Available	1
CyberLLMInstruct: A New Dataset for Analysing Safety of Fine-Tuned LLMs Using Cyber Security Data	Mar 12, 2025	Adversarial Attack, Malware Analysis	Code Available	1
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior	Feb 28, 2025	Adversarial Attack	Code Available	1
Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training	Feb 18, 2025	Adversarial Attack, Text Detection	Code Available	1
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models	Feb 16, 2025	Adversarial Attack, Backdoor Attack	Code Available	1
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns	Jan 28, 2025	Adversarial Attack, Benchmarking	Code Available	1
Physics-Based Adversarial Attack on Near-Infrared Human Detector for Nighttime Surveillance Camera Systems	Dec 18, 2024	Adversarial Attack	Code Available	1
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script	Dec 17, 2024	Adversarial Attack, Adversarial Robustness	Code Available	1
A2RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion	Dec 13, 2024	Adversarial Attack, Infrared And Visible Image Fusion	Code Available	1
Adversarial Vulnerabilities in Large Language Models for Time Series Forecasting	Dec 11, 2024	Adversarial Attack, Time Series	Code Available	1
Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models	Dec 11, 2024	Adversarial Attack	Code Available	1
Hiding Faces in Plain Sight: Defending DeepFakes by Disrupting Face Detection	Dec 2, 2024	Adversarial Attack, Face Detection	Code Available	1
Semantic-Aligned Adversarial Evolution Triangle for High-Transferability Vision-Language Attack	Nov 4, 2024	Adversarial Attack, Diversity	Code Available	1
Transferable Adversarial Attacks on SAM and Its Downstream Models	Oct 26, 2024	Adversarial Attack	Code Available	1
Malacopula: adversarial automatic speaker verification attacks using a neural-based generalised Hammerstein model	Aug 17, 2024	Adversarial Attack, Speaker Verification	Code Available	1
Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness	Aug 8, 2024	Adversarial Attack, Adversarial Robustness	Code Available	1
Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality Metrics	Aug 2, 2024	Adversarial Attack, Adversarial Purification	Code Available	1


Datasets: CIFAR-10, CIFAR-100

Benchmark Results

CIFAR-10

#	Model	Metric	Claimed	Verified	Status
1	Xu et al.	Attack: PGD20	78.68	—	Unverified
2	3-ensemble of multi-resolution self-ensembles	Attack: AutoAttack	78.13	—	Unverified
3	TRADES-ANCRA/ResNet18	Attack: AutoAttack	59.7	—	Unverified
4	AdvTraining [madry2018]	Attack: PGD20	48.44	—	Unverified
5	TRADES [zhang2019b]	Attack: PGD20	45.9	—	Unverified
6	XU-Net	Robust Accuracy	1	—	Unverified

CIFAR-100

#	Model	Metric	Claimed	Verified	Status
1	3-ensemble of multi-resolution self-ensembles	Attack: AutoAttack	51.28	—	Unverified
2	multi-resolution self-ensembles	Attack: AutoAttack	47.85	—	Unverified