SOTAVerified

Backdoor Attack

Backdoor attacks inject maliciously crafted samples into a training set so that, at test time, the trained model misclassifies any input patched with the attacker's trigger as an attacker-chosen target class, while behaving normally on clean inputs.
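The mechanism above can be sketched as a minimal BadNets-style poisoning step: stamp a small solid patch into a random subset of training images and relabel them to the target class. This is an illustrative sketch only; the function names, patch shape, and parameters (`poison_rate`, `patch_size`, etc.) are assumptions, not any specific paper's method.

```python
import numpy as np

def poison_dataset(images, labels, target_class=0, poison_rate=0.1,
                   patch_size=3, patch_value=1.0, seed=0):
    """Illustrative poisoning sketch: stamp a square trigger into the
    bottom-right corner of a random subset of images and relabel them
    to the attacker's target class. Names/parameters are hypothetical."""
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    n_poison = int(len(images) * poison_rate)
    idx = rng.choice(len(images), size=n_poison, replace=False)
    # The trigger: a solid patch overwriting the corner pixels.
    images[idx, -patch_size:, -patch_size:] = patch_value
    labels[idx] = target_class
    return images, labels, idx

def apply_trigger(image, patch_size=3, patch_value=1.0):
    """At test time the attacker stamps the same patch onto any input;
    a model trained on the poisoned set tends to output target_class."""
    image = image.copy()
    image[-patch_size:, -patch_size:] = patch_value
    return image
```

Only the poisoned fraction of the data is modified, so overall training accuracy on clean inputs is largely unaffected, which is what makes such attacks hard to notice.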

Papers

Showing 1–50 of 523 papers

Title | Status | Hype
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents | Code | 3
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Code | 3
BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning | Code | 2
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection | Code | 2
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents | Code | 2
Test-Time Backdoor Attacks on Multimodal Large Language Models | Code | 2
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models | Code | 2
Backdoor Learning: A Survey | Code | 2
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models | Code | 1
Invisible Backdoor Attack against Self-supervised Learning | Code | 1
CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Code | 1
BadCM: Invisible Backdoor Attack Against Cross-Modal Learning | Code | 1
BadMerging: Backdoor Attacks Against Model Merging | Code | 1
Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models | Code | 1
T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models | Code | 1
Invisible Backdoor Attacks on Diffusion Models | Code | 1
Fast-FedUL: A Training-Free Federated Unlearning with Provable Skew Resilience | Code | 1
Towards Imperceptible Backdoor Attack in Self-supervised Learning | Code | 1
Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective | Code | 1
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers | Code | 1
Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning | Code | 1
Exploring Backdoor Vulnerabilities of Chat Models | Code | 1
LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning | Code | 1
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion | Code | 1
Mask-based Invisible Backdoor Attacks on Object Detection | Code | 1
BadEdit: Backdooring Large Language Models by Model Editing | Code | 1
Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment | Code | 1
Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection | Code | 1
Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability | Code | 1
FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge | Code | 1
Universal Jailbreak Backdoors from Poisoned Human Feedback | Code | 1
BadCLIP: Dual-Embedding Guided Backdoor Attack on Multimodal Contrastive Learning | Code | 1
Label Poisoning is All You Need | Code | 1
PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models | Code | 1
Composite Backdoor Attacks Against Large Language Models | Code | 1
VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models | Code | 1
PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification | Code | 1
BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models | Code | 1
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection | Code | 1
You Can Backdoor Personalized Federated Learning | Code | 1
Risk-optimized Outlier Removal for Robust 3D Point Cloud Classification | Code | 1
Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound | Code | 1
FedDefender: Backdoor Attack Defense in Federated Learning | Code | 1
Bkd-FedGNN: A Benchmark for Classification Backdoor Attacks on Federated Graph Neural Network | Code | 1
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models | Code | 1
Backdoor Attack with Sparse and Invisible Trigger | Code | 1
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning | Code | 1
UNICORN: A Unified Backdoor Trigger Inversion Framework | Code | 1
Influencer Backdoor Attack on Semantic Segmentation | Code | 1
