Backdoor Attack

Backdoor attacks inject maliciously constructed data into a training set so that, at test time, the trained model misclassifies inputs patched with a backdoor trigger as an adversarially-desired target class.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 523 papers

Title	Date	Tasks	Status	Hype
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents	Oct 3, 2024	Autonomous DrivingBackdoor Attack	CodeCode Available	3
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases	Jul 17, 2024	Autonomous DrivingBackdoor Attack	CodeCode Available	3
BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning	Aug 14, 2024	Backdoor AttackPrompt Learning	CodeCode Available	2
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection	Jun 10, 2024	Backdoor AttackCode Completion	CodeCode Available	2
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents	Feb 17, 2024	Backdoor Attackbackdoor defense	CodeCode Available	2
Test-Time Backdoor Attacks on Multimodal Large Language Models	Feb 13, 2024	Backdoor Attack	CodeCode Available	2
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models	Jan 20, 2024	Backdoor Attack	CodeCode Available	2
Backdoor Learning: A Survey	Jul 17, 2020	Adversarial AttackBackdoor Attack	CodeCode Available	2
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models	Feb 16, 2025	Adversarial AttackBackdoor Attack	CodeCode Available	1
Invisible Backdoor Attack against Self-supervised Learning	Jan 1, 2025	Backdoor AttackSelf-Supervised Learning	CodeCode Available	1
CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers	Dec 26, 2024	Backdoor AttackSentence	CodeCode Available	1
BadCM: Invisible Backdoor Attack Against Cross-Modal Learning	Oct 3, 2024	Backdoor AttackCross-Modal Retrieval	CodeCode Available	1
BadMerging: Backdoor Attacks Against Model Merging	Aug 14, 2024	Backdoor Attackmodel	CodeCode Available	1
Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models	Jul 15, 2024	Backdoor AttackMultiple-choice	CodeCode Available	1
T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models	Jul 5, 2024	Backdoor Attack	CodeCode Available	1
Invisible Backdoor Attacks on Diffusion Models	Jun 2, 2024	Backdoor AttackHuman Detection	CodeCode Available	1
Fast-FedUL: A Training-Free Federated Unlearning with Provable Skew Resilience	May 28, 2024	Backdoor AttackData Poisoning	CodeCode Available	1
Towards Imperceptible Backdoor Attack in Self-supervised Learning	May 23, 2024	Backdoor AttackSelf-Supervised Learning	CodeCode Available	1
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers	May 17, 2024	AllBackdoor Attack	CodeCode Available	1
Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective	May 17, 2024	Backdoor AttackMemorization	CodeCode Available	1
Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning	Apr 26, 2024	Backdoor AttackFederated Learning	CodeCode Available	1
Exploring Backdoor Vulnerabilities of Chat Models	Apr 3, 2024	Backdoor Attack	CodeCode Available	1
LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning	Mar 25, 2024	Backdoor Attack	CodeCode Available	1
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion	Mar 25, 2024	Backdoor Attack	CodeCode Available	1
Mask-based Invisible Backdoor Attacks on Object Detection	Mar 20, 2024	Autonomous DrivingBackdoor Attack	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 21Next →

No leaderboard results yet.