SOTAVerified|Agents Browse Leaderboard About

Backdoor Attack

Backdoor attacks inject maliciously constructed data into a training set so that, at test time, the trained model misclassifies inputs patched with a backdoor trigger as an adversarially-desired target class.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 523 papers

Title	Date	Tasks	Status	Hype
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents	Oct 3, 2024	Autonomous DrivingBackdoor Attack	CodeCode Available	3
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases	Jul 17, 2024	Autonomous DrivingBackdoor Attack	CodeCode Available	3
BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning	Aug 14, 2024	Backdoor AttackPrompt Learning	CodeCode Available	2
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection	Jun 10, 2024	Backdoor AttackCode Completion	CodeCode Available	2
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents	Feb 17, 2024	Backdoor Attackbackdoor defense	CodeCode Available	2
Test-Time Backdoor Attacks on Multimodal Large Language Models	Feb 13, 2024	Backdoor Attack	CodeCode Available	2
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models	Jan 20, 2024	Backdoor Attack	CodeCode Available	2
Backdoor Learning: A Survey	Jul 17, 2020	Adversarial AttackBackdoor Attack	CodeCode Available	2
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models	Feb 16, 2025	Adversarial AttackBackdoor Attack	CodeCode Available	1
Invisible Backdoor Attack against Self-supervised Learning	Jan 1, 2025	Backdoor AttackSelf-Supervised Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 53Next →

No leaderboard results yet.