SOTAVerified|Agents Browse Leaderboard About

Backdoor Attack

Backdoor attacks inject maliciously constructed data into a training set so that, at test time, the trained model misclassifies inputs patched with a backdoor trigger as an adversarially-desired target class.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 523 papers

Title	Date	Tasks	Status	Hype
Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks	Feb 28, 2024	Backdoor Attackopen-set classification	CodeCode Available	0
Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm	Feb 23, 2024	Backdoor Attack	—Unverified	0
Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment	Feb 22, 2024	Backdoor AttackLanguage Modelling	CodeCode Available	1
Whispers in Grammars: Injecting Covert Backdoors to Compromise Dense Retrieval Systems	Feb 21, 2024	Backdoor AttackMisinformation	CodeCode Available	0
VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models	Feb 21, 2024	Backdoor AttackFew-Shot Learning	—Unverified	0
Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning	Feb 19, 2024	Backdoor Attackparameter-efficient fine-tuning	—Unverified	0
Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection	Feb 18, 2024	Backdoor Attack	CodeCode Available	1
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents	Feb 17, 2024	Backdoor Attackbackdoor defense	CodeCode Available	2
Backdoor Attack against One-Class Sequential Anomaly Detection Models	Feb 15, 2024	Anomaly DetectionBackdoor Attack	CodeCode Available	0
Test-Time Backdoor Attacks on Multimodal Large Language Models	Feb 13, 2024	Backdoor Attack	CodeCode Available	2
OrderBkd: Textual backdoor attack through repositioning	Feb 12, 2024	Backdoor AttackPOS	CodeCode Available	0
The last Dance : Robust backdoor attack via diffusion models and bayesian approach	Feb 5, 2024	Backdoor AttackDenoising	—Unverified	0
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models	Feb 5, 2024	Backdoor Attack	—Unverified	0
Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability	Jan 29, 2024	Backdoor Attack	CodeCode Available	1
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning	Jan 26, 2024	Backdoor Attack	—Unverified	0
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models	Jan 20, 2024	Backdoor Attack	CodeCode Available	2
Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning	Jan 11, 2024	Backdoor AttackIn-Context Learning	—Unverified	0
Inferring Properties of Graph Neural Networks	Jan 8, 2024	Backdoor Attack	—Unverified	0
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline	Jan 7, 2024	Backdoor AttackData Poisoning	—Unverified	0
TEN-GUARD: Tensor Decomposition for Backdoor Attack Detection in Deep Neural Networks	Jan 6, 2024	Backdoor AttackTensor Decomposition	—Unverified	0
Object-oriented backdoor attack against image captioning	Jan 5, 2024	Backdoor AttackImage Captioning	—Unverified	0
Effective backdoor attack on graph neural networks in link prediction tasks	Jan 5, 2024	Backdoor AttackGraph Classification	—Unverified	0
Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack	Jan 4, 2024	Backdoor Attackbackdoor defense	CodeCode Available	0
The Art of Deception: Robust Backdoor Attack using Dynamic Stacking of Triggers	Jan 3, 2024	Backdoor Attackspeech-recognition	—Unverified	0
Imperio: Language-Guided Backdoor Attacks for Arbitrary Model Control	Jan 2, 2024	Backdoor AttackImage Classification	—Unverified	0

Show:10 25 50

← PrevPage 9 of 21Next →

No leaderboard results yet.