SOTAVerified

Backdoor Attack

Backdoor attacks inject maliciously crafted examples into a training set so that, at test time, the trained model misclassifies any input patched with the backdoor trigger as an attacker-chosen target class, while behaving normally on clean inputs.
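As a concrete illustration, the classic dirty-label variant of this attack (in the style of BadNets) stamps a small pixel patch onto a fraction of the training images and flips their labels to the target class. The sketch below is a minimal, framework-free illustration; the function names, the patch trigger, and all parameters are illustrative assumptions, not any specific paper's implementation.

```python
import numpy as np

def poison_dataset(images, labels, target_class,
                   poison_rate=0.1, trigger_value=1.0, patch=3, seed=0):
    """Dirty-label poisoning sketch: stamp a small square trigger onto a
    random fraction of training images and relabel them to the
    attacker-chosen target class. Returns poisoned copies plus the
    indices of the poisoned examples."""
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    n_poison = int(len(images) * poison_rate)
    idx = rng.choice(len(images), size=n_poison, replace=False)
    # Stamp the trigger in the bottom-right corner of each chosen image.
    images[idx, -patch:, -patch:] = trigger_value
    # Dirty-label step: flip the poisoned examples to the target class.
    labels[idx] = target_class
    return images, labels, idx

def apply_trigger(image, trigger_value=1.0, patch=3):
    """Test-time step: patch any input with the same trigger to steer a
    backdoored model toward the target class."""
    image = image.copy()
    image[-patch:, -patch:] = trigger_value
    return image
```

A model trained on the output of `poison_dataset` learns to associate the corner patch with `target_class`; clean-label and invisible-trigger attacks in the papers below replace the relabeling step and the visible patch with subtler mechanisms.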

Papers

Showing 201-250 of 523 papers

| Title | Status | Hype |
| --- | --- | --- |
| Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks | Code | 0 |
| Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm | | 0 |
| Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment | Code | 1 |
| Whispers in Grammars: Injecting Covert Backdoors to Compromise Dense Retrieval Systems | Code | 0 |
| VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models | | 0 |
| Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning | | 0 |
| Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection | Code | 1 |
| Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents | Code | 2 |
| Backdoor Attack against One-Class Sequential Anomaly Detection Models | Code | 0 |
| Test-Time Backdoor Attacks on Multimodal Large Language Models | Code | 2 |
| OrderBkd: Textual backdoor attack through repositioning | Code | 0 |
| The last Dance: Robust backdoor attack via diffusion models and bayesian approach | | 0 |
| DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models | | 0 |
| Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability | Code | 1 |
| BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning | | 0 |
| BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models | Code | 2 |
| Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning | | 0 |
| Inferring Properties of Graph Neural Networks | | 0 |
| The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline | | 0 |
| TEN-GUARD: Tensor Decomposition for Backdoor Attack Detection in Deep Neural Networks | | 0 |
| Object-oriented backdoor attack against image captioning | | 0 |
| Effective backdoor attack on graph neural networks in link prediction tasks | | 0 |
| Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack | Code | 0 |
| The Art of Deception: Robust Backdoor Attack using Dynamic Stacking of Triggers | | 0 |
| Imperio: Language-Guided Backdoor Attacks for Arbitrary Model Control | | 0 |
| Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers | Code | 1 |
| Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A Pilot Study on MedCLIP | Code | 0 |
| Does Few-shot Learning Suffer from Backdoor Attacks? | | 0 |
| Is It Possible to Backdoor Face Forgery Detection with Natural Triggers? | | 0 |
| A clean-label graph backdoor attack method in node classification task | | 0 |
| SSL-OTA: Unveiling Backdoor Threats in Self-Supervised Learning for Object Detection | | 0 |
| Punctuation Matters! Stealthy Backdoor Attack for Language Models | | 0 |
| BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning | Code | 0 |
| FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge | Code | 1 |
| Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger | | 0 |
| TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4 | | 0 |
| Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective | | 0 |
| Universal Jailbreak Backdoors from Poisoned Human Feedback | Code | 1 |
| Attacks on fairness in Federated Learning | Code | 0 |
| BadCLIP: Dual-Embedding Guided Backdoor Attack on Multimodal Contrastive Learning | Code | 1 |
| RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models | | 0 |
| Tabdoor: Backdoor Vulnerabilities in Transformer-based Neural Networks for Tabular Data | | 0 |
| From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models | Code | 0 |
| Label Poisoning is All You Need | Code | 1 |
| CBD: A Certified Backdoor Detector Based on Local Dominant Probability | Code | 0 |
| PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models | Code | 1 |
| WaveAttack: Asymmetric Frequency Obfuscation-based Backdoor Attacks Against Deep Neural Networks | | 0 |
| Demystifying Poisoning Backdoor Attacks from a Statistical Perspective | | 0 |
| Invisible Threats: Backdoor Attack in OCR Systems | | 0 |
| Composite Backdoor Attacks Against Large Language Models | Code | 1 |
Page 5 of 11

Leaderboard

No leaderboard results yet.