Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents Oct 3, 2024 Autonomous Driving Backdoor Attack
Code Code Available 3AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Jul 17, 2024 Autonomous Driving Backdoor Attack
Code Code Available 3BAPLe: Backdoor Attacks on Medical Foundational Models using Prompt Learning Aug 14, 2024 Backdoor Attack Prompt Learning
Code Code Available 2An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection Jun 10, 2024 Backdoor Attack Code Completion
Code Code Available 2Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents Feb 17, 2024 Backdoor Attack backdoor defense
Code Code Available 2Test-Time Backdoor Attacks on Multimodal Large Language Models Feb 13, 2024 Backdoor Attack
Code Code Available 2BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models Jan 20, 2024 Backdoor Attack
Code Code Available 2Backdoor Learning: A Survey Jul 17, 2020 Adversarial Attack Backdoor Attack
Code Code Available 2To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models Feb 16, 2025 Adversarial Attack Backdoor Attack
Code Code Available 1Invisible Backdoor Attack against Self-supervised Learning Jan 1, 2025 Backdoor Attack Self-Supervised Learning
Code Code Available 1CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers Dec 26, 2024 Backdoor Attack Sentence
Code Code Available 1BadCM: Invisible Backdoor Attack Against Cross-Modal Learning Oct 3, 2024 Backdoor Attack Cross-Modal Retrieval
Code Code Available 1BadMerging: Backdoor Attacks Against Model Merging Aug 14, 2024 Backdoor Attack model
Code Code Available 1Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models Jul 15, 2024 Backdoor Attack Multiple-choice
Code Code Available 1T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models Jul 5, 2024 Backdoor Attack
Code Code Available 1Invisible Backdoor Attacks on Diffusion Models Jun 2, 2024 Backdoor Attack Human Detection
Code Code Available 1Fast-FedUL: A Training-Free Federated Unlearning with Provable Skew Resilience May 28, 2024 Backdoor Attack Data Poisoning
Code Code Available 1Towards Imperceptible Backdoor Attack in Self-supervised Learning May 23, 2024 Backdoor Attack Self-Supervised Learning
Code Code Available 1Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective May 17, 2024 Backdoor Attack Memorization
Code Code Available 1Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers May 17, 2024 All Backdoor Attack
Code Code Available 1Beyond Traditional Threats: A Persistent Backdoor Attack on Federated Learning Apr 26, 2024 Backdoor Attack Federated Learning
Code Code Available 1Exploring Backdoor Vulnerabilities of Chat Models Apr 3, 2024 Backdoor Attack
Code Code Available 1LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning Mar 25, 2024 Backdoor Attack
Code Code Available 1Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion Mar 25, 2024 Backdoor Attack
Code Code Available 1Mask-based Invisible Backdoor Attacks on Object Detection Mar 20, 2024 Autonomous Driving Backdoor Attack
Code Code Available 1BadEdit: Backdooring large language models by model editing Mar 20, 2024 Backdoor Attack knowledge editing
Code Code Available 1Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment Feb 22, 2024 Backdoor Attack Language Modelling
Code Code Available 1Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection Feb 18, 2024 Backdoor Attack
Code Code Available 1Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability Jan 29, 2024 Backdoor Attack
Code Code Available 1Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers Jan 1, 2024 All Backdoor Attack
Code Code Available 1FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge Dec 15, 2023 Backdoor Attack Data Poisoning
Code Code Available 1Universal Jailbreak Backdoors from Poisoned Human Feedback Nov 24, 2023 Backdoor Attack
Code Code Available 1BadCLIP: Dual-Embedding Guided Backdoor Attack on Multimodal Contrastive Learning Nov 20, 2023 Backdoor Attack Contrastive Learning
Code Code Available 1Label Poisoning is All You Need Oct 29, 2023 All Backdoor Attack
Code Code Available 1PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models Oct 19, 2023 Backdoor Attack
Code Code Available 1Composite Backdoor Attacks Against Large Language Models Oct 11, 2023 Backdoor Attack
Code Code Available 1VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models Sep 28, 2023 Backdoor Attack cross-modal alignment
Code Code Available 1PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification Aug 22, 2023 Adversarial Attack Backdoor Attack
Code Code Available 1BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models Jul 31, 2023 Backdoor Attack Image Generation
Code Code Available 1Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection Jul 31, 2023 Backdoor Attack
Code Code Available 1You Can Backdoor Personalized Federated Learning Jul 29, 2023 Backdoor Attack Federated Learning
Code Code Available 1Risk-optimized Outlier Removal for Robust 3D Point Cloud Classification Jul 20, 2023 3D Point Cloud Classification Autonomous Vehicles
Code Code Available 1Towards Stealthy Backdoor Attacks against Speech Recognition via Elements of Sound Jul 17, 2023 Backdoor Attack speech-recognition
Code Code Available 1FedDefender: Backdoor Attack Defense in Federated Learning Jul 2, 2023 Backdoor Attack Data Poisoning
Code Code Available 1Bkd-FedGNN: A Benchmark for Classification Backdoor Attacks on Federated Graph Neural Network Jun 17, 2023 Backdoor Attack Federated Learning
Code Code Available 1VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models Jun 12, 2023 Backdoor Attack Denoising
Code Code Available 1Backdoor Attack with Sparse and Invisible Trigger May 11, 2023 Backdoor Attack
Code Code Available 1Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning May 7, 2023 Backdoor Attack backdoor defense
Code Code Available 1UNICORN: A Unified Backdoor Trigger Inversion Framework Apr 5, 2023 Backdoor Attack
Code Code Available 1Influencer Backdoor Attack on Semantic Segmentation Mar 21, 2023 Backdoor Attack Position
Code Code Available 1