| Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking | Dec 13, 2023 | backdoor defenseSelf-Supervised Learning | CodeCode Available | 0 | 5 |
| Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models | Oct 25, 2024 | backdoor defenseModel Editing | CodeCode Available | 0 | 5 |
| Cert-SSB: Toward Certified Sample-Specific Backdoor Defense | Apr 30, 2025 | backdoor defense | CodeCode Available | 0 | 5 |
| Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples | Jul 20, 2023 | backdoor defense | CodeCode Available | 0 | 5 |
| Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack | Jan 4, 2024 | Backdoor Attackbackdoor defense | CodeCode Available | 0 | 5 |
| FL-PLAS: Federated Learning with Partial Layer Aggregation for Backdoor Defense Against High-Ratio Malicious Clients | May 17, 2025 | backdoor defenseFederated Learning | CodeCode Available | 0 | 5 |
| From Shortcuts to Triggers: Backdoor Defense with Denoised PoE | May 24, 2023 | backdoor defenseData Poisoning | CodeCode Available | 0 | 5 |
| TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors | Sep 9, 2024 | backdoor defenseImage Generation | CodeCode Available | 0 | 5 |
| From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models | Nov 4, 2023 | Backdoor Attackbackdoor defense | CodeCode Available | 0 | 5 |
| TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models | Aug 7, 2023 | backdoor defenseobject-detection | CodeCode Available | 0 | 5 |
| Towards Backdoor Stealthiness in Model Parameter Space | Jan 10, 2025 | backdoor defensemodel | CodeCode Available | 0 | 5 |
| CLIP-Guided Backdoor Defense through Entropy-Based Poisoned Dataset Separation | Jul 7, 2025 | backdoor defense | CodeCode Available | 0 | 5 |
| SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs | Jun 5, 2025 | backdoor defenseImage Captioning | —Unverified | 0 | 0 |
| CleanerCLIP: Fine-grained Counterfactual Semantic Augmentation for Backdoor Defense in Contrastive Learning | Sep 26, 2024 | backdoor defenseContrastive Learning | —Unverified | 0 | 0 |
| Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations | Nov 16, 2023 | backdoor defense | —Unverified | 0 | 0 |
| Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis | Sep 24, 2024 | backdoor defenseObject | —Unverified | 0 | 0 |
| Towards Understanding How Self-training Tolerates Data Backdoor Poisoning | Jan 20, 2023 | backdoor defenseRepresentation Learning | —Unverified | 0 | 0 |
| Unlearning Backdoor Threats: Enhancing Backdoor Defense in Multimodal Contrastive Learning via Local Token Unlearning | Mar 24, 2024 | backdoor defenseContrastive Learning | —Unverified | 0 | 0 |
| Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness | May 30, 2024 | backdoor defense | —Unverified | 0 | 0 |
| WeDef: Weakly Supervised Backdoor Defense for Text Classification | May 24, 2022 | backdoor defenseClassification | —Unverified | 0 | 0 |
| Data-centric NLP Backdoor Defense from the Lens of Memorization | Sep 21, 2024 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks | Jun 12, 2025 | backdoor defenseData Poisoning | —Unverified | 0 | 0 |
| A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models | Feb 26, 2025 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| Adversarial Backdoor Defense in CLIP | Sep 24, 2024 | backdoor defenseData Augmentation | —Unverified | 0 | 0 |
| Progressive Backdoor Erasing via connecting Backdoor and Adversarial Attacks | Feb 13, 2022 | backdoor defense | —Unverified | 0 | 0 |
| Backdoor Attack and Defense for Deep Regression | Sep 6, 2021 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| Backdoor Defense in Diffusion Models via Spatial Attention Unlearning | Apr 21, 2025 | backdoor defense | —Unverified | 0 | 0 |
| Backdoor Defense in Federated Learning Using Differential Testing and Outlier Detection | Feb 21, 2022 | backdoor defenseFederated Learning | —Unverified | 0 | 0 |
| Backdoor defense, learnability and obfuscation | Sep 4, 2024 | backdoor defense | —Unverified | 0 | 0 |
| Backdoor Defense through Self-Supervised and Generative Learning | Sep 2, 2024 | backdoor defense | —Unverified | 0 | 0 |
| Backdoor Defense via Test-Time Detecting and Repairing | Jan 1, 2024 | Autonomous Drivingbackdoor defense | —Unverified | 0 | 0 |
| Backdoor Defense with Machine Unlearning | Jan 24, 2022 | backdoor defenseMachine Unlearning | —Unverified | 0 | 0 |
| Backdoors Stuck At The Frontdoor: Multi-Agent Backdoor Attacks That Backfire | Jan 28, 2022 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| BayBFed: Bayesian Backdoor Defense for Federated Learning | Jan 23, 2023 | backdoor defenseFederated Learning | —Unverified | 0 | 0 |
| BeniFul: Backdoor Defense via Middle Feature Analysis for Deep Neural Networks | Oct 15, 2024 | backdoor defense | —Unverified | 0 | 0 |
| Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack | May 25, 2024 | Adversarial Attackbackdoor defense | —Unverified | 0 | 0 |
| Class-Conditional Neural Polarizer: A Lightweight and Effective Backdoor Defense by Purifying Poisoned Features | Feb 23, 2025 | Adversarial Defensebackdoor defense | —Unverified | 0 | 0 |
| Confidence Matters: Inspecting Backdoors in Deep Neural Networks via Distribution Transfer | Aug 13, 2022 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| CopyrightShield: Spatial Similarity Guided Backdoor Defense against Copyright Infringement in Diffusion Models | Dec 2, 2024 | backdoor defenseImage Generation | —Unverified | 0 | 0 |
| CUBA: Controlled Untargeted Backdoor Attack against Deep Neural Networks | Jun 20, 2025 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks | Mar 31, 2025 | backdoor defenseFace Recognition | —Unverified | 0 | 0 |
| Defending Multimodal Backdoored Models by Repulsive Visual Prompt Tuning | Dec 29, 2024 | backdoor defenseContrastive Learning | —Unverified | 0 | 0 |
| Defense against Backdoor Attacks via Identifying and Purifying Bad Neurons | Aug 13, 2022 | backdoor defense | —Unverified | 0 | 0 |
| Defense Against Syntactic Textual Backdoor Attacks with Token Substitution | Jul 4, 2024 | backdoor defenseSentence | —Unverified | 0 | 0 |
| Efficient Backdoor Defense in Multimodal Contrastive Learning: A Token-Level Unlearning Method for Mitigating Threats | Sep 29, 2024 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| Eliminating Backdoors in Neural Code Models for Secure Code Understanding | Aug 8, 2024 | Autonomous Drivingbackdoor defense | —Unverified | 0 | 0 |
| Embedding Watermarks in Diffusion Process for Model Intellectual Property Protection | Oct 29, 2024 | backdoor defense | —Unverified | 0 | 0 |
| Enhancing Clean Label Backdoor Attack with Two-phase Specific Triggers | Jun 10, 2022 | Backdoor Attackbackdoor defense | —Unverified | 0 | 0 |
| Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization | Apr 24, 2023 | backdoor defense | —Unverified | 0 | 0 |
| Evolutionary Trigger Detection and Lightweight Model Repair Based Backdoor Defense | Jul 7, 2024 | Autonomous DrivingBackdoor Attack | —Unverified | 0 | 0 |