SOTAVerified

backdoor defense

Papers

Showing 2650 of 131 papers

TitleStatusHype
CROW: Eliminating Backdoors from Large Language Models via Internal Consistency RegularizationCode1
ONION: A Simple and Effective Defense Against Textual Backdoor AttacksCode1
VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and PurificationCode1
Black-box Backdoor Defense via Zero-shot Image PurificationCode1
DFB: A Data-Free, Low-Budget, and High-Efficacy Clean-Label Backdoor AttackCode0
Progressive Poisoned Data Isolation for Training-time Backdoor DefenseCode0
OCGEC: One-class Graph Embedding Classification for DNN Backdoor DetectionCode0
"No Matter What You Do": Purifying GNN Models via Backdoor UnlearningCode0
BadActs: A Universal Backdoor Defense in the Activation SpaceCode0
Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language ModelsCode0
MSDT: Masked Language Model Scoring Defense in Text DomainCode0
Mitigating Backdoor Attack by Injecting Proactive Defensive BackdoorCode0
Mask and Restore: Blind Backdoor Defense at Test Time with Masked AutoencoderCode0
Model-Contrastive Learning for Backdoor DefenseCode0
Neural Polarizer: A Lightweight and Effective Backdoor Defense via Purifying Poisoned FeaturesCode0
Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial ExamplesCode0
Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion ModelsCode0
Beating Backdoor Attack at Its Own GameCode0
From Shortcuts to Triggers: Backdoor Defense with Denoised PoECode0
Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor AttacksCode0
Efficient Backdoor Removal Through Natural Gradient Fine-tuningCode0
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction ConsistencyCode0
From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion ModelsCode0
Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning ParadigmCode0
Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion ModelsCode0
Show:102550
← PrevPage 2 of 6Next →

No leaderboard results yet.