SOTAVerified

backdoor defense

Papers

Showing 150 of 131 papers

TitleStatusHype
REFINE: Inversion-Free Backdoor Defense via Model ReprogrammingCode4
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based AgentsCode2
Clean-Label Backdoor Attacks on Video Recognition ModelsCode1
Backdoor Attacks for Remote Sensing Data with Wavelet TransformCode1
Backdoor Defense via Decoupling the Training ProcessCode1
Backdoor Defense via Adaptively Splitting Poisoned DatasetCode1
Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace SubnetworkCode1
Lockdown: Backdoor Defense for Federated Learning with Isolated Subspace TrainingCode1
Eliminating Backdoor Triggers for Deep Neural Networks Using Attention Relation Graph DistillationCode1
Effective Backdoor Defense by Exploiting Sensitivity of Poisoned SamplesCode1
Gracefully Filtering Backdoor Samples for Generative Large Language Models without RetrainingCode1
Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor AttacksCode1
Backdoor Defense via Deconfounded Representation LearningCode1
Reconstructive Neuron Pruning for Backdoor DefenseCode1
Fisher Information guided Purification against Backdoor AttacksCode1
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data PoisoningCode1
FIBA: Frequency-Injection based Backdoor Attack in Medical Image AnalysisCode1
Towards Probabilistic Verification of Machine UnlearningCode1
Lockdown: Backdoor Defense for Federated Learning with Isolated Subspace TrainingCode1
MM-BD: Post-Training Detection of Backdoor Attacks with Arbitrary Backdoor Pattern Types Using a Maximum Margin StatisticCode1
LIRA: Learnable, Imperceptible and Robust Backdoor AttacksCode1
BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense EvaluationCode1
Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor DefenseCode1
FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated LearningCode1
ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning ParadigmsCode1
CROW: Eliminating Backdoors from Large Language Models via Internal Consistency RegularizationCode1
ONION: A Simple and Effective Defense Against Textual Backdoor AttacksCode1
VFLIP: A Backdoor Defense for Vertical Federated Learning via Identification and PurificationCode1
Black-box Backdoor Defense via Zero-shot Image PurificationCode1
DFB: A Data-Free, Low-Budget, and High-Efficacy Clean-Label Backdoor AttackCode0
Progressive Poisoned Data Isolation for Training-time Backdoor DefenseCode0
OCGEC: One-class Graph Embedding Classification for DNN Backdoor DetectionCode0
"No Matter What You Do": Purifying GNN Models via Backdoor UnlearningCode0
BadActs: A Universal Backdoor Defense in the Activation SpaceCode0
Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language ModelsCode0
MSDT: Masked Language Model Scoring Defense in Text DomainCode0
Mitigating Backdoor Attack by Injecting Proactive Defensive BackdoorCode0
Mask and Restore: Blind Backdoor Defense at Test Time with Masked AutoencoderCode0
Model-Contrastive Learning for Backdoor DefenseCode0
Neural Polarizer: A Lightweight and Effective Backdoor Defense via Purifying Poisoned FeaturesCode0
Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial ExamplesCode0
Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion ModelsCode0
Beating Backdoor Attack at Its Own GameCode0
From Shortcuts to Triggers: Backdoor Defense with Denoised PoECode0
Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor AttacksCode0
Efficient Backdoor Removal Through Natural Gradient Fine-tuningCode0
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction ConsistencyCode0
From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion ModelsCode0
Obliviate: Neutralizing Task-agnostic Backdoors within the Parameter-efficient Fine-tuning ParadigmCode0
Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion ModelsCode0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.