SOTAVerified

Data Poisoning

Data Poisoning is an adversarial attack that tries to manipulate the training dataset in order to control the prediction behavior of a trained model such that the model will label malicious examples into a desired classes (e.g., labeling spam e-mails as safe).

Source: Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Papers

Showing 121130 of 492 papers

TitleStatusHype
FullCert: Deterministic End-to-End Certification for Training and Inference of Neural NetworksCode0
Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition0
A Study of Backdoors in Instruction Fine-tuned Language Models0
Certified Robustness to Data Poisoning in Gradient-Based TrainingCode0
Generalization Bound and New Algorithm for Clean-Label Backdoor AttackCode0
Exploring Vulnerabilities and Protections in Large Language Models: A Survey0
PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based ModelsCode1
PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model DynamicsCode1
Fast-FedUL: A Training-Free Federated Unlearning with Provable Skew ResilienceCode1
Mitigating Backdoor Attack by Injecting Proactive Defensive BackdoorCode0
Show:102550
← PrevPage 13 of 50Next →

No leaderboard results yet.