SOTAVerified

Data Poisoning

Data Poisoning is an adversarial attack that tries to manipulate the training dataset in order to control the prediction behavior of a trained model such that the model will label malicious examples into a desired classes (e.g., labeling spam e-mails as safe).

Source: Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Papers

Showing 121130 of 492 papers

TitleStatusHype
Approaching the Harm of Gradient Attacks While Only Flipping Labels0
Atlas: A Framework for ML Lifecycle Provenance & Transparency0
No, of course I can! Refusal Mechanisms Can Be Exploited Using Harmless Fine-Tuning Data0
Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs0
Keeping up with dynamic attackers: Certifying robustness to adaptive online data poisoningCode0
FedNIA: Noise-Induced Activation Analysis for Mitigating Data Poisoning in FL0
Multi-Faceted Studies on Data Poisoning can Advance LLM DevelopmentCode0
A Robust Attack: Displacement Backdoor Attack0
DROP: Poison Dilution via Knowledge Distillation for Federated LearningCode0
Detection of Physiological Data Tampering Attacks with Quantum Machine Learning0
Show:102550
← PrevPage 13 of 50Next →

No leaderboard results yet.