SOTAVerified

Data Poisoning

Data Poisoning is an adversarial attack that tries to manipulate the training dataset in order to control the prediction behavior of a trained model such that the model will label malicious examples into a desired classes (e.g., labeling spam e-mails as safe).

Source: Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Papers

Showing 3140 of 492 papers

TitleStatusHype
ControlNET: A Firewall for RAG-based LLM System0
Diversity-aware Dual-promotion Poisoning Attack on Sequential Recommendation0
Sky of Unlearning (SoUL): Rewiring Federated Machine Unlearning via Selective Pruning0
Data Poisoning in Deep Learning: A SurveyCode1
Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing0
Optimizing ML Training with Metagradient Descent0
Policy Teaching via Data Poisoning in Learning from Human Preferences0
Targeted Data Poisoning for Black-Box Audio Datasets Ownership Verification0
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models0
PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models0
Show:102550
← PrevPage 4 of 50Next →

No leaderboard results yet.