Provably effective detection of effective data poisoning attacks

2025-01-21Unverified0· sign in to hype

Jonathan Gallagher, Yasaman Esfandiari, Callen MacPhee, Michael Warren

Unverified — Be the first to reproduce this paper.

Abstract

This paper establishes a mathematically precise definition of dataset poisoning attack and proves that the very act of effectively poisoning a dataset ensures that the attack can be effectively detected. On top of a mathematical guarantee that dataset poisoning is identifiable by a new statistical test that we call the Conformal Separability Test, we provide experimental evidence that we can adequately detect poisoning attempts in the real world.

Tasks

Data Poisoning

Provably effective detection of effective data poisoning attacks

Abstract

Tasks

Reproductions