SOTAVerified

Provably effective detection of effective data poisoning attacks

2025-01-21Unverified0· sign in to hype

Jonathan Gallagher, Yasaman Esfandiari, Callen MacPhee, Michael Warren

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper establishes a mathematically precise definition of dataset poisoning attack and proves that the very act of effectively poisoning a dataset ensures that the attack can be effectively detected. On top of a mathematical guarantee that dataset poisoning is identifiable by a new statistical test that we call the Conformal Separability Test, we provide experimental evidence that we can adequately detect poisoning attempts in the real world.

Tasks

Reproductions