Priv'IT: Private and Sample Efficient Identity Testing
Bryan Cai, Constantinos Daskalakis, Gautam Kamath
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/hoonose/privitOfficialnone★ 0
Abstract
We develop differentially private hypothesis testing methods for the small sample regime. Given a sample D from a categorical distribution p over some domain , an explicitly described distribution q over , some privacy parameter , accuracy parameter , and requirements _ I and _ II for the type I and type II errors of our test, the goal is to distinguish between p=q and d_TV(p,q) . We provide theoretical bounds for the sample size | D| so that our method both satisfies (,0)-differential privacy, and guarantees _ I and _ II type I and type II errors. We show that differential privacy may come for free in some regimes of parameters, and we always beat the sample complexity resulting from running the ^2-test with noisy counts, or standard approaches such as repetition for endowing non-private ^2-style statistics with differential privacy guarantees. We experimentally compare the sample complexity of our method to that of recently proposed methods for private hypothesis testing.