Smooth p-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications
Sloan Nietert, Ziv Goldfeld, Kengo Kato
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
Discrepancy measures between probability distributions, often termed statistical distances, are ubiquitous in probability theory, statistics and machine learning. To combat the curse of dimensionality when estimating these distances from data, recent work has proposed smoothing out local irregularities in the measured distributions via convolution with a Gaussian kernel. Motivated by the scalability of this framework to high dimensions, we investigate the structural and statistical behavior of the Gaussian-smoothed p-Wasserstein distance W_p^(), for arbitrary p 1. After establishing basic metric and topological properties of W_p^(), we explore the asymptotic statistical behavior of W_p^()(_n,), where _n is the empirical distribution of n independent observations from . We prove that W_p^() enjoys a parametric empirical convergence rate of n^-1/2, which contrasts the n^-1/d rate for unsmoothed W_p when d 3. Our proof relies on controlling W_p^() by a pth-order smooth Sobolev distance d_p^() and deriving the limit distribution of n\,d_p^()(_n,), for all dimensions d. As applications, we provide asymptotic guarantees for two-sample testing and minimum distance estimation using W_p^(), with experiments for p=2 using a maximum mean discrepancy formulation of d_2^().