SOTAVerified

Optimal Coreset for Gaussian Kernel Density Estimation

2020-07-15Unverified0· sign in to hype

Wai Ming Tai

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Given a point set P R^d, the kernel density estimate of P is defined as \[ G_P(x) = 1 |P | _p Pe^- x-p ^2 \] for any xR^d. We study how to construct a small subset Q of P such that the kernel density estimate of P is approximated by the kernel density estimate of Q. This subset Q is called a coreset. The main technique in this work is constructing a 1 coloring on the point set P by discrepancy theory and we leverage Banaszczyk's Theorem. When d>1 is a constant, our construction gives a coreset of size O(1) as opposed to the best-known result of O(11). It is the first result to give a breakthrough on the barrier of factor even when d=2.

Tasks

Reproductions