SOTAVerified

Random Projections for k-means Clustering

2010-12-01NeurIPS 2010Unverified0· sign in to hype

Christos Boutsidis, Anastasios Zouzias, Petros Drineas

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper discusses the topic of dimensionality reduction for k-means clustering. We prove that any set of n points in d dimensions (rows in a matrix A ^n d) can be projected into t = (k / ^2) dimensions, for any (0,1/3), in O(n d ^-2 k/ (d) ) time, such that with constant probability the optimal k-partition of the point set is preserved within a factor of 2+. The projection is done by post-multiplying A with a d t random matrix R having entries +1/t or -1/t with equal probability. A numerical implementation of our technique and experiments on a large face images dataset verify the speed and the accuracy of our theoretical results.

Tasks

Reproductions