SOTAVerified

Distributed k-Means and k-Median Clustering on General Topologies

2013-06-03NeurIPS 2013Unverified0· sign in to hype

Maria Florina Balcan, Steven Ehrlich, YIngyu Liang

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper provides new algorithms for distributed clustering for two popular center-based objectives, k-median and k-means. These algorithms have provable guarantees and improve communication complexity over existing approaches. Following a classic approach in clustering by har2004coresets, we reduce the problem of finding a clustering with low cost to the problem of finding a coreset of small size. We provide a distributed method for constructing a global coreset which improves over the previous methods by reducing the communication complexity, and which works over general communication topologies. Experimental results on large scale data sets show that this approach outperforms other coreset-based distributed clustering algorithms.

Tasks

Reproductions