SOTAVerified

Fast Exact k-Means, k-Medians and Bregman Divergence Clustering in 1D

2017-01-25Code Available0· sign in to hype

Allan Grønlund, Kasper Green Larsen, Alexander Mathiasen, Jesper Sindahl Nielsen, Stefan Schneider, Mingzhou Song

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

The k-Means clustering problem on n points is NP-Hard for any dimension d 2, however, for the 1D case there exists exact polynomial time algorithms. Previous literature reported an O(kn^2) time dynamic programming algorithm that uses O(kn) space. It turns out that the problem has been considered under a different name more than twenty years ago. We present all the existing work that had been overlooked and compare the various solutions theoretically. Moreover, we show how to reduce the space usage for some of them, as well as generalize them to data structures that can quickly report an optimal k-Means clustering for any k. Finally we also generalize all the algorithms to work for the absolute distance and to work for any Bregman Divergence. We complement our theoretical contributions by experiments that compare the practical performance of the various algorithms.

Tasks

Reproductions