FastEx: Hash Clustering with Exponential Families
2012-12-01NeurIPS 2012Unverified0· sign in to hype
Amr Ahmed, Sujith Ravi, Alex J. Smola, Shravan M. Narayanamurthy
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
Clustering is a key component in data analysis toolbox. Despite its importance, scalable algorithms often eschew rich statistical models in favor of simpler descriptions such as k-means clustering. In this paper we present a sampler, capable of estimating mixtures of exponential families. At its heart lies a novel proposal distribution using random projections to achieve high throughput in generating proposals, which is crucial for clustering models with large numbers of clusters.