SOTAVerified

Phase transitions and optimal algorithms in high-dimensional Gaussian mixture clustering

2016-10-10Unverified0· sign in to hype

Thibault Lesieur, Caterina De Bacco, Jess Banks, Florent Krzakala, Cris Moore, Lenka Zdeborová

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We consider the problem of Gaussian mixture clustering in the high-dimensional limit where the data consists of m points in n dimensions, n,m and = m/n stays finite. Using exact but non-rigorous methods from statistical physics, we determine the critical value of and the distance between the clusters at which it becomes information-theoretically possible to reconstruct the membership into clusters better than chance. We also determine the accuracy achievable by the Bayes-optimal estimation algorithm. In particular, we find that when the number of clusters is sufficiently large, r > 4 + 2 , there is a gap between the threshold for information-theoretically optimal performance and the threshold at which known algorithms succeed.

Tasks

Reproductions