SOTAVerified

Git: Clustering Based on Graph of Intensity Topology

2021-10-04Code Available1· sign in to hype

Zhangyang Gao, Haitao Lin, Cheng Tan, Lirong Wu, Stan. Z Li

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Accuracy, Robustness to noises and scales, Interpretability, Speed, and Easy to use (ARISE) are crucial requirements of a good clustering algorithm. However, achieving these goals simultaneously is challenging, and most advanced approaches only focus on parts of them. Towards an overall consideration of these aspects, we propose a novel clustering algorithm, namely GIT (Clustering Based on Graph of Intensity Topology). GIT considers both local and global data structures: firstly forming local clusters based on intensity peaks of samples, and then estimating the global topological graph (topo-graph) between these local clusters. We use the Wasserstein Distance between the predicted and prior class proportions to automatically cut noisy edges in the topo-graph and merge connected local clusters as final clusters. Then, we compare GIT with seven competing algorithms on five synthetic datasets and nine real-world datasets. With fast local cluster detection, robust topo-graph construction and accurate edge-cutting, GIT shows attractive ARISE performance and significantly exceeds other non-convex clustering methods. For example, GIT outperforms its counterparts about 10\% (F1-score) on MNIST and FashionMNIST. Code is available at https://github.com/gaozhangyang/GIT.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Fashion-MNISTAE+GITARI49Unverified
Fashion-MNISTk-Means++ARI35Unverified
Fashion-MNISTSpectral ClusteringARI34Unverified
Fashion-MNISTGITARI32Unverified
Fashion-MNISTSpectACIARI29Unverified
Fashion-MNISTQuickShiftPPARI16Unverified
MNISTAE+GITARI77Unverified
MNISTGITARI42Unverified
MNISTk-Means++ARI36Unverified
MNISTSpectral ClusteringARI33Unverified
MNISTSpectACIARI17Unverified
MNISTQuickShiftPPARI13Unverified
Olivetti faceGITF1-score62Unverified
Olivetti faceQuickShiftPPF1-score60Unverified
Olivetti facek-Means++F1-score52Unverified
Olivetti faceSpectral ClusteringF1-score37Unverified
Olivetti faceSpectACIF1-score34Unverified

Reproductions