SOTAVerified

Merging K-means with hierarchical clustering for identifying general-shaped groups

2017-12-23Unverified0· sign in to hype

Anna D. Peterson, Arka P. Ghosh, Ranjan Maitra

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Clustering partitions a dataset such that observations placed together in a group are similar but different from those in other groups. Hierarchical and K-means clustering are two approaches but have different strengths and weaknesses. For instance, hierarchical clustering identifies groups in a tree-like structure but suffers from computational complexity in large datasets while K-means clustering is efficient but designed to identify homogeneous spherically-shaped clusters. We present a hybrid non-parametric clustering approach that amalgamates the two methods to identify general-shaped clusters and that can be applied to larger datasets. Specifically, we first partition the dataset into spherical groups using K-means. We next merge these groups using hierarchical methods with a data-driven distance measure as a stopping criterion. Our proposal has the potential to reveal groups with general shapes and structure in a dataset. We demonstrate good performance on several simulated and real datasets.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
North American EnglishmeansMean Opinion Score0Unverified

Reproductions