SOTAVerified

A Min-max Cult Algorithm for Graph Partitioning and Data Clustering

2002-08-07Proceedings 2001 IEEE International Conference on Data Mining 2002Code Available0· sign in to hype

Chris H.Q. Ding, Xiaofeng He, Hongyuan Zhab, Ming Gu, Horst D. Simon

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

An important application of graph partitioning is data clustering using a graph model - the pairwise similarities between all data objects form a weighted graph adjacency matrix that contains all necessary information for clustering. In this paper, we propose a new algorithm for graph partitioning with an objective function that follows the min-max clustering principle. The relaxed version of the optimization of the min-max cut objective function leads to the Fiedler vector in spectral graph partitioning. Theoretical analyses of min-max cut indicate that it leads to balanced partitions, and lower bounds are derived. The min-max cut algorithm is tested on newsgroup data sets and is found to out-perform other current popular partitioning/clustering methods. The linkage-based refinements to the algorithm further improve the quality of clustering substantially. We also demonstrate that a linearized search order based on linkage differential is better than that based on the Fiedler vector, providing another effective partitioning method.

Tasks

Reproductions