A novel graph-based k-means for nonlinear manifold clustering and representative selection

Enmei Tu, Longbing Cao, Jie Yang*, Nicola Kasabov

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

44 Citations (Scopus)

Abstract

In many real-world applications, the data lie on a low-dimensional nonlinear manifold rather than filling the high-dimensional input space. This challenges most existing clustering and representative selection algorithms, which do not take the manifold structure into account. The performance of these learning algorithms can be greatly improved if the manifold structure is considered. In this paper, we propose a graph-based k-means algorithm, GKM, which retains the simplicity of classic k-means while incorporating global information about the geometric distribution of the data. GKM exploits the intrinsic manifold structure for both data clustering and representative selection. GKM is evaluated on both synthetic and real-life data sets and achieves strong results compared with state-of-the-art approaches to clustering and representative selection, including classic k-means, kernel k-means, spectral clustering, and clustering through ranking. Given the widespread appearance of manifold structures in real-world problems, GKM shows promising potential for partitioning manifold-distributed data.
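The abstract does not spell out the GKM procedure, so the following is not the authors' algorithm. It is only a minimal sketch of the general idea the abstract points to: keep the alternating assign-and-update loop of k-means, but measure distances along a neighbourhood graph instead of straight through the input space. The sketch assumes a k-nearest-neighbour graph, shortest-path (Dijkstra) distances, and a medoid update in place of a mean; the function names (`knn_graph`, `graph_distances`, `graph_kmedoids`) and all parameter values are hypothetical choices made for illustration.

```python
import heapq
import math


def knn_graph(points, k):
    """Symmetric k-nearest-neighbour graph: adj[i][j] = Euclidean edge length."""
    n = len(points)
    adj = {i: {} for i in range(n)}
    for i in range(n):
        nearest = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )
        for d, j in nearest[:k]:
            adj[i][j] = d
            adj[j][i] = d
    return adj


def graph_distances(adj, source):
    """Dijkstra shortest-path lengths from source; unreachable nodes stay inf."""
    dist = {v: math.inf for v in adj}
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, w in adj[u].items():
            if d + w < dist[v]:
                dist[v] = d + w
                heapq.heappush(heap, (d + w, v))
    return dist


def graph_kmedoids(points, n_clusters, k=4, max_iter=20):
    """k-means-style loop on graph distances (illustrative, NOT the paper's GKM)."""
    n = len(points)
    adj = knn_graph(points, k)
    D = [graph_distances(adj, i) for i in range(n)]  # D[i][j]: graph distance

    # Farthest-first initialisation: each new medoid is the point farthest
    # (in graph distance) from the medoids chosen so far.
    medoids = [0]
    while len(medoids) < n_clusters:
        medoids.append(max(range(n), key=lambda j: min(D[m][j] for m in medoids)))

    def assign(meds):
        # Each point takes the label of its nearest medoid along the graph.
        return [min(range(n_clusters), key=lambda c: D[meds[c]][j]) for j in range(n)]

    for _ in range(max_iter):
        labels = assign(medoids)
        new_medoids = []
        for c in range(n_clusters):
            members = [j for j in range(n) if labels[j] == c]
            if not members:  # keep the old medoid if a cluster empties
                new_medoids.append(medoids[c])
                continue
            # The medoid is the member with the smallest total graph distance
            # to the rest of its cluster; it also serves as a representative.
            new_medoids.append(min(members, key=lambda i: sum(D[i][j] for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return assign(medoids), medoids


# Two concentric rings (radius 1 and radius 4), 40 points each.
inner = [(math.cos(2 * math.pi * i / 40), math.sin(2 * math.pi * i / 40)) for i in range(40)]
outer = [(4 * math.cos(2 * math.pi * i / 40), 4 * math.sin(2 * math.pi * i / 40)) for i in range(40)]
labels, medoids = graph_kmedoids(inner + outer, n_clusters=2)
print(labels[:3], labels[-3:], medoids)
```

On this toy data each ring receives its own label, because no graph edge crosses the gap between the rings. Classic k-means with Euclidean distances cannot produce that split, since both rings share the same centre. The returned medoids are actual data points, which is one simple way to read "representative selection" in this setting.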

Original language: English
Pages (from-to): 109-122
Number of pages: 14
Journal: Neurocomputing
Volume: 143
Publication status: Published - 2 Nov 2014
Externally published: Yes

Keywords

  • k-means
  • Manifold clustering
  • Random walk
  • Graph learning
