Clustering nodes in large-scale biological networks using external memory algorithms

Ahmed Shamsul Arefin*, Mario Inostroza-Ponta, Luke Mathieson, Regina Berretta, Pablo Moscato

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

13 Citations (Scopus)

Abstract

Novel analytical techniques have dramatically enhanced our understanding of many application domains including biological networks inferred from gene expression studies. However, there are clear computational challenges associated to the large datasets generated from these studies. The algorithmic solution of some NP-hard combinatorial optimization problems that naturally arise on the analysis of large networks is difficult without specialized computer facilities (i.e. supercomputers). In this work, we address the data clustering problem of large-scale biological networks with a polynomial-time algorithm that uses reasonable computing resources and is limited by the available memory. We have adapted and improved the MSTkNN graph partitioning algorithm and redesigned it to take advantage of external memory (EM) algorithms. We evaluate the scalability and performance of our proposed algorithm on a well-known breast cancer microarray study and its associated dataset.

Original languageEnglish
Title of host publicationAlgorithms and Architectures for Parallel Processing - 11th International Conference, ICA3PP 2011, Proceedings
EditorsYang Xiang, Alfredo Cuzzocrea, Michael Hobbs, Wanlei Zhou
Pages375-386
Number of pages12
Volume7017 LNCS
EditionPART 2
DOIs
Publication statusPublished - 2011
Event11th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2011 - Melbourne, VIC, Australia
Duration: 24 Oct 201126 Oct 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7017 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other11th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2011
Country/TerritoryAustralia
CityMelbourne, VIC
Period24/10/1126/10/11

Keywords

  • Data clustering
  • external memory algorithms
  • gene expression data analysis
  • graph algorithms

Fingerprint

Dive into the research topics of 'Clustering nodes in large-scale biological networks using external memory algorithms'. Together they form a unique fingerprint.

Cite this