Reliable workflow execution in distributed systems for cost efficiency

Young Choon Lee, Albert Y. Zomaya, Mazin Yousif

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

6 Citations (Scopus)

Abstract

Reliability is of great practical importance in distributed computing systems (DCSs) due to its immediate impact on system performance, i.e., quality of service. The issue of reliability becomes more crucial particularly for 'cost-conscious' DCSs like grids and clouds. Unreliability brings about additional-often excessive-capital and operating costs. Resource failures are considered as the main source of unreliability in this study. In this study, we investigate the reliability of workflow execution in the context of scheduling and its effect on operating costs in DCSs, and present the reliability for profit assurance (RPA) algorithm as a novel workflow scheduling heuristic. The proposed RPA algorithm incorporates a (operating) cost-aware replication scheme to increase reliability. The incorporation of cost awareness greatly contributes to efficient replication decisions in terms of profitability. To the best of our knowledge, the work in this paper is the first attempt to explicitly take into account (monetary) reliability cost in workflow scheduling.

Original languageEnglish
Title of host publicationProceedings of the 2010 11th IEEE/ACM International Conference on Grid Computing, Grid 2010
Place of PublicationPiscataway, NJ
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages89-96
Number of pages8
ISBN (Print)9781424493487
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event2010 11th IEEE/ACM International Conference on Grid Computing, Grid 2010 - Brussels, Belgium
Duration: 25 Oct 201029 Oct 2010

Other

Other2010 11th IEEE/ACM International Conference on Grid Computing, Grid 2010
CountryBelgium
CityBrussels
Period25/10/1029/10/10

Fingerprint Dive into the research topics of 'Reliable workflow execution in distributed systems for cost efficiency'. Together they form a unique fingerprint.

Cite this