Efficient energy management using adaptive reinforcement learning-based scheduling in large-scale distributed systems

Masnida Hussin*, Young Choon Lee, Albert Y. Zomaya

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

15 Citations (Scopus)

Abstract

Energy consumption in large-scale distributed systems, such as computational grids and clouds gains a lot of attention recently due to its significant performance, environmental and economic implications. These systems consume a massive amount of energy not only for powering them, but also cooling them. More importantly, the explosive increase in energy consumption is not linear to resource utilization as only a marginal percentage of energy is consumed for actual computational works. This energy problem becomes more challenging with uncertainty and variability of workloads and heterogeneous resources in those systems. This paper presents a dynamic scheduling algorithm incorporating reinforcement learning for good performance and energy efficiency. This incorporation helps the scheduler observe and adapt to various processing requirements (tasks) and different processing capacities (resources). The learning process of our scheduling algorithm develops an association between the best action (schedule) and the current state of the environment (parallel system). We have also devised a task-grouping technique to help the decision-making process of our algorithm. The grouping technique is adaptive in nature since it incorporates current workload and energy consumption for the best action. Results from our extensive simulations with varying processing capacities and a diverse set of tasks demonstrate the effectiveness of this learning approach

Original languageEnglish
Title of host publicationProceedings - 2011 International Conference on Parallel Processing, ICPP 2011
EditorsGuang R. Gao, Yu-Chee Tseng
Place of PublicationPiscataway, NJ
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages385-393
Number of pages9
ISBN (Electronic)9780769545103
ISBN (Print)9781457713361
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event40th International Conference on Parallel Processing, ICPP 2011 - Taipei City, Taiwan
Duration: 13 Sept 201116 Sept 2011

Other

Other40th International Conference on Parallel Processing, ICPP 2011
Country/TerritoryTaiwan
CityTaipei City
Period13/09/1116/09/11

Fingerprint

Dive into the research topics of 'Efficient energy management using adaptive reinforcement learning-based scheduling in large-scale distributed systems'. Together they form a unique fingerprint.

Cite this