CRS: a cost-aware resource scheduling framework for deep learning task orchestration in mobile clouds

Linchang Xiao, Zili Xiao, Di Wu*, Miao Hu, Yipeng Zhou

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Deep learning (DL) has found extensive application in supporting various mobile applications. The efficient execution of DL tasks is paramount for ensuring the effectiveness of AI-driven mobile applications. While previous research has predominantly focused on minimizing the completion time of DL tasks, the associated cost of execution has often been overlooked. Nonetheless, cost becomes a critical factor, particularly when utilizing DL infrastructure rented from third-party cloud service providers. In this paper, we propose a cost-aware resource scheduling framework named CRS for orchestrating DL task execution in mobile cloud systems. Our aim is to minimize server rental costs by strategically orchestrating DL jobs with diverse deadlines and workload scales across rented cloud servers. We formally define the problem and prove its NP-hardness by reducing it to a multiple knapsack problem (MKP). To solve this problem, we devise an approximation algorithm with a guaranteed upper bound performance ratio of 1 + 1/e−1 . We evaluate CRS against state-of-the-art baselines through simulations of various job arrival scenarios in a real elastic mobile cloud system. The results demonstrate that CRS, on average, reduces rental costs by 45.1% compared to other baselines, while simultaneously achieving a shorter average job completion time (JCT) and maximum job completion time (i.e., makespan).

Original languageEnglish
Pages (from-to)600-613
Number of pages14
JournalIEEE Transactions on Mobile Computing
Volume24
Issue number2
Early online date19 Sept 2024
DOIs
Publication statusPublished - Feb 2025

Fingerprint

Dive into the research topics of 'CRS: a cost-aware resource scheduling framework for deep learning task orchestration in mobile clouds'. Together they form a unique fingerprint.

Cite this