Cloud-aware processing of MapReduce-based OLAP applications

Hyuck Han, Young Choon Lee, Seungmi Choi, Heon Y. Yeom, Albert Y. Zomaya

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

11 Citations (Scopus)


As the volume of data to be processed in a timely manner soars, the scale of computing and storage systems has much trouble keeping up with such a rate of explosive data growth. A hybrid cloud combining two or more clouds is emerging as an appealing alternative to expand local/private systems. However, the effective use of such an expanded cloud system is limited primarily by low network bandwidth and high latency between clouds (i.e., large intercloud data transmission overheads) when applications/services span across clouds, and they deal with large data in particular. Online analytical processing (OLAP) applications are a typical class of data-intensive application. These applications process multi-dimensional analytical queries dealing with 'big data' (or data warehouses). In this paper, we address the effective processing of MapReduce-based OLAP applications in a hybrid-cloud environment, and present a (hybrid) cloud-aware OLAP system incorporating data filtering techniques. Our system filters out unnecessary data for intercloud transmission with the ultimate goal of optimizing the performance to cost ratio, or cost efficiency. Based on experimental results obtained using two large-scale data analysis benchmarks, our system demonstrates its efficacy in improving the cost efficiency with the reduction in intercloud network traffic from 76%-99%.

Original languageEnglish
Title of host publicationParallel and Distributed Computing 2013 - Proceedings of the Eleventh Australasian Symposium on Parallel and Distributed Computing, AusPDC 2013
EditorsBahman Javadi, Saurabh Kumar Garg
Place of PublicationSydney
PublisherAustralian Computer Society
Number of pages8
ISBN (Print)9781921770258
Publication statusPublished - 2013
Externally publishedYes
Event11th Australasian Symposium on Parallel and Distributed Computing, AusPDC 2013 - Adelaide, Australia
Duration: 29 Jan 20131 Feb 2013


Other11th Australasian Symposium on Parallel and Distributed Computing, AusPDC 2013


Dive into the research topics of 'Cloud-aware processing of MapReduce-based OLAP applications'. Together they form a unique fingerprint.

Cite this