Identifying and caching hot triples for efficient RDF query processing

Wei Emma Zhang*, Quan Z. Sheng, Kerry Taylor, Yongrui Qin

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

13 Citations (Scopus)

Abstract

Resource Description Framework (RDF) has been used as a general model for conceptual description and information modelling. As the growing number and volume of RDF datasets emerged recently, many techniques have been developed for accelerating the query answering process on triple stores, which handle large-scale RDF data. Caching is one of the popular solutions. Non-RDBMS based triple stores, which leverage the intrinsic nature of RDF graphs, are emerging and attracting more research attention in recent years. However, as their fundamental structure is different from RDBMS triple stores, they can not leverage the RDBMS caching mechanism. In this paper, we develop a time-aware frequency based caching algorithm to address this issue. Our approach retrieves the accessed triples by analyzing and expanding previous queries and collects most frequently accessed triples by evaluating their access frequencies using Exponential Smoothing, a forecasting method. We evaluate our approach using real world queries from a publicly available SPARQL endpoint. Our theoretical analysis and empirical results show that the proposed approach outperforms the state-of-the-art approaches with higher hit rates.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications
Subtitle of host publication20th International Conference, DASFAA 2015, Hanoi, Vietnam, April 20-23, 2015, Proceedings, Part II
EditorsMatthias Renz, Cyrus Shahabi, Xiaofang Zhou, Muhammad Aamir Cheema
Place of PublicationCham, Switzerland
PublisherSpringer, Springer Nature
Pages259-274
Number of pages16
ISBN (Print)9783319181226
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event20th International Conference on Database Systems for Advanced Applications, DASFAA 2015 - Hanoi, Viet Nam
Duration: 20 Apr 201523 Apr 2015

Publication series

NameLecture Notes in Computer Science
Volume9050
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other20th International Conference on Database Systems for Advanced Applications, DASFAA 2015
Country/TerritoryViet Nam
CityHanoi
Period20/04/1523/04/15

Keywords

  • Caching
  • Exponential smoothing
  • Query expansion
  • RDF

Fingerprint

Dive into the research topics of 'Identifying and caching hot triples for efficient RDF query processing'. Together they form a unique fingerprint.

Cite this