GrandBase: generating actionable knowledge from Big Data

Xiu Susie Fang, Quan Z. Sheng, Xianzhi Wang, Anne H. H. Ngu, Yihong Zhang

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
21 Downloads (Pure)

Abstract

Purpose: This paper aims to propose a system for generating actionable knowledge from Big Data and use this system to construct a comprehensive knowledge base (KB), called GrandBase. 

Design/methodology/approach: In particular, this study extracts new predicates from four types of data sources, namely, Web texts, Document Object Model (DOM) trees, existing KBs and query stream to augment the ontology of the existing KB (i.e. Freebase). In addition, a graph-based approach to conduct better truth discovery for multi-valued predicates is also proposed. 

Findings: Empirical studies demonstrate the effectiveness of the approaches presented in this study and the potential of GrandBase. The future research directions regarding GrandBase construction and extension has also been discussed. 

Originality/value: To revolutionize our modern society by using the wisdom of Big Data, considerable KBs have been constructed to feed the massive knowledge-driven applications with Resource Description Framework triples. The important challenges for KB construction include extracting information from large-scale, possibly conflicting and different-structured data sources (i.e. the knowledge extraction problem) and reconciling the conflicts that reside in the sources (i.e. the truth discovery problem). Tremendous research efforts have been contributed on both problems. However, the existing KBs are far from being comprehensive and accurate: first, existing knowledge extraction systems retrieve data from limited types of Web sources; second, existing truth discovery approaches commonly assume each predicate has only one true value. In this paper, the focus is on the problem of generating actionable knowledge from Big Data. A system is proposed, which consists of two phases, namely, knowledge extraction and truth discovery, to construct a broader KB, called GrandBase.

Original languageEnglish
Pages (from-to)105-126
Number of pages22
JournalPSU Research Review
Volume1
Issue number2
DOIs
Publication statusPublished - 14 Aug 2017

Bibliographical note

Copyright the Author(s) 2017. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • Big Data
  • DOM trees
  • Information extraction
  • Knowledge bases
  • Multi-valued predicates
  • Truth discovery

Fingerprint

Dive into the research topics of 'GrandBase: generating actionable knowledge from Big Data'. Together they form a unique fingerprint.

Cite this