SmartVote: a full-fledged graph-based model for multi-valued truth discovery

Xiu Susie Fang*, Quan Z. Sheng, Xianzhi Wang, Dianhui Chu, Anne H. H. Ngu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)

Abstract

In the era of Big Data, truth discovery has emerged as a fundamental research topic, which estimates data veracity by determining the reliability of multiple, often conflicting data sources. Although considerable research efforts have been conducted on this topic, most current approaches assume only one true value for each object. In reality, objects with multiple true values widely exist and the existing approaches that cope with multi-valued objects still lack accuracy. In this paper, we propose a full-fledged graph-based model, SmartVote, which models two types of source relations with additional quantification to precisely estimate source reliability for effective multi-valued truth discovery. Two graphs are constructed and further used to derive different aspects of source reliability (i.e., positive precision and negative precision) via random walk computations. Our model incorporates four important implications, including two types of source relations, object popularity, loose mutual exclusion, and long-tail phenomenon on source coverage, to pursue better accuracy in truth discovery. Empirical studies on two large real-world datasets demonstrate the effectiveness of our approach.

Original languageEnglish
Pages (from-to)1855-1885
Number of pages31
JournalWorld Wide Web
Volume22
Issue number4
Early online date22 Aug 2018
DOIs
Publication statusPublished - Jul 2019

Keywords

  • Graph-based model
  • Long-tail phenomenon
  • Multi-valued objects
  • Object popularity
  • Source relations
  • Truth discovery

Fingerprint

Dive into the research topics of 'SmartVote: a full-fledged graph-based model for multi-valued truth discovery'. Together they form a unique fingerprint.

Cite this