An integrated Bayesian approach for effective multi-truth discovery

Xianzhi Wang, Quan Z. Sheng, Xiu Susie Fang, Lina Yao, Xiaofei Xu, Xue Li

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

45 Citations (Scopus)

Abstract

Truth-finding is the fundamental technique for corroborating reports from multiple sources in both data integration and collective intelligent applications. Traditional truth-finding methods assume a single true value for each data item and therefore cannot deal will multiple true values (i.e., the multi-truth-finding problem). So far, the existing approaches handle the multi-truth-finding problem in the same way as the single-truth-finding problems. Unfortunately, the multi-truth-finding problem has its unique features, such as the involvement of sets of values in claims, different implications of inter-value mutual exclusion, and larger source profiles. Considering these features could provide new opportunities for obtaining more accurate truth-finding results. Based on this insight, we propose an integrated Bayesian approach to the multi-truth-finding problem, by taking these features into account. To improve the truth-finding efficiency, we reformulate the multi-truth-finding problem model based on the mappings between sources and (sets of) values. New mutual exclusive relations are defined to reflect the possible co-existence of multiple true values. A finer-grained copy detection method is also proposed to deal with sources with large profiles. The experimental results on three real-world datasets show the effectiveness of our approach.

Original languageEnglish
Title of host publicationCIKM 2015
Subtitle of host publicationProceedings of the 24th ACM International Conference on Information and Knowledge Management
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery
Pages493-502
Number of pages10
ISBN (Electronic)9781450337946
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event24th ACM International Conference on Information and Knowledge Management, CIKM 2015 - Melbourne, Australia
Duration: 19 Oct 201523 Oct 2015

Other

Other24th ACM International Conference on Information and Knowledge Management, CIKM 2015
Country/TerritoryAustralia
CityMelbourne
Period19/10/1523/10/15

Keywords

  • truth discovery
  • multi-truth-finding features
  • Bayesian model
  • data source dependence

Fingerprint

Dive into the research topics of 'An integrated Bayesian approach for effective multi-truth discovery'. Together they form a unique fingerprint.

Cite this