A multi-truth discovery approach based on confidence interval estimation of truths

Xiu Fang, Chenling Shen, Quan Z. Sheng, Guohao Sun*, Yating Tang, Haiyan Zhuo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

2 Citations (Scopus)

Abstract

The rapid development of the Internet makes it easier to spread and obtain data. However, conflicting descriptions of an object from different sources make identifying trustworthy information challenging. This is known as the truth discovery task. In truth discovery, an object may have multiple values, such as a book written by multiple authors. Existing multi-truth discovery methods primarily focus on the probability of each candidate value being correct and provide a point estimate. However, practical applications face the problem of unbalanced object distribution, where a single point estimate may overlook critical confidence information. Additionally, ambiguous terms like “etc.” and “et. al” can lead to estimation deviations. To address these issues, we propose MTD_VCI, an optimization model for confidence perception of multiple truths to detect truth from unbalanced data distribution. MTD_VCI estimates the credibility score of each candidate value and considers the confidence interval to reflect the unevenness distribution, improving decision-making. Additionally, the number of values claimed by ambiguous sources is re-estimated using other sources as a reference. Experiment results on real-world and simulated datasets demonstrate that MTD_VCI produces better results and effective confidence intervals for each value.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications
Subtitle of host publication19th International Conference, ADMA 2023, Shenyang, China, August 21–23, 2023, proceedings, part V
EditorsXiaochun Yang, Heru Suhartanto, Guoren Wang, Bin Wang, Jing Jiang, Bing Li, Huaijie Zhu, Ningning Cui
Place of PublicationCham
PublisherSpringer, Springer Nature
Pages599-615
Number of pages17
ISBN (Electronic)9783031466779
ISBN (Print)9783031466762
DOIs
Publication statusPublished - 2023
Event19th International Conference on Advanced Data Mining and Applications, ADMA 2023 - Shenyang, China
Duration: 21 Aug 202323 Aug 2023

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume14180
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Advanced Data Mining and Applications, ADMA 2023
Country/TerritoryChina
CityShenyang
Period21/08/2323/08/23

Keywords

  • Multi-Truth discovery
  • Unbalanced distribution
  • Confidence interval

Fingerprint

Dive into the research topics of 'A multi-truth discovery approach based on confidence interval estimation of truths'. Together they form a unique fingerprint.

Cite this