Abstract
Truth-finding is the fundamental technique for corroborating reports from multiple sources in both data integration and collective intelligent applications. Traditional truth-finding methods assume a single true value for each data item and therefore cannot deal will multiple true values (i.e., the multi-truth-finding problem). So far, the existing approaches handle the multi-truth-finding problem in the same way as the single-truth-finding problems. Unfortunately, the multi-truth-finding problem has its unique features, such as the involvement of sets of values in claims, different implications of inter-value mutual exclusion, and larger source profiles. Considering these features could provide new opportunities for obtaining more accurate truth-finding results. Based on this insight, we propose an integrated Bayesian approach to the multi-truth-finding problem, by taking these features into account. To improve the truth-finding efficiency, we reformulate the multi-truth-finding problem model based on the mappings between sources and (sets of) values. New mutual exclusive relations are defined to reflect the possible co-existence of multiple true values. A finer-grained copy detection method is also proposed to deal with sources with large profiles. The experimental results on three real-world datasets show the effectiveness of our approach.
| Original language | English |
|---|---|
| Title of host publication | CIKM 2015 |
| Subtitle of host publication | Proceedings of the 24th ACM International Conference on Information and Knowledge Management |
| Place of Publication | New York, NY |
| Publisher | Association for Computing Machinery |
| Pages | 493-502 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781450337946 |
| DOIs | |
| Publication status | Published - 2015 |
| Externally published | Yes |
| Event | 24th ACM International Conference on Information and Knowledge Management, CIKM 2015 - Melbourne, Australia Duration: 19 Oct 2015 → 23 Oct 2015 |
Other
| Other | 24th ACM International Conference on Information and Knowledge Management, CIKM 2015 |
|---|---|
| Country/Territory | Australia |
| City | Melbourne |
| Period | 19/10/15 → 23/10/15 |
Keywords
- truth discovery
- multi-truth-finding features
- Bayesian model
- data source dependence