Abstract
Many organizations, including businesses, government agencies and research organizations, collect vast amounts of data, which are stored, processed and analyzed to mine interesting patterns and knowledge that support efficient, high-quality decision making. To improve data quality and facilitate further analysis, many application domains require information from multiple sources to be integrated and combined. The process of matching and aggregating records that relate to the same entities across different data sources, without compromising the privacy of those entities, is known as 'privacy-preserving record linkage' (PPRL), 'blind data linkage' or 'private record linkage'. In this paper we present MERLIN, an online tool that demonstrates various PPRL methods in a multi-party context. In this demonstration we show different private multi-party blocking and matching techniques, and illustrate the usability of MERLIN by presenting quality and performance measures of various PPRL methods. We believe MERLIN will help practitioners and researchers to better understand the pipeline of the PPRL process, to compare different multi-party PPRL techniques, and to determine the best technique to use for their needs.
Original language | English |
---|---|
Title of host publication | Proceedings - 15th IEEE International Conference on Data Mining Workshop |
Editors | Peng Cui, Jennifer Dy, Charu Aggarwal, Zhi-Hua Zhou, Alexander Tuzhilin, Hui Xiong, Xindong Wu |
Place of Publication | Piscataway, NJ |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 1640-1643 |
Number of pages | 4 |
ISBN (Electronic) | 9781467384926 |
Publication status | Published - 2015 |
Externally published | Yes |
Event | 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015 - Atlantic City, United States; Duration: 14 Nov 2015 → 17 Nov 2015 |
Other
Other | 15th IEEE International Conference on Data Mining Workshop, ICDMW 2015 |
---|---|
Country/Territory | United States |
City | Atlantic City |
Period | 14/11/15 → 17/11/15 |
Keywords
- Privacy
- data matching
- Bloom filters
- scalability
- online demo