Abstract
Record linkage is the challenging task of deciding which records, coming from disparate data sources, refer to the same entity. Established back in 1946 by Halbert L. Dunn, the area of record linkage has received tremendous attention over the years due to its numerous real-world applications, and has led to a plethora of technologies, methods, metrics, and systems. A major direction in record linkage regards methods for linking records in a privacy-preserving manner, where sensitive and personally identifiable information in the records is not leaked as part of the linkage process. In this article, we provide an overview of the large body of research literature in privacy-preserving record linkage, discuss the different generations of techniques that have been proposed, their advantages and limitations, and present a taxonomy as well as an extensive survey on the latest generation of methods. We conclude this work with a roadmap to the new generation of analytics-driven techniques that aims to address some of the major challenges in the field.
Original language | English |
---|---|
Pages (from-to) | 4966-4987 |
Number of pages | 22 |
Journal | IEEE Transactions on Information Forensics and Security |
Volume | 16 |
Early online date | 20 Sept 2021 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Keywords
- Databases
- information sharing
- data privacy