An overview of big data issues in privacy-preserving record linkage

Dinusha Vatsalan, Dimitrios Karapiperis*, Aris Gkoulalas-Divanis

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference proceeding contribution > peer-review


Nearly 90% of today's data have been produced in the last two years alone. These data come from a multitude of human activities, including social networking sites, mobile phone applications, electronic medical records systems, and e-commerce sites. Integrating and analyzing this wealth and volume of data offers remarkable opportunities in sectors that are of high interest to businesses, governments, and academia. Given that the majority of the data are proprietary and may contain personal or business-sensitive information, Privacy-Preserving Record Linkage (PPRL) techniques are essential to perform data integration. In this paper, we review existing work in PPRL, focusing on the computational aspect of the proposed algorithms, which is crucial when dealing with big data. We propose an analysis tool for the computational aspects of PPRL, and characterize existing PPRL techniques along five dimensions. Based on our analysis, we identify research gaps in the current literature and promising directions for future work.
Original language: English
Title of host publication: Algorithmic Aspects of Cloud Computing
Subtitle of host publication: 4th International Symposium, ALGOCLOUD 2018, Helsinki, Finland, August 20–21, 2018, Revised Selected Papers
Editors: Yann Disser, Vassilios S. Verykios
Place of Publication: Cham, Switzerland
Publisher: Springer, Springer Nature
Number of pages: 19
ISBN (Electronic): 9783030197599
ISBN (Print): 9783030197582
Publication status: Published - 2019
Externally published: Yes
Event: 4th International Symposium on Algorithmic Aspects of Cloud Computing, ALGOCLOUD 2018 - Helsinki, Finland
Duration: 20 Aug 2018 to 21 Aug 2018

Publication series

Name: Lecture Notes in Computer Science
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349


Conference: 4th International Symposium on Algorithmic Aspects of Cloud Computing, ALGOCLOUD 2018


  • Privacy-Preserving Record Linkage
  • Entity resolution

