P4Mobi

a probabilistic privacy-preserving framework for publishing mobility datasets

Qing Yang, Yiran Shen*, Dinusha Vatsalan, Jianpei Zhang, Mohamed Ali Kaafar, Wen Hu

*Corresponding author for this work

Research output: Contribution to journalArticle

Abstract

The large-scale collection of individuals' mobility data poses serious privacy concerns. Instead of perturbing data by adding noise to the raw location data to preserve privacy of individuals, we propose an approach that achieves privacy-preservation at the statistics level of aggregating mobility datasets with the probabilistic data structure Count-Min Sketch (CMS) [1], which has been widely used to provide efficient statistic functions with a tunable error bound. We use CMS to estimate the population density distributions in the mobility datasets, where the error bound determines utility guarantees. We develop P4Mobi, a novel Probabilistic Privacy-Preserving Publishing framework for Mobility datasets that protects individuals' privacy while complying to a specific utility requirement. We empirically validate the performance of P4Mobi in terms of utility and privacy-preservation by demonstrating its resilience against a recently proposed reconstruction attack model using two real-world datasets. We compare P4Mobi to two state-of-the-art methods and show that with the same level of privacy achieved against our attack model, P4Mobi significantly improves the utility of the published mobility datasets by up to 20%. We also provide a theoretical estimate of the utility achieved by P4Mobi. We found a very consistent match between the estimated and empirical utility of P4Mobi as evaluated on two datasets.

Original languageEnglish
Pages (from-to)6987-6999
Number of pages13
JournalIEEE Transactions on Vehicular Technology
Volume69
Issue number7
DOIs
Publication statusPublished - Jul 2020

Keywords

  • Mobility datasets
  • count-min sketch
  • privacy
  • utility
  • aggregation
  • data publishing

Fingerprint Dive into the research topics of 'P4Mobi: a probabilistic privacy-preserving framework for publishing mobility datasets'. Together they form a unique fingerprint.

Cite this