Measuring re-identification risk

CJ Carey, Travis Dick, Alessandro Epasto, Adel Javanmard, Josh Karlin, Shankar Kumar, Andrés Muñoz Medina, Vahab Mirrokni, Gabriel Henrique Nunes, Sergei Vassilvitskii, Peilin Zhong

Research output: Working paperPreprint

Abstract

Compact user representations (such as embeddings) form the backbone of personalization services. In this work, we present a new theoretical framework to measure re-identification risk in such user representations. Our framework, based on hypothesis testing, formally bounds the probability that an attacker may be able to obtain the identity of a user from their representation. As an application, we show how our framework is general enough to model important real-world applications such as the Chrome's Topics API for interest-based advertising. We complement our theoretical bounds by showing provably good attack algorithms for re-identification that we use to estimate the re-identification risk in the Topics API. We believe this work provides a rigorous and interpretable notion of re-identification risk and a framework to measure it that can be used to inform real-world applications.
Original languageEnglish
Pages1-26
Number of pages26
DOIs
Publication statusSubmitted - 12 Apr 2023
Externally publishedYes

Publication series

NamearXiv

Fingerprint

Dive into the research topics of 'Measuring re-identification risk'. Together they form a unique fingerprint.
  • Measuring re-identification risk

    Carey, CJ., Dick, T., Epasto, A., Javanmard, A., Karlin, J., Kumar, S., Medina, A. M., Mirrokni, V., Nunes, G. H., Vassilvitskii, S. & Zhong, P., Jun 2023, In: Proceedings of the ACM on Management of Data. 1, 2, p. 1-26 26 p., 149.

    Research output: Contribution to journalConference paperpeer-review

    Open Access
    File
    70 Downloads (Pure)

Cite this