A weighted K-member clustering algorithm for K-anonymization

Yan Yan, Eyeleko Anselme Herman*, Adnan Mahmood, Tao Feng, Pengshou Xie

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

21 Citations (Scopus)

Abstract

As a representative model for privacy preserving data publishing, K-anonymity has raised a considerable number of questions for researchers over the past few decades. Among them, how to achieve data release without sacrificing the users’ privacy and how to maximize the availability of published data is the ultimate goal of privacy preserving data publishing. In order to enhance the clustering effect and reduce the unnecessary computation, this paper proposes a weighted K-member clustering algorithm. A series of weight indicators are designed to evaluate the outlyingness of records, distance between records, and information loss of the published data. The proposed algorithm can reduce the influence of outliers on the clustering effect and maintain the availability of data to the best possible extent during the clustering process. Experimental analysis suggests that the proposed method generates lower information loss, improves the clustering effect, and is less sensitive to outliers as compared with some existing methods.

Original languageEnglish
Pages (from-to)2251-2273
Number of pages23
JournalComputing
Volume103
Issue number10
Early online date20 Feb 2021
DOIs
Publication statusPublished - Oct 2021

Keywords

  • K-anonymity
  • Privacy preserving data publishing
  • Information loss
  • Clustering
  • Outliers

Fingerprint

Dive into the research topics of 'A weighted K-member clustering algorithm for K-anonymization'. Together they form a unique fingerprint.

Cite this