Divide and denoise: empowering simple models for robust graph semi-supervised learning against label noise

Kaize Ding*, Xiaoxiao Ma, Yixin Liu, Shirui Pan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

4 Citations (Scopus)

Abstract

Graph neural networks (GNNs) based on message passing have achieved remarkable performance in graph machine learning. By combining it with the power of pseudo labeling, one can further push forward the performance on the task of semi-supervised node classification. However, most existing works assume that the training node labels are purely noise-free, while this strong assumption usually does not hold in practice. GNNs will overfit the noisy training labels and the adverse effects of mislabeled nodes can be exaggerated by being propagated to the remaining nodes through the graph structure, exacerbating the model failure. Worse still, the noisy pseudo labels could also largely undermine the model's reliability without special treatment. In this paper, we revisit the role of (1) message passing and (2) pseudo labels in the studied problem and try to address two denoising subproblems from the model architecture and algorithm perspective, respectively. Specifically, we first develop a label-noise robust GNN that discards the coupled message-passing scheme. Despite its simple architecture, this learning backbone prevents overfitting to noisy labels and also inherently avoids the noise propagation issue. Moreover, we propose a novel reliable graph pseudo labeling algorithm that can effectively leverage the knowledge of unlabeled nodes while mitigating the adverse effects of noisy pseudo labels. Based on those novel designs, we can attain exceptional effectiveness and efficiency in solving the studied problem. We conduct extensive experiments on benchmark datasets for semi-supervised node classification with different levels of label noise and show new state-of-the-art performance. The code is available at https://github.com/DND-NET/DND-NET.

Original languageEnglish
Title of host publicationKDD '24
Subtitle of host publicationProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery
Pages574-584
Number of pages11
ISBN (Electronic)9798400704901
DOIs
Publication statusPublished - 24 Aug 2024
EventACM SIGKDD Conference on Knowledge Discovery and Data Mining (30th : 2024) - Barcelona, Spain
Duration: 25 Aug 202429 Aug 2024

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
ISSN (Print)2154-817X

Conference

ConferenceACM SIGKDD Conference on Knowledge Discovery and Data Mining (30th : 2024)
Abbreviated titleKDD '24
Country/TerritorySpain
CityBarcelona
Period25/08/2429/08/24

Keywords

  • graph neural networks
  • noisy labels
  • semi-supervised learning

Fingerprint

Dive into the research topics of 'Divide and denoise: empowering simple models for robust graph semi-supervised learning against label noise'. Together they form a unique fingerprint.

Cite this