Large-scale entity alignment via knowledge graph merging, partitioning and embedding

Kexuan Xin, Zequn Sun, Wen Hua, Wei Hu, Jianfeng Qu, Xiaofang Zhou

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

Abstract

Entity alignment is a crucial task in knowledge graph fusion. However, most entity alignment approaches have the scalability problem. Recent methods address this issue by dividing large KGs into small blocks for embedding and alignment learning in each. However, such a partitioning and learning process results in an excessive loss of structure and alignment. Therefore, in this work, we propose a scalable GNN-based entity alignment approach to reduce the structure and alignment loss from three perspectives. First, we propose a centrality-based subgraph generation algorithm to recall some landmark entities serving as the bridges between different subgraphs. Second, we introduce self-supervised entity reconstruction to recover entity representations from incomplete neighborhood subgraphs, and design cross-subgraph negative sampling to incorporate entities from other subgraphs in alignment learning. Third, during the inference process, we merge the embeddings of subgraphs to make a single space for alignment search. Experimental results on the benchmark OpenEA dataset and the proposed large DBpedia1M dataset verify the effectiveness of our approach.
Original languageEnglish
Title of host publicationCIKM '22
Subtitle of host publicationProceedings of the 31st ACM International Conference on Information & Knowledge Management
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages2240-2249
Number of pages10
ISBN (Electronic)9781450392365
DOIs
Publication statusPublished - 17 Oct 2022
Externally publishedYes
Event31st ACM International Conference on Information and Knowledge Management, CIKM 2022 - Atlanta, United States
Duration: 17 Oct 202221 Oct 2022

Conference

Conference31st ACM International Conference on Information and Knowledge Management, CIKM 2022
Country/TerritoryUnited States
CityAtlanta
Period17/10/2221/10/22

Keywords

  • large-scale
  • entity alignment
  • graph neural networks

Cite this