TY - JOUR
T1 - Variational co-embedding learning for attributed network clustering
AU - Yang, Shuiqiao
AU - Verma, Sunny
AU - Cai, Borui
AU - Jiang, Jiaojiao
AU - Yu, Kun
AU - Chen, Fang
AU - Yu, Shui
PY - 2023/6/21
Y1 - 2023/6/21
N2 - Recent developments in attributed network clustering combine graph neural networks and autoencoders for unsupervised learning. Although effective, these techniques suffer from either (a) clustering-unfriendly embedding spaces or (b) limited utilization of attribute information. To address these issues, we propose a novel model called Variational Co-embedding Learning Model for Attributed Network Clustering (VCLANC), which utilizes much deeper information from the network by reconstructing both the network structure and the node attributes to perform self-supervised learning. Technically, VCLANC consists of dual variational autoencoders that co-embed nodes and attributes into the same latent space, along with a trainable Gaussian mixture prior that simultaneously performs representation learning and node clustering. To optimize the variational autoencoders and infer the latent variables of embeddings and clustering assignments, we derive a new variational lower bound that maximizes the joint likelihood of the observed network structure and node attributes. Furthermore, we also adopt a mutual distance loss on the cluster centers and a clustering assignment hardening loss on the node embeddings to strengthen clustering quality. Our experimental results on four real-world datasets demonstrate the outstanding performance of VCLANC for attributed network clustering.
AB - Recent developments in attributed network clustering combine graph neural networks and autoencoders for unsupervised learning. Although effective, these techniques suffer from either (a) clustering-unfriendly embedding spaces or (b) limited utilization of attribute information. To address these issues, we propose a novel model called Variational Co-embedding Learning Model for Attributed Network Clustering (VCLANC), which utilizes much deeper information from the network by reconstructing both the network structure and the node attributes to perform self-supervised learning. Technically, VCLANC consists of dual variational autoencoders that co-embed nodes and attributes into the same latent space, along with a trainable Gaussian mixture prior that simultaneously performs representation learning and node clustering. To optimize the variational autoencoders and infer the latent variables of embeddings and clustering assignments, we derive a new variational lower bound that maximizes the joint likelihood of the observed network structure and node attributes. Furthermore, we also adopt a mutual distance loss on the cluster centers and a clustering assignment hardening loss on the node embeddings to strengthen clustering quality. Our experimental results on four real-world datasets demonstrate the outstanding performance of VCLANC for attributed network clustering.
KW - Attributed network clustering
KW - Graph neural network
KW - Variational autoencoder
UR - http://www.scopus.com/inward/record.url?scp=85152123515&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2023.110530
DO - 10.1016/j.knosys.2023.110530
M3 - Article
AN - SCOPUS:85152123515
SN - 0950-7051
VL - 270
SP - 1
EP - 13
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 110530
ER -