Bayesian attention-based user behaviour modelling for click-through rate prediction

Yihao Zhang*, Mian Chen, Ruizhen Chen, Chu Zhao, Meng Yuan, Zhu Sun

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)
39 Downloads (Pure)

Abstract

Exploiting the hierarchical dependence behind user behaviour is critical for click-through rate (CRT) prediction in recommender systems. Existing methods apply attention mechanisms to obtain the weights of items; however, the authors argue that deterministic attention mechanisms cannot capture the hierarchical dependence between user behaviours because they treat each user behaviour as an independent individual and cannot accurately express users' flexible and changeable interests. To tackle this issue, the authors introduce the Bayesian attention to the CTR prediction model, which treats attention weights as data-dependent local random variables and learns their distribution by approximating their posterior distribution. Specifically, the prior knowledge is constructed into the attention weight distribution, and then the posterior inference is utilised to capture the implicit and flexible user intentions. Extensive experiments on public datasets demonstrate that our algorithm outperforms state-of-the-art algorithms. Empirical evidence shows that random attention weights can predict user intentions better than deterministic ones.

Original languageEnglish
Pages (from-to)1320-1330
Number of pages11
JournalCAAI Transactions on Intelligence Technology
Volume9
Issue number5
DOIs
Publication statusPublished - Oct 2024

Bibliographical note

Copyright the Author(s) 2024. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • data mining
  • machine learning
  • natural language processing
  • recommender systems

Fingerprint

Dive into the research topics of 'Bayesian attention-based user behaviour modelling for click-through rate prediction'. Together they form a unique fingerprint.

Cite this