Abstract
Determining causal directions in sentences plays a critical role in understanding the cause-and-effect relationship between entities. In this paper, we empirically show that word occurrences from several Internet domains resemble the characteristics of causal directions. Our research contributes to the knowledge of the underlying data generation process behind causal directions. We propose a two-phase method: (1) a Bayesian framework that generates synthetic data from posteriors by incorporating word occurrences from Internet domains, and (2) a pre-trained BERT model that utilizes the semantics of words based on context to perform classification. The proposed method demonstrates an improvement in performance for Cause-Effect relations in the SemEval-2010 dataset compared to random guessing.
Original language | English |
---|---|
Title of host publication | Proceedings of the 21st AnnualWorkshop of the Australasian Language Technology Association |
Place of Publication | Online |
Publisher | Association for Computational Linguistics |
Pages | 30-39 |
Number of pages | 10 |
Publication status | Published - 2023 |
Event | 21st Annual Workshop of the Australasian Language Technology Association, ALTA 2023 - Melbourne, Australia Duration: 29 Nov 2023 → 1 Dec 2023 |
Conference
Conference | 21st Annual Workshop of the Australasian Language Technology Association, ALTA 2023 |
---|---|
Country/Territory | Australia |
City | Melbourne |
Period | 29/11/23 → 1/12/23 |