Exploring causal directions through word occurrences: semi-supervised Bayesian classification framework

King Tao Jason Ng, Diego Mollá

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

Abstract

Determining causal directions in sentences plays a critical role in understanding the cause-and-effect relationship between entities. In this paper, we empirically show that word occurrences from several Internet domains resemble the characteristics of causal directions. Our research contributes to the knowledge of the underlying data generation process behind causal directions. We propose a two-phase method: (1) a Bayesian framework that generates synthetic data from posteriors by incorporating word occurrences from Internet domains, and (2) a pre-trained BERT model that utilizes the semantics of words based on context to perform classification. The proposed method demonstrates an improvement in performance for Cause-Effect relations in the SemEval-2010 dataset compared to random guessing.

Original languageEnglish
Title of host publicationProceedings of the 21st AnnualWorkshop of the Australasian Language Technology Association
Place of PublicationOnline
PublisherAssociation for Computational Linguistics
Pages30-39
Number of pages10
Publication statusPublished - 2023
Event21st Annual Workshop of the Australasian Language Technology Association, ALTA 2023 - Melbourne, Australia
Duration: 29 Nov 20231 Dec 2023

Conference

Conference21st Annual Workshop of the Australasian Language Technology Association, ALTA 2023
Country/TerritoryAustralia
CityMelbourne
Period29/11/231/12/23

Cite this