Word sense disambiguation with knowledge-enhanced and local self-attention-based extractive sense comprehension

Guobiao Zhang, Wenpeng Lu*, Xueping Peng, Shoujin Wang, Baoshuo Kan, Rui Yu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

16 Citations (Scopus)

Abstract

Word sense disambiguation (WSD), identifying the most suitable meaning of ambiguous words in the given contexts according to a predefined sense inventory, is one of the most classical and challenging tasks in natural language processing. Reformulating WSD as a text span extraction task is an effective approach, which accepts a sentence context of an ambiguous word together with all definitions of its candidate senses simultaneously, and requires to extract the text span corresponding with the right sense. However, the approach merely depends on a short definition to learn sense representation, which neglects abundant semantic knowledge from related senses and leads to data-inefficient learning and suboptimal WSD performance. To address the limitations, we propose a novel WSD method with Knowledge-Enhanced and Local self-attention-based Extractive Sense Comprehension (KELESC). Specifically, a knowledge-enhanced method is proposed to enrich semantic representation by incorporating additional examples and definitions of the related senses in WordNet. Then, in order to avoid the huge computing complexity induced by the additional information, a local self-attention mechanism is utilized to constrain attention to be local, which allows longer input texts without large-scale computing burdens. Extensive experimental results demonstrate that KELESC achieves better performance than baseline models on public benchmark datasets.

Original languageEnglish
Title of host publicationProceedings of the Main Conference The 29th International Conference on Computational Linguistics
Place of PublicationNew York
PublisherInternational Committee on Computational Linguistics
Pages4061-4070
Number of pages10
Publication statusPublished - 2022
Externally publishedYes
Event29th International Conference on Computational Linguistics, COLING 2022 - Gyeongju, Korea, Republic of
Duration: 12 Oct 202217 Oct 2022

Publication series

NameCOLING
PublisherInternational Committee on Computational Linguistics
Number1
Volume29
ISSN (Print)2951-2093

Conference

Conference29th International Conference on Computational Linguistics, COLING 2022
Country/TerritoryKorea, Republic of
CityGyeongju
Period12/10/2217/10/22

Fingerprint

Dive into the research topics of 'Word sense disambiguation with knowledge-enhanced and local self-attention-based extractive sense comprehension'. Together they form a unique fingerprint.

Cite this