Skip to main navigation Skip to search Skip to main content

A novel hybrid methodology of measuring sentence similarity

Yongmin Yoo, Tak-Sung Heo, Yeongjoon Park, Kyungsun Kim*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The problem of measuring sentence similarity is an essential issue in the natural language processing area. It is necessary to measure the similarity between sentences accurately. Sentence similarity measuring is the task of finding semantic symmetry between two sentences, regardless of word order and context of the words. There are many approaches to measuring sentence similarity. Deep learning methodology shows a state-of-the-art performance in many natural language processing fields and is used a lot in sentence similarity measurement methods. However, in the natural language processing field, considering the structure of the sentence or the word structure that makes up the sentence is also important. In this study, we propose a methodology combined with both deep learning methodology and a method considering lexical relationships. Our evaluation metric is the Pearson correlation coefficient and Spearman correlation coefficient. As a result, the proposed method outperforms the current approaches on a KorSTS standard benchmark Korean dataset. Moreover, it performs a maximum of a 65% increase than only using deep learning methodology. Experiments show that our proposed method generally results in better performance than those with only a deep learning model.

Original languageEnglish
Article number1442
Pages (from-to)1-10
Number of pages10
JournalSymmetry
Volume13
Issue number8
DOIs
Publication statusPublished - Aug 2021
Externally publishedYes

Bibliographical note

Copyright the Author(s) 2021. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Keywords

  • natural language processing
  • sentence similarity
  • deep learning
  • lexical relationship

Fingerprint

Dive into the research topics of 'A novel hybrid methodology of measuring sentence similarity'. Together they form a unique fingerprint.

Cite this