TY - GEN
T1 - Chinese sentence matching with multiple alignments and feature augmentation
AU - Zuo, Youhui
AU - Peng, Xueping
AU - Lu, Wenpeng
AU - Wang, Shoujin
AU - Li, Zhao
AU - Zhang, Weiyu
AU - Zhai, Yi
PY - 2022
Y1 - 2022
N2 - Chinese sentence matching is a critical yet challenging task in natural language processing. Recent work on modeling sentence semantic relations with deep neural models has shown great potential for improving the performance of sentence matching. However, existing sentence matching methods usually focus on generating word-level sentence representations, which neglects character-level information and leads to weak semantic representations. Also, they usually capture interactive features with an attention-based alignment, which is typically implemented at the sentence level and neglects the interactions among characters, words and sentences. This paper proposes a novel Chinese sentence matching model with Multiple Alignments and Feature Augmentation (MAFA). Specifically, the model first employs a multi-level embedding layer to accept the character and word sequences of sentences, and introduces a multiple alignment layer to capture the interactions among characters, words and sentences in turn. Then, a feature augmentation layer is applied to combine the interactive features and generate the final semantic matching representations. Finally, a prediction layer is used to judge the matching degree of the input sentences. Extensive experiments on two real-world data sets show that MAFA significantly outperforms the competing methods and achieves comparable performance with BERT-based methods.
AB - Chinese sentence matching is a critical yet challenging task in natural language processing. Recent work on modeling sentence semantic relations with deep neural models has shown great potential for improving the performance of sentence matching. However, existing sentence matching methods usually focus on generating word-level sentence representations, which neglects character-level information and leads to weak semantic representations. Also, they usually capture interactive features with an attention-based alignment, which is typically implemented at the sentence level and neglects the interactions among characters, words and sentences. This paper proposes a novel Chinese sentence matching model with Multiple Alignments and Feature Augmentation (MAFA). Specifically, the model first employs a multi-level embedding layer to accept the character and word sequences of sentences, and introduces a multiple alignment layer to capture the interactions among characters, words and sentences in turn. Then, a feature augmentation layer is applied to combine the interactive features and generate the final semantic matching representations. Finally, a prediction layer is used to judge the matching degree of the input sentences. Extensive experiments on two real-world data sets show that MAFA significantly outperforms the competing methods and achieves comparable performance with BERT-based methods.
KW - Chinese Sentence Matching
KW - Semantic Representation
KW - Multiple Alignments
KW - Feature Augmentation
UR - http://www.scopus.com/inward/record.url?scp=85140800840&partnerID=8YFLogxK
U2 - 10.1109/IJCNN55064.2022.9892521
DO - 10.1109/IJCNN55064.2022.9892521
M3 - Conference proceeding contribution
AN - SCOPUS:85140800840
SN - 9781665495264
T3 - IEEE International Joint Conference on Neural Networks (IJCNN)
BT - 2022 International Joint Conference on Neural Networks (IJCNN)
PB - Institute of Electrical and Electronics Engineers (IEEE)
CY - Piscataway, NJ
T2 - 2022 International Joint Conference on Neural Networks, IJCNN 2022
Y2 - 18 July 2022 through 23 July 2022
ER -