TY - GEN
T1 - Analyzing the sensitivity of Deep Neural Networks for sentiment analysis
T2 - 2020 International Joint Conference on Neural Networks
AU - Alhazmi, Ahoud
AU - Zhang, Wei Emma
AU - Sheng, Quan Z.
AU - Aljubairy, Abdulwahab
PY - 2020
Y1 - 2020
N2 - Deep Neural Networks (DNNs) have gained significant popularity in various Natural Language Processing tasks. However, the lack of interpretability of DNNs induces challenges to evaluate the robustness of DNNs. In this paper, we particularly focus on DNNs on sentiment analysis and conduct an empirical investigation on the sensitivity of DNNs. Specifically, we apply a scoring function to rank words importance without depending on the parameters or structure of the deep neural model. Then, we scan characteristics of these words to identify the model's weakness and perturb words to craft targeted attacks that exploit this weakness. We conduct extensive experiments on different neural network models across several real-world datasets. We report four intriguing findings: i) modern deep learning models for sentiment analysis ignore important sentiment terms such as opinion adjectives (i.e., amazing or terrible), ii) adjective words contribute to fooling sentiment analysis models more than other Parts-of-Speech (POS) categories, iii) changing or removing up to 10 adjectives words in a review text only decreases the accuracy up to 2%, and iv) modern models are unable to recognize the difference between an objective and a subjective review text.
AB - Deep Neural Networks (DNNs) have gained significant popularity in various Natural Language Processing tasks. However, the lack of interpretability of DNNs induces challenges to evaluate the robustness of DNNs. In this paper, we particularly focus on DNNs on sentiment analysis and conduct an empirical investigation on the sensitivity of DNNs. Specifically, we apply a scoring function to rank words importance without depending on the parameters or structure of the deep neural model. Then, we scan characteristics of these words to identify the model's weakness and perturb words to craft targeted attacks that exploit this weakness. We conduct extensive experiments on different neural network models across several real-world datasets. We report four intriguing findings: i) modern deep learning models for sentiment analysis ignore important sentiment terms such as opinion adjectives (i.e., amazing or terrible), ii) adjective words contribute to fooling sentiment analysis models more than other Parts-of-Speech (POS) categories, iii) changing or removing up to 10 adjectives words in a review text only decreases the accuracy up to 2%, and iv) modern models are unable to recognize the difference between an objective and a subjective review text.
KW - Adversarial Examples
KW - Deep Neural Networks
KW - Sentiment Analysis
UR - http://www.scopus.com/inward/record.url?scp=85093859047&partnerID=8YFLogxK
U2 - 10.1109/IJCNN48605.2020.9207000
DO - 10.1109/IJCNN48605.2020.9207000
M3 - Conference proceeding contribution
AN - SCOPUS:85093859047
T3 - IEEE International Joint Conference on Neural Networks (IJCNN)
SP - 1
EP - 7
BT - 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers (IEEE)
CY - Piscataway, NJ
Y2 - 19 July 2020 through 24 July 2020
ER -