Abstract
While most recent work has focused on instances of opinion spam which are manually identifiable or deceptive opinion spam which are written by paid writers separately, in this work we study both of these interesting topics and propose an effective framework which has good performance on both datasets. Based on the golden-standard opinion spam dataset, we propose a novel model which integrates some deep linguistic features derived from a syntactic dependency parsing tree to discriminate deceptive opinions from normal ones. On a background of multiple language tasks, our model is evaluated on both English (gold-standard) and Chinese (non-gold) datasets. The experimental results show that our model produces state-of-the-art results on both of the topics.
Original language | English |
---|---|
Title of host publication | Proceedings of COLING 2012 |
Subtitle of host publication | posters |
Editors | Martin Kay, Christian Boitet |
Place of Publication | Powai, India |
Publisher | The COLING 2012 Organizing Committee |
Pages | 1341-1350 |
Number of pages | 10 |
Publication status | Published - 2012 |
Externally published | Yes |
Event | International Conference on Computational Linguistics (24th : 2012) - Mumbai, India Duration: 8 Dec 2012 → 15 Dec 2012 |
Conference
Conference | International Conference on Computational Linguistics (24th : 2012) |
---|---|
City | Mumbai, India |
Period | 8/12/12 → 15/12/12 |
Keywords
- Opinion Spam
- Multi-Language
- Deep Linguistic Features