Abstract
In many natural language processing tasks, supervised machine learning approaches have proved most effective, and substantial effort has been made into collecting and annotating corpora for building such models. Emotion detection from text is no exception; however, research in this area is in its relative infancy, and few emotion annotated corpora exist to date. A further issue regarding the development of emotion annotated corpora is the difficulty of the annotation task and resulting inconsistencies in human annotations. One approach to address these problems is to use self-annotated data, using explicit indications of emotions included by the author of the data in question. We present a study of the use of unicode emoji as self-annotation of a Twitter user’s emotional state. Emoji are found to be used far more extensively than hash tags and we argue that they present a more faithful representation of a user’s emotional state. A substantial set of tweets containing emotion indicative emoji are collected and a sample annotated for emotions. The accuracy and utility of emoji as emotion labels are evaluated directly (with respect to annotations) and through trained statistical models. Results are cautiously optimisticand suggest further study of emotji usage.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Emotion and Sentiment Analysis Workshop, at LREC2016 |
| Editors | J. Fernando Sánchez-Rada, Björn Schuller |
| Place of Publication | Luxembourg |
| Publisher | European Language Resources Association (ELRA) |
| Pages | 76-79 |
| Number of pages | 4 |
| Publication status | Published - 23 May 2016 |
| Externally published | Yes |
| Event | International Conference on Language Resources and Evaluation (10th : 2016) - Portorož, Slovenia Duration: 23 May 2016 → 28 May 2016 |
Conference
| Conference | International Conference on Language Resources and Evaluation (10th : 2016) |
|---|---|
| Abbreviated title | LREC 2016 |
| Country/Territory | Slovenia |
| City | Portorož |
| Period | 23/05/16 → 28/05/16 |
Bibliographical note
Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.Keywords
- hash tags
- emotion annotation
- emotion detection
- emoji
- emoticons