Using social connection information to improve opinion mining: Identifying negative sentiment about HPV vaccines on Twitter

Xujuan Zhou*, Enrico Coiera, Guy Tsafnat, Diana Arachi, Mei Sing Ong, Adam G. Dunn

*Corresponding author for this work

Research output: Contribution to journalConference paperpeer-review

44 Citations (Scopus)
21 Downloads (Pure)


The manner in which people preferentially interact with others like themselves suggests that information about social connections may be useful in the surveillance of opinions for public health purposes. We examined if social connection information from tweets about human papillomavirus (HPV) vaccines could be used to train classifiers that identify anti-vaccine opinions. From 42,533 tweets posted between October 2013 and March 2014, 2,098 were sampled at random and two investigators independently identified anti-vaccine opinions. Machine learning methods were used to train classifiers using the first three months of data, including content (8,261 text fragments) and social connections (10,758 relationships). Connection-based classifiers performed similarly to content-based classifiers on the first three months of training data, and performed more consistently than content-based classifiers on test data from the subsequent three months. The most accurate classifier achieved an accuracy of 88.6% on the test data set, and used only social connection features. Information about how people are connected, rather than what they write, may be useful for improving public health surveillance methods on Twitter.

Original languageEnglish
Pages (from-to)761-765
Number of pages5
JournalStudies in Health Technology and Informatics
Publication statusPublished - 2015
Event15th World Congress on Health and Biomedical Informatics, MEDINFO 2015 - Sao Paulo, Brazil
Duration: 19 Aug 201523 Aug 2015

Bibliographical note

Copyright the Publisher 2015. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.


Dive into the research topics of 'Using social connection information to improve opinion mining: Identifying negative sentiment about HPV vaccines on Twitter'. Together they form a unique fingerprint.

Cite this