Abstract
This paper describes an approach to detect one particular demographic characteristic, occupation (or profession) in Twitter user profiles. In this paper, we show how effective the approach is for estimating occupational population statistics in Australian Twitter by correlating them with real-world population obtained from 2011 Australian census data. We also demonstrate that we can gain more reliable social media insights in the context of occupational representativeness in Twitter if a non-standard occupation name is mapped into a standard occupation name. To our knowledge, this is the first attempt to build a machine learning model that automatically identifies linguistically noisy or open-ended occupations in Twitter, resulting in more reliable occupational population.
Original language | English |
---|---|
Title of host publication | ADCS 2016 |
Subtitle of host publication | Proceedings of the 21st Australasian Document Computing Symposium |
Editors | Sarvnaz Karimi, Mark Carman |
Publisher | Association for Computing Machinery |
Pages | 57-64 |
Number of pages | 8 |
ISBN (Electronic) | 9781450348652 |
DOIs | |
Publication status | Published - 5 Dec 2016 |
Externally published | Yes |
Event | 21st Australasian Document Computing Symposium, ADCS 2016 - Caulfield, Australia Duration: 6 Dec 2016 → 7 Dec 2016 |
Other
Other | 21st Australasian Document Computing Symposium, ADCS 2016 |
---|---|
Country/Territory | Australia |
City | Caulfield |
Period | 6/12/16 → 7/12/16 |
Keywords
- occupation
- census
- Conditional Random Fields