Abstract
Detecting events in real-time from the Twitter data stream has gained substantial attention in recent years from researchers around the world. Different event detection approaches have been proposed as a result of these research efforts. One of the major challenges faced in this context is the high computational cost associated with event detection in real-time. We propose, TwitterNews+, an event detection system that incorporates specialized inverted indices and an incremental clustering approach to provide a low computational cost solution to detect both major and minor newsworthy events in real-time from the Twitter data stream. In addition, we conduct an extensive parameter sensitivity analysis to fine-tune the parameters used in TwitterNews+ to achieve the best performance. Finally, we evaluate the effectiveness of our system using a publicly available corpus as a benchmark dataset. The results of the evaluation show a significant improvement in terms of recall and precision over five state-of-the-art baselines we have used.
Original language | English |
---|---|
Pages (from-to) | 1146-1165 |
Number of pages | 20 |
Journal | Information Processing and Management |
Volume | 56 |
Issue number | 3 |
Early online date | 13 Mar 2018 |
DOIs | |
Publication status | Published - May 2019 |
Keywords
- Event detection
- Incremental clustering
- Microblog
- Parameter sensitivity analysis
- Social media