Abstract
Interactions among tweets, i.e., mentions, retweets, replies, are important factors contributing to the quality of topic derivation on Twitter. If applied correctly, the incorporation of tweet interactions can significantly improve the quality of topic derivation in comparison with approaches that are mainly based on the content similarity analysis. However, how interactions can be measured and integrated with content similarity for topic derivation remains a challenge. In previous work, the strength of tweet-to-tweet relationship has been computed by simply adding measures for content similarity, mentions, and reply-retweets. This simple linear addition does not accurately reflect the various impacts these factors have on tweet relationships. In order to address this issue, we propose a joint probability model that can effectively integrate the effects of the content similarity, mentions, and reply-retweets to measure the tweet relationship for the purpose of topic derivation. The proposed method is based on matrix factorization techniques, which enables a flexible implementation on a distributed system in an incremental manner. Experimental results show that the proposed model results in a significant improvement in the quality of topic derivation over existing methods.
Original language | English |
---|---|
Title of host publication | ICDCS 2017 |
Subtitle of host publication | Proceedings of the IEEE 37th International Conference on Distributed Computing Systems |
Editors | Kisung Lee, Ling Liu |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 2338-2343 |
Number of pages | 6 |
ISBN (Electronic) | 9781538617922 |
ISBN (Print) | 9781538617939 |
DOIs | |
Publication status | Published - 13 Jul 2017 |
Event | 37th IEEE International Conference on Distributed Computing Systems, ICDCS 2017 - Atlanta, United States Duration: 5 Jun 2017 → 8 Jun 2017 |
Other
Other | 37th IEEE International Conference on Distributed Computing Systems, ICDCS 2017 |
---|---|
Country/Territory | United States |
City | Atlanta |
Period | 5/06/17 → 8/06/17 |
Keywords
- Interactions of Tweets
- Joint Matrix Factorization
- Topic Derivation