TY - GEN
T1 - Deriving Topics in Twitter by Exploiting Tweet Interactions
AU - Nugroho, Robertus
AU - Yang, Jian
AU - Zhong, Youliang
AU - Paris, Cecile
AU - Nepal, Surya
PY - 2015/8/17
Y1 - 2015/8/17
N2 - Twitter as a big data social network becomes one of the most important sources for capturing the up-To-date events happening in the world. Topic derivation from Twitter is important for various applications such as situation awareness, market analysis, content filtering, and recommendations. However, tweets are short messages, which makes topic derivation challenging. Current methods employ various semantic features of tweet content but mostly overlook the interactions among tweets. In this paper, we propose a novel topic derivation method that takes into account the interactions among tweets, defined as the reciprocal activities related to people who send the tweets, as well as actions and tweet contents. In particular, topics are derived by performing a two-step matrix factorization jointly over the interactions and semantic features of the tweets. We have conducted a number of experiments on tweets collected over a period of time, showing that the proposed method consistently outperforms other advanced topic derivation methods in the literature. Our experiments also reveal that the interactions among tweets do significantly relieve the sparsity problem caused by the short-Text nature of Twitter.
AB - Twitter as a big data social network becomes one of the most important sources for capturing the up-To-date events happening in the world. Topic derivation from Twitter is important for various applications such as situation awareness, market analysis, content filtering, and recommendations. However, tweets are short messages, which makes topic derivation challenging. Current methods employ various semantic features of tweet content but mostly overlook the interactions among tweets. In this paper, we propose a novel topic derivation method that takes into account the interactions among tweets, defined as the reciprocal activities related to people who send the tweets, as well as actions and tweet contents. In particular, topics are derived by performing a two-step matrix factorization jointly over the interactions and semantic features of the tweets. We have conducted a number of experiments on tweets collected over a period of time, showing that the proposed method consistently outperforms other advanced topic derivation methods in the literature. Our experiments also reveal that the interactions among tweets do significantly relieve the sparsity problem caused by the short-Text nature of Twitter.
UR - https://www.scopus.com/pages/publications/84959525239
U2 - 10.1109/BigDataCongress.2015.22
DO - 10.1109/BigDataCongress.2015.22
M3 - Conference proceeding contribution
AN - SCOPUS:84959525239
SN - 9781467372794
T3 - IEEE International Congress on Big Data
SP - 87
EP - 94
BT - Proceedings - 2015 IEEE International Congress on Big Data, BigData Congress 2015
A2 - Barbara, Carminati
A2 - Khan, Latifur
PB - Institute of Electrical and Electronics Engineers (IEEE)
CY - Piscataway, NJ
T2 - 4th IEEE International Congress on Big Data, BigData Congress 2015
Y2 - 27 June 2015 through 2 July 2015
ER -