TY - GEN
T1 - Matrix Inter-joint Factorization-A New Approach for Topic Derivation in Twitter
AU - Nugroho, Robertus
AU - Zhong, Youliang
AU - Yang, Jian
AU - Paris, Cecile
AU - Nepal, Surya
PY - 2015/8/17
Y1 - 2015/8/17
N2 - Amongst all the social media platforms available, Twitter is rapidly becoming the main one used for communications about real-Time events. As a result, there is a lot of interest in monitoring Twitter and understanding the topics of conversations. However, the fact that tweets are short in content makes topics derivation a challenge, as most existing methods use content features only, sometimes integrated with limited interaction information. In this paper, we propose a novel method: Non-negative Matrix inter-joint Factorization (NMijF), in which we conduct co-factorization jointly over Twitter interaction features and content attributes within a single iterative-update process. We have conducted comprehensive experiments on real Twitter datasets and evaluated the performance of the proposed method, especially comparing it with the Joint Non-negative Matrix Factorization (joint-NMF) and Non-negative Matrix co-Factorization (NMcF) methods. Our experiment results show that the proposed NMijF method outperforms joint-NMF, NMcF and other advanced topic derivation methods in terms of Topic Coherence, Purity, Normalized Mutual Information and Precision-Recall.
AB - Amongst all the social media platforms available, Twitter is rapidly becoming the main one used for communications about real-Time events. As a result, there is a lot of interest in monitoring Twitter and understanding the topics of conversations. However, the fact that tweets are short in content makes topics derivation a challenge, as most existing methods use content features only, sometimes integrated with limited interaction information. In this paper, we propose a novel method: Non-negative Matrix inter-joint Factorization (NMijF), in which we conduct co-factorization jointly over Twitter interaction features and content attributes within a single iterative-update process. We have conducted comprehensive experiments on real Twitter datasets and evaluated the performance of the proposed method, especially comparing it with the Joint Non-negative Matrix Factorization (joint-NMF) and Non-negative Matrix co-Factorization (NMcF) methods. Our experiment results show that the proposed NMijF method outperforms joint-NMF, NMcF and other advanced topic derivation methods in terms of Topic Coherence, Purity, Normalized Mutual Information and Precision-Recall.
UR - http://www.scopus.com/inward/record.url?scp=84959531863&partnerID=8YFLogxK
U2 - 10.1109/BigDataCongress.2015.21
DO - 10.1109/BigDataCongress.2015.21
M3 - Conference proceeding contribution
AN - SCOPUS:84959531863
SN - 9781467372794
T3 - IEEE International Congress on Big Data
SP - 79
EP - 86
BT - Proceedings - 2015 IEEE International Congress on Big Data, BigData Congress 2015
A2 - Barbara, Carminati
A2 - Khan, Latifur
PB - Institute of Electrical and Electronics Engineers (IEEE)
CY - Piscataway, NJ
T2 - 4th IEEE International Congress on Big Data, BigData Congress 2015
Y2 - 27 June 2015 through 2 July 2015
ER -