TY - GEN
T1 - Modeling emerging, evolving and fading topics using dynamic soft orthogonal NMF with sparse representation
AU - Chen, Yong
AU - Zhang, Hui
AU - Wu, Junjie
AU - Wang, Xingguang
AU - Liu, Rui
AU - Lin, Mengxiang
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/1/5
Y1 - 2016/1/5
N2 - Dynamic topic models (DTM) are of great use toanalyze the evolution of unobserved topics of a text collectionover time. Recent years have witnessed the explosive growth ofstreaming text data emerging from online media, which createsan unprecedented need for DTMs for timely event analysis. While there have been some matrix factorization methods inthe literature for dynamic topic modeling, further study is stillin great need to model emerging, evolving and fading topicsin a more natural and effective way. In light of this, we firstpropose a matrix factorization model called SONMFSR (SoftOrthogonal NMF with Sparse Representation), which makes fulluse of soft orthogonal and sparsity constraints for static topicmodeling. Furthermore, by introducing the constraints of emerging, evolving and fading topics to SONMFSR, we easily obtain a novel DTM called SONMFSRd for dynamic event analysis. Extensive experiments on two public corpora demonstrate the superiority of SONMFSRd to some state-of-the-art DTMs in both topic detection and tracking. In particular, SONMFSRd shows great potential in real-world applications, where popular topics in Two Sessions 2015 are captured and traced dynamically for possible insights.
AB - Dynamic topic models (DTM) are of great use toanalyze the evolution of unobserved topics of a text collectionover time. Recent years have witnessed the explosive growth ofstreaming text data emerging from online media, which createsan unprecedented need for DTMs for timely event analysis. While there have been some matrix factorization methods inthe literature for dynamic topic modeling, further study is stillin great need to model emerging, evolving and fading topicsin a more natural and effective way. In light of this, we firstpropose a matrix factorization model called SONMFSR (SoftOrthogonal NMF with Sparse Representation), which makes fulluse of soft orthogonal and sparsity constraints for static topicmodeling. Furthermore, by introducing the constraints of emerging, evolving and fading topics to SONMFSR, we easily obtain a novel DTM called SONMFSRd for dynamic event analysis. Extensive experiments on two public corpora demonstrate the superiority of SONMFSRd to some state-of-the-art DTMs in both topic detection and tracking. In particular, SONMFSRd shows great potential in real-world applications, where popular topics in Two Sessions 2015 are captured and traced dynamically for possible insights.
KW - Dynamic Topic Model (DTM)
KW - Non-negative Matrix Factorization (NMF)
KW - Soft Orthogonality
KW - Sparse Representation
KW - Topic Detection and Tracking (TDT)
UR - https://www.scopus.com/pages/publications/84963510465
U2 - 10.1109/ICDM.2015.96
DO - 10.1109/ICDM.2015.96
M3 - 会议稿件
AN - SCOPUS:84963510465
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 61
EP - 70
BT - Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015
A2 - Aggarwal, Charu
A2 - Zhou, Zhi-Hua
A2 - Tuzhilin, Alexander
A2 - Xiong, Hui
A2 - Wu, Xindong
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th IEEE International Conference on Data Mining, ICDM 2015
Y2 - 14 November 2015 through 17 November 2015
ER -