TY - JOUR
T1 - Affinity Regularized Non-Negative Matrix Factorization for Lifelong Topic Modeling
AU - Chen, Yong
AU - Wu, Junjie
AU - Lin, Jianying
AU - Liu, Rui
AU - Zhang, Hui
AU - Ye, Zhiwen
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - Lifelong topic model (LTM), an emerging paradigm for never-ending topic learning, aims to yield higher-quality topics as time passes through knowledge accumulated from the past yet learned for the future. In this paper, we propose a novel lifelong topic model based on non-negative matrix factorization (NMF), called Affinity Regularized NMF for LTM (NMF-LTM), which to our best knowledge is distinctive from the popular LDA-based LTMs. NMF-LTM achieves lifelong learning by introducing word-word graph Laplacian as semantic affinity regularization. Other priors such as sparsity, diversity, and between-class affinity are incorporated as well for better performance, and a theoretical guarantee is provided for the algorithmic convergence to a local minimum. Extensive experiments on various public corpora demonstrate the effectiveness of NMF-LTM, particularly its human-like behaviors in two carefully designed learning tasks and the ability in topic modeling of big data. A further exploration of semantic relatedness in knowledge graphs and a case study on a large-scale real-world corpus exhibit the strength of NMF-LTM in discovering high-quality topics in an efficient and robust way.
AB - Lifelong topic model (LTM), an emerging paradigm for never-ending topic learning, aims to yield higher-quality topics as time passes through knowledge accumulated from the past yet learned for the future. In this paper, we propose a novel lifelong topic model based on non-negative matrix factorization (NMF), called Affinity Regularized NMF for LTM (NMF-LTM), which to our best knowledge is distinctive from the popular LDA-based LTMs. NMF-LTM achieves lifelong learning by introducing word-word graph Laplacian as semantic affinity regularization. Other priors such as sparsity, diversity, and between-class affinity are incorporated as well for better performance, and a theoretical guarantee is provided for the algorithmic convergence to a local minimum. Extensive experiments on various public corpora demonstrate the effectiveness of NMF-LTM, particularly its human-like behaviors in two carefully designed learning tasks and the ability in topic modeling of big data. A further exploration of semantic relatedness in knowledge graphs and a case study on a large-scale real-world corpus exhibit the strength of NMF-LTM in discovering high-quality topics in an efficient and robust way.
KW - Lifelong topic model (LTM)
KW - knowledge graph
KW - non-negative matrix factorization (NMF)
KW - semantic affinity
UR - https://www.scopus.com/pages/publications/85086477097
U2 - 10.1109/TKDE.2019.2904687
DO - 10.1109/TKDE.2019.2904687
M3 - 文章
AN - SCOPUS:85086477097
SN - 1041-4347
VL - 32
SP - 1249
EP - 1262
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 7
M1 - 8666058
ER -